Gene Hhal_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1556 
Symbol 
ID4710921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1691439 
End bp1692575 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID639856020 
Productcellulose biosynthesis protein CelD 
Protein accessionYP_001003122 
Protein GI121998335 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCCCT CGGTTCAGCG ACAGCCTTTC GCGCAAGTGG CCTCTGCCTG GGATCGCCTG 
GCGCAAGATG CTCTCCCGAA TCCGTTCCTG ACGACGGCCT GGCACCGAAC CCTCCACAAA
TTAGCTCCAG ACGACCCGGT CCCCCCGGCC CTGGAAACCG TCCTGTACCA GTGGGGCGGC
GAGCCGAGAG CCTTGGCCAC GCTCGGCAGG GCCAGGGTTC GTCGGGCCCT TGTTTTCAGC
AGTCGGATGC TCTTTCTCAA CGAGACGGGC GATCCGCGCC TTGACTACTT GACGGTCGAG
CACAACGCGC CGCTGGCCCC GGTGGGGGCC GAGGCGAAGG CGTTCGCCGG CATGGTCGAG
GGTCTGCTCA CGGACACAGA CTGGGACGAA CTCTGCCTGG GGTGGGTCGA GGCGGATCGT
TGGCGGGCGT GCTGGTTGGA GTGTTCCCAC TTGCCGCTGA TGCCGGTGGT GATAGATCGT
CGACCCTACT ATTTTCGCGG GTTGCAGTCT CGGGATGCCA GGCCGGATCA ACTCCTCAGC
AGTCTCAGCA GCAACACGCG GCAACAGATT CGGCGGTCGA TCCGGCAGTA CGGTGGGCTC
GACGCCCTCG CCTTTGAGGT CGCCACGGAC CCGGCCATGG CCGTGCGGTG GTTCGAGCAT
ATGGTCGAGC TGCATCAAGC GCGCTGGCAG GCGCAGGGCA AGGTCGGCGC TTTCGCCGAT
CCGTTCATGC GTGCATTCCA TGAGCACCTA ATCGAGGCGG GCGCCCAGGA TGGTAGCGCT
CGCATGATCC GGGTGCAGAC CTCGGAGCGG GTGATTGGCT ATCTCTACAA CCTGCGGGCG
GGGGGCTATG AGTGCAATTA CCAGAGTGGC CTCGCTTATG AGGCAGATCC CCGCAGCAAG
CCGGGGCTCG TGAGCCATAT CCTCGCCATG GCGGCCGCGG CGGAGACCGG GGTCCACTGC
TACGATTTCC TCGTTGGTGA GAGCCAGTAC AAGCGCAGCC TGGCTAGCGG GAAGGGAGAG
ATGCTGCGGG TCTCCCTGCA GAGGCGGCGA CCGATGCTGT GGTTGGAGCG GCAACTTCGC
GCAGCCCGGG ATCGGATCCT GCAAAAGAGG GGGCGCGAAC GAAAGGAGGC GTGGTAA
 
Protein sequence
MEPSVQRQPF AQVASAWDRL AQDALPNPFL TTAWHRTLHK LAPDDPVPPA LETVLYQWGG 
EPRALATLGR ARVRRALVFS SRMLFLNETG DPRLDYLTVE HNAPLAPVGA EAKAFAGMVE
GLLTDTDWDE LCLGWVEADR WRACWLECSH LPLMPVVIDR RPYYFRGLQS RDARPDQLLS
SLSSNTRQQI RRSIRQYGGL DALAFEVATD PAMAVRWFEH MVELHQARWQ AQGKVGAFAD
PFMRAFHEHL IEAGAQDGSA RMIRVQTSER VIGYLYNLRA GGYECNYQSG LAYEADPRSK
PGLVSHILAM AAAAETGVHC YDFLVGESQY KRSLASGKGE MLRVSLQRRR PMLWLERQLR
AARDRILQKR GRERKEAW