Gene Cphamn1_0371 details


Gene Information

Locus tag: Cphamn1_0371
Symbol:
ID: 6374033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 391883
End bp: 393697
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 49%
IMG OID: 642682890
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001958819
Protein GI: 189499349
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
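
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(391883, 393697)
print(length)                              # 1815
print(expected_protein_length_aa(length))  # 604
```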


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGCAGA AAACAATGCT TTCGCTGCCC TTAAGGGTAA AGCAGACTCT TGCGGTTATG 
CAGGATAGTC TGGCGGTAGT GTTTTCCGTA TGGCTCGCCT ATTCGCTTCG TTTTGAGGTA
TGGCATGTTC CTCAAAAAGC ACAATGGCTG GTATATGGTA TCGCTCTGGC TATTTCTCTG
CCTGTTTTTT ATGGCACCAG CCTCTACAAG TCAGTCTTTC GCTTTAGTGA GGCTATGGCA
TTTAAGCAGA TAACAAAAGC TGCGGCACTC TATGCGGTGC TGTTTTTTTT TACTCTTGTC
TTGTTCAAAA TAGATGGGGT CCCGCGTTCA ATCGGAATAA CGCAGCCAAT AATGCTTTTT
CTGCTGCTTC TCGCAAGCAG AGGCGCAGCC CGTTTTTTTC TGAACACCCG CTTGCAGTTG
GGTCATTGCA GTACTGCCGA CAAGCGGTTG CTGATCTACG GTGTCGGATC GGCAGGAATT
CAGCTCGCGT CAGCCATCGA ACAAACTACC CGCCACCTTC TTATCGGTTT TATCGATGAT
GATCCCAAAC TGCATGGACG GATGGTAAAA GGCCTGAAGG TATTTTCTTT TGGCCAGGTG
GCCCGGCTTG TAGAACAGGC GACGGTTACC GACATTCTGC TGGCGATACC CTCTGCAAGC
CGTTCAAGGC GCAATCAGAT TTTGCAGGCG TTACAGCCGT TTCCCGTGCA CGTGCAGACA
CTTCCAAGCC TTGAGGATTT GACAGATGGC AATATTTCAG TCAGAGATGT AAAAGAGATT
GAAATTGAAG ACCTGCTTGG CCGTGATCCT GTTTCTTCCG GGCCATCTCT TTTCAGGCGT
AACATTACCG GCAAGACCGT AGTTGTAACC GGTGCGGGGG GGAGTATCGG GAGAGAGTTG
TGCAGGCAGA TACTTATCGG GTGCGCCGAT AAGCTCATTA TGATTGATCA TGCTGAATTC
AATCTGCATG ACGCATATCT CGAGCTGGAA GGGTACAGAG AAAGAAAACA GCAGGACACG
GAGATTGTGC CTCTTTTGTG CAATGTGGCC GAGAACCATC GATTCAGCGC AATCTGCTCT
TCATATCACC CGCATACCAT TTACCATGCT GCGGCGTACA AACATGTTCC CATGGTTGAG
CGCAATCCTG TCGAAGGCGT TCGGAATAAT GTTTTCGGGA CTCTTCGTTC AGCTCTTGCC
GCGCAGGAAT ACGGCGTTGA AAGTTTTATC CTGGTCAGTA CCGATAAGGC TGTTCGTCCA
GCAAATTTCA TGGGGGCGAG CAAGCGATTG TGTGAGATTA TTCTTCAGGC GCTATCTGCT
GAAGAGAACG GTAGCGGCAC GTGTTTCTCT ATGGTCCGTT TTGGTAATGT GCTGGATTCA
AGTGGTTCTG TAGTGCCGCT TTTTCGCCAG CAGATAAAAG ACGGCGGGCC TGTAACCGTA
ACACATCCGG AAATAACCCG ATATTTTATG ACTATTCCGG AAGCTGTGCA GCTTGTTATT
CAGTCAGGCG CGATGTCTCT GGGCGGAGAG GTTTTTGTCC TTGATATGGG TGAGCCTGTC
AAGATAATTG ATCTTGCGAA ACGCATGATC GGATTGTCAG GTTTAACGGT ATTGGACAGG
AATAACAGAA AAGGTGATAT TCCTATAGAA GTAATCGGCC TGCGTCCGGG GGAAAAGCTC
TATGAAGAGC TGTTGATAAG CGGCGATCCT CAGCCGACTG ATCACCCGAG GATATTCAAG
GCGCATGAAG AGTTTATTGC ATGGCCGGCA CTTCAGGATG AACTTTCGGA GATGGAGCAG
GTTCTTAATA CCTGA
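
The GC content reported in the record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in whitespace-separated blocks of ten bases, and that GC content means the percentage of G and C among all bases. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence, used only as a small example.
print(gc_percent("GTGCTGCAGA"))  # 60.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value that can be compared with the GC content field.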
 
Protein sequence
MLQKTMLSLP LRVKQTLAVM QDSLAVVFSV WLAYSLRFEV WHVPQKAQWL VYGIALAISL 
PVFYGTSLYK SVFRFSEAMA FKQITKAAAL YAVLFFFTLV LFKIDGVPRS IGITQPIMLF
LLLLASRGAA RFFLNTRLQL GHCSTADKRL LIYGVGSAGI QLASAIEQTT RHLLIGFIDD
DPKLHGRMVK GLKVFSFGQV ARLVEQATVT DILLAIPSAS RSRRNQILQA LQPFPVHVQT
LPSLEDLTDG NISVRDVKEI EIEDLLGRDP VSSGPSLFRR NITGKTVVVT GAGGSIGREL
CRQILIGCAD KLIMIDHAEF NLHDAYLELE GYRERKQQDT EIVPLLCNVA ENHRFSAICS
SYHPHTIYHA AAYKHVPMVE RNPVEGVRNN VFGTLRSALA AQEYGVESFI LVSTDKAVRP
ANFMGASKRL CEIILQALSA EENGSGTCFS MVRFGNVLDS SGSVVPLFRQ QIKDGGPVTV
THPEITRYFM TIPEAVQLVI QSGAMSLGGE VFVLDMGEPV KIIDLAKRMI GLSGLTVLDR
NNRKGDIPIE VIGLRPGEKL YEELLISGDP QPTDHPRIFK AHEEFIAWPA LQDELSEMEQ
VLNT
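
The protein sequence begins with M although the gene sequence begins with GTG, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this with a hand-built codon table. It assumes table 11 assigns the same amino acids as the standard code, and it treats only ATG, GTG and TTG as start codons, which is a subset chosen for illustration. The names are not part of the record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Illustrative subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTGCAGA AAACAATGCT TTCGCTGCCC"))  # MLQKTMLSLP
```

The same function applied to the last 15 bases of the gene sequence, `GTTCTTAATACCTGA`, returns `VLNT`, which matches the end of the protein sequence.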