Gene Cag_1788 details

Gene Information

Locus tag: Cag_1788
Symbol:
ID: 3747208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2308541
End bp: 2309782
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 47%
IMG OID: 637774326
Product: 3,4-dihydroxy-2-butanone 4-phosphate synthase
Protein accession: YP_380082
Protein GI: 78189744
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
        [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
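
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2308541
end_bp = 2309782
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one stop codon
# that does not appear in the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length is consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions the span is 1242 bp, which is 414 codons, or 413 residues plus the stop codon, in agreement with the listed gene and protein lengths.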


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACACAG CTATTGATTC GATTGATGCC GCTCTTGAAG ATATTCGGCA GGGTAAATTG 
GTGATTGTTA TTGATGATGA AGATCGAGAA GATGAGGGTG ATTTTATTGG CGCTGCCGAT
TTAGTTACCA CTGAAATGAT CAACTTTATT ACGCGCGAAG CTCGTGGCTT ACTGTGCGTT
GCCGTAACCA TGGAGCGAGC AAAAGAGTTA CAGCTTGACC CTATGGTGCA GCGCAACACA
TCGCAACACG AAACCAACTT TACTGTTTCG GTTGACGCTA TTGCTGAAGG CGTTACCACC
GGTATTTCCG TGTATGACCG CACCATGACC ATTAAAATGT TAGGCGATCC CTCCACCAAA
GCGGATGACT TTTCACGTCC CGGACACATT TTCCCTCTTC GAGCTATGAA TGGTGGTGTG
CTTCGCCGCG TTGGGCACAC CGAAGCGGCA GTTGACCTTG CTCACCTTGC TGGACGCTCA
CCCGTTGGCT TGCTCTGCGA AATTCTTAAT GAGGATGGCA GCATGGCGCG TTTGCCTGAG
CTTATTAAAC TCAAGGAGAA GTTCGGCTTA AAGCTCATTA CCATTAAGGA TTTAGTTGCC
TACCAAATGC AGCGTAATGC GTTAGTAAAG CGTGCCGTTG AATCGCGCTT ACCAACCGCT
TATGGCGAAT TTAAACTCAT TGCTTACGAT TCATTTATTG ATCACCACAA CCATATTGCC
TTTATAAAAG GGGATGTATC CACCGATGAA CCCGTGTTGG TGCGCGTCCA TTCACAATGC
GCTACGGGCG ACACCTTTGC CTCACTCCGT TGCGATTGCG GGCATCAACT TGCCTCAGCA
CTTACCATGA TTGAAAAGGA GGGGCGTGGC GTGCTGGTTT ATTTAATGCA AGAGGGGCGT
GGTATTGGTT TAGTCAATAA GCTGAAAGCC TACAACTTGC AAGATGAAGG GCTTGATACC
GTTGAAGCAA ACGAAAAGCT TGGCTTTAAA GCCGACTTGC GTGATTACGG CATTGGCGCT
CAAATTCTTA AAGATCTTGG CATTCGTAAA ATGCGCTTAA TGACCAACAA CCCGAAAAAA
ATTGTCGGGC TTGAAGGGTA CGGACTGGAA ATTGTAGAGC GTGTACCTAT TGAAATAGCA
CCTAACGCCG TGAATGAAAG CTACTTGCAA ACCAAGCGCG ATAAAATGGG GCACATGCTT
GGTTGTTCAT GCAGCTCAAC AGCTTCGCAT ACGCATAAAT AA
 
Protein sequence
MHTAIDSIDA ALEDIRQGKL VIVIDDEDRE DEGDFIGAAD LVTTEMINFI TREARGLLCV 
AVTMERAKEL QLDPMVQRNT SQHETNFTVS VDAIAEGVTT GISVYDRTMT IKMLGDPSTK
ADDFSRPGHI FPLRAMNGGV LRRVGHTEAA VDLAHLAGRS PVGLLCEILN EDGSMARLPE
LIKLKEKFGL KLITIKDLVA YQMQRNALVK RAVESRLPTA YGEFKLIAYD SFIDHHNHIA
FIKGDVSTDE PVLVRVHSQC ATGDTFASLR CDCGHQLASA LTMIEKEGRG VLVYLMQEGR
GIGLVNKLKA YNLQDEGLDT VEANEKLGFK ADLRDYGIGA QILKDLGIRK MRLMTNNPKK
IVGLEGYGLE IVERVPIEIA PNAVNESYLQ TKRDKMGHML GCSCSSTASH THK
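
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, and the codon assignments are those of the standard genetic code, which translation table 11 shares for amino acid assignments (the two tables differ in which start codons are permitted). The start codon is translated by table lookup rather than forced to methionine, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCACACAG CTATTGATTC GATTGATGCC"))
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the listed protein sequence once its spaces are removed.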