Gene Cag_1984 details

Gene Information

Locus tag: Cag_1984
Symbol:
ID: 3747363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2519000
End bp: 2521015
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 47%
IMG OID: 637774521
Product: alpha amylase domain-containing protein
Protein accession: YP_380275
Protein GI: 78189937
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
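
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; helper names are illustrative assumptions.
START_BP = 2519000
END_BP = 2521015
GENE_LENGTH_BP = 2016
PROTEIN_LENGTH_AA = 671


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2016
print(expected_protein_length(GENE_LENGTH_BP))  # 671
```

Both results agree with the Gene Length and Protein Length fields.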


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTAC ATTCTACCTT CGAGCACACA TTGCCCTTAT TTTGCCAGCA GGCGTTTGAC 
GGTCGTCAGC GGGTTATTAT TGAAAATATT TCACCTGAAA TTGATGGTGG TAGCCATCCC
GCAAAAGCGG TTGCGGGCGA GCTTGTTGAG GTTGAGGCTG ATATTTTTGT GGATGGTGCC
GATACCATTT CGGCAATGTT GTGCGCTCGT CCTATGGGGA GTAGCGAATG GCAGCAAAGC
GCTATGCAGC CTTTAGTAAA CGACCGATGG CAGGGGGCGT TCCGTGTGGG CGAAGCGGGC
GTGTTTGAAT ATACCATTAC GGCATGGGTT GACCATTTTC AAACGTGGCG TAAGGGATTT
ATTAAAAAAG TTGATGCAGG GCAGAATGTT TCTTTGGAGT TGCAAATAGG TACTATTCAC
CTTGAAAAAG CGGCGCTCCG TGCCACTAAA AGCGATGCTG AGCTGCTTCA TGTGTTGGTG
CAGCGAATTA GCACCGCCGA TGATGCTGAG GCAATTGCGT TAATCACCTC TGATGCGCTT
GCTCAAGTAA TGGAGCGCAA CCCCGATACG TCGTTAGCAA CAACATACCA AAAAGTGCTA
CGCGTAACGG TTGAGCAAAC CAAAGCGGGA TGCAGTGCGT GGTATGAATT TTTCCCACGC
TCGTGGTCTG AAATTCCCGG CAAGCATGGC ACCTTTAACG AGTGCCTCCG CTTGTTGCCA
CTTATTGCAG GCATGGGCTT CGATGTTATT TACCTTCCCC CTATTCATCC CATTGGCTAC
GCTAAACGCA AAGGGCGCAA TAATTCGCTT GTTGCGTTGC CCGATGATCC CGGTAGTTGC
TGGGCAATTG GCAATAGCGA TGGTGGGCAT AAAGCGGTTC ATCCCGAACT TGGCACGCTG
GAGGACTTTA CTGCTTTTGT GCAAGCGGCT GAGGCGCAAG GTATTTCGGT GGCGCTCGAT
ATTGCATTCC AATGCTCACC CGATCACCCT TACGTTCAAG AGCATCCACA GTGGTTTACA
TGGCGCCCCG ATGGCACGGT GCAGTTTGCT GAAAATCCGC CGAAGCGCTA TGAAGATATT
CTTCCCATCA ATTTTGAAAA TGATGATTGG CAAAATCTCT GGATTGAGTT GCGGAGCATT
TTCCTCTTTT GGGTTGAACG AGGCGTAAAA ATTTTCCGTG TGGACAACCC TCATACCAAA
GCATTTCCAT TTTGGGAATG GGCAATTCGT ACCATTAGAG CTGAGCACCC CGACACCGTA
TTTCTTGCAG AAGCCTTTAC GCGCCCTAAG CTTATGGCGC GCTTAGCAAA AATCGGTTAT
AGCCAATCCT ACAGCTACTT TACATGGCGT AACACGAAGC ATGAGTTGCA GGAGTATGTA
ACGGAACTCA CTTCCGAGCC ATTAAAGCAT GTTATGCGTG CCAACTTTTG GCCAAACACG
CCCGATATTT TGCATGATGA GTTCCATAAT GGCGAGCGCG AAAAGTTCAT TATTCGCCTT
GTGCTTGCCG CTACACTTTC AGCAAATTAT GGCATGTATG GACCAGCGTA TGAATTGTGC
GAGCATGTGC CGATAGCGCA TGGTAAAGAG GAGTATCTTG ATTCCGAGAA GTATGAAATT
AAGCAGTGGG ATATGGATCG TCCGGGCAAC ATTCGGGCTG AAATTACGGC AATCAATCGT
ATTCGTAAAG AGAATCCCGC TTTGCAGCAA ACCGCTGATA TTAGCTTTTT GCACATTGAT
GCCTCTCCCG GTAATGAGCA CAATATGCTG ATGGCATACG TCAAACGTTC CGAGAATGAT
GCCAATATCA TTTTAGTGGT TGTGAATCTT GATCCTATTA CCACACAACG TGGATGGCTT
CGCTTTCCGT TAGAGCAATT TGGATTAACG CATTTACACC GTTTTCATGT TGAGGATTTG
CTGAGTGGGC AGTGCCATAC ATGGCATGGC GAGTGGAATT ATGTGGAATT AAATCCTCAT
GTTATGCCTG CTCATATTTT TAAAATTTCG CTTTAA
 
Protein sequence
MTLHSTFEHT LPLFCQQAFD GRQRVIIENI SPEIDGGSHP AKAVAGELVE VEADIFVDGA 
DTISAMLCAR PMGSSEWQQS AMQPLVNDRW QGAFRVGEAG VFEYTITAWV DHFQTWRKGF
IKKVDAGQNV SLELQIGTIH LEKAALRATK SDAELLHVLV QRISTADDAE AIALITSDAL
AQVMERNPDT SLATTYQKVL RVTVEQTKAG CSAWYEFFPR SWSEIPGKHG TFNECLRLLP
LIAGMGFDVI YLPPIHPIGY AKRKGRNNSL VALPDDPGSC WAIGNSDGGH KAVHPELGTL
EDFTAFVQAA EAQGISVALD IAFQCSPDHP YVQEHPQWFT WRPDGTVQFA ENPPKRYEDI
LPINFENDDW QNLWIELRSI FLFWVERGVK IFRVDNPHTK AFPFWEWAIR TIRAEHPDTV
FLAEAFTRPK LMARLAKIGY SQSYSYFTWR NTKHELQEYV TELTSEPLKH VMRANFWPNT
PDILHDEFHN GEREKFIIRL VLAATLSANY GMYGPAYELC EHVPIAHGKE EYLDSEKYEI
KQWDMDRPGN IRAEITAINR IRKENPALQQ TADISFLHID ASPGNEHNML MAYVKRSEND
ANIILVVVNL DPITTQRGWL RFPLEQFGLT HLHRFHVEDL LSGQCHTWHG EWNYVELNPH
VMPAHIFKIS L
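
The sequences above are printed in blocks of ten residues, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that clean-up plus a GC-percentage calculation. The function names are hypothetical, and only the first line of the gene sequence is used as a demonstration input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGACCTTAC ATTCTACCTT CGAGCACACA TTGCCCTTAT TTTGCCAGCA GGCGTTTGAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Applying the same two functions to the full gene sequence would allow its length and GC percentage to be compared with the Gene Length and GC content fields.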