Gene Cag_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1983 
Symbol 
ID3747362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2515296 
End bp2518592 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content47% 
IMG OID637774520 
Productalpha amylase domain-containing protein 
Protein accessionYP_380274 
Protein GI78189936 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCAAC CCGAACCCCT CTGGTACAAA GACGCCATTA TTTACGAGGC GCACGTTAAA 
ACCTTTTACG ATAGCGATAA TGATGGCATT GGCGATTTTC AAGGATTGCG CCAAAAGCTT
GGTTACTTGC AAAGTCTTGG TATTACGGCA ATTTGGTTGC TTCCCTTTTA TCCCTCGCCA
CTGCGTGATG ATGGATACGA TATTGCTGAT TACATGACGG TTAACCCCGA TTATGGCACT
ATGGATGATT TTCGTGCCTT TCTTGAGGAG GCGCATTCAT TGGGCTTAAA GGTGATTACC
GAGTTGGTTG TAAACCACAC CTCCGACCAA CATGCGTGGT TCCAGCGTGC GCGCCATGCA
CCAAAGGATT CGCCTGAGCG CAATTTTTAT GTGTGGAGCG ATGATCCCAA CAAATATTCC
GAAACGCGCA TCATTTTTCA AGATTTTGAA GCCTCTAACT GGACGTATGA TTCCGTTGCA
GGGCAATATT ATTGGCATCG CTTTTACCAC CATCAGCCCG ATTTAAATTT TGAAAATCCT
GCGGTTCATG CTGCCTTGCT CCATGTGCTT GACTTCTGGC TTGGCATGGG CGTTGATGGG
CTTCGTCTCG ATGCGGTGCC TTACCTGTAT GAGGAAGAGG GCACCAATTG CGAAAATCTC
CCCAGAACCT ATCAGTACCT GCGTGATTTA CGCTCTTATA TTGATGAAAA ATATCCCAAC
CGCATGTTGC TTGCCGAAGC CAATCAGTGG CCCGAAGATT CGGCAGCATA TTTAGGCAAT
GGCGATATGT GCCATATGAA CTTCCACTTC CCGCTCATGC CACGTATGTA CATGGCGTTA
GCAACGGAAG ATCGCTTCCC TATTCTTGAT ATTCTTGAGC AGACGCCCGA AATTCCAGAA
AGTTGCCAAT GGGCATCTTT TTTGCGTAAC CACGATGAGC TAACGCTTGA AATGGTAACC
GACGAAGAGC GCGACTACAT GCGCCGTGTG TATGCCAATG ATCCTCGTGC CCGCATTAAC
CTTGGTATTC GTCGCCGTCT TGCGCCGCTC ATGTCGAATG ATCGCCGCAA AATTGAGCTG
ATGAACATTA TGCTGCTCTC TTTGCCCGGC ACCCCTGTGC TTTACTACGG CGATGAAATT
GGTATGGGTG ATAACTTCTA CCTTGGCGAT CGTGATGGCG TGCGTACCCC AATGCAGTGG
AATGCTGACC GCAACGCTGG CTTTTCGCGT GCTAATCCGC AACGCTTGCA ACTGCCTGTT
ATTATTGATC CCGAATACCA TTACGAAGCC GTAAACGTGG AGGTGCAAGA GAGCAACATC
CATTCGCTTT TGTGGTGGAT GCGCCAAACT ATTTCCACAG CTCATCGTCA TAAAGCTTTT
AGCCGTGGCA CTATTGAGTT CCTACCCGTC AAAAATTCTA AGGTACTCTC TTTTATTCGT
CAATATGAAG ACGAAACCAT GCTCTGCGTT ATTAACCTTT CGAAAAATGC ACAGGCTGTA
ACAGTTGATC TTTCTCGTTT TAATGGTTAC ACGCCCGAAG AGGTTTTTAG CTTAAACCGT
TTCCCCAAAA TTAGAAGCAC GCCTTACATG TTGGCGCTTG GCGCTTATGG ATACTTCTGG
CTCAAATTAA TTAAGGAGGA AAAAGAGGTT GATCGCCATG CGTTGCTTGA TGGCTCTGTA
GTAAGCGTGA ACCGTTGGCA ATCACTCTTT ATTGGGAAAA ATCGCGAAAA GCTTGAAACG
GCTGTTTTTT CAAGCTATTA CATGGCAGCA CGCTGGTTTG GAGGCAAAGC ACGCACCATC
ATCCGCATCT CAATTACCGA TACCATTCCT ATTGCGAATG TAGCTAATAC CAAGTTGTTA
GTAACTGAAG TGCGCTATTC AAGTGGTGAA AATGAGAACT ATCAGTTGCC AGTAACCTTT
GTACCGCTTG CAAACCTTCA GCCATCCGAT GAGTACTTTA GCAAGCAAGT TATTGCTCGC
ATAACTGTTG GCGATGAAGA GGGTTACCTT TGCGATGCTA CCTTTACACC TGCATTTTTG
CAAGAGCTGT ATAGCGTTGC AACCGCTAAA GGCTCATGGC AAGGCAAACA AGGCGTGGTA
AACGGTAGTT CGGCTCCAAA GCTTGCCGCT TTTCTTGCCA ATGTAGCTGA TGCAGCGCCC
GAGCTGATGG GTGCAGAGCA AAGCAACACC TCAATTCGCT ATGCCGATAA TCTTTGCTTA
AAGCTTTATC GCCGCATTGA ATCGGGTGTT TCGCCTGAGG TTGAAATGTG CAGCGCCTTG
AGTGAGCGCA CAAGTTTTAC CAATTTGCCA ACCTATCTTG GCACAGTGAA CTATAGCCGC
AGCCGTAGCA GCCGCTGTTC CATTGGCATT TTACAAACCT ACGTGCCAAA CCAAGGCGAC
GCATGGCAAC TTTCGCTTGA CCAAGCACGC CGCTACTTCG ATGCCATCCA TTCAGCCTTA
CCAAATGCGC TTGCCATGCC AGCTTTACCT GCATTAAGTG GCAATCCAGC TCCACTGCCC
GAATTAATGC AAGAGCTTAT TGGGGGGCAT TATCTTGGTA TGATTGAAAA GCTTGCCGAG
CGCACTGCCG AAATGCACCT TGCCCTTGCA ACGCTCGAAA GCGATCCCGC TTTTGCGCCC
GAAGCCTTTA CTTCGCTTTA TCAGCGCTCC ATTTACCAAG CCATGTGCGA ACAAGTAAAG
CGTTCGGTTA TTTTAATTCG TGAGCTACTT CCATCCCTAA ACGGAGAGCA GCAAACGCTT
GCTACACAGT TCGTGCAAAA GCAAAAGCAA ATTCTGCAAC AGTTTGATCC TATTCGTACC
GAAAAAATTG AAGCCCTAAA AATTCGCATT CACGGCGATT ACCATCTTGG GCAGGTGCTC
TTTACGGGTA AGGATTTCAC CATTATTGAT TTTGAAGGTG AGCCAGCACG TCCGCTTTCG
GAGCGTAAAA TTAAGCGCTC AGTTTTTCGT GATGTGGCTG GAATGCTTCG CTCATTTGAT
TACGCCGCCT TTAGCGCATT GCGTCAAATT GCACCAACCC TTCGCCCCGA CGAGTTGCCA
ATGCTTGACG CATGGGCAGA GCGCTGGAGT TTTTACGTGG GGCAGCACTT TATTAACCGC
TATTTTGAAG CCACTAATGG TAGCTCTATT GTGCCCGTTG AGGCTCCACA GCGTGAGCAC
TTGCTGCGCG GCTACTTAAT GAACAAAGCG ATTTATGAGT TGAATTATGA GCTAAACAAC
CGTCCCGATT GGGCAGCAAT TCCATTACGT GGTATTTTAA AGCTCATAGA GCAATAA
 
Protein sequence
MYQPEPLWYK DAIIYEAHVK TFYDSDNDGI GDFQGLRQKL GYLQSLGITA IWLLPFYPSP 
LRDDGYDIAD YMTVNPDYGT MDDFRAFLEE AHSLGLKVIT ELVVNHTSDQ HAWFQRARHA
PKDSPERNFY VWSDDPNKYS ETRIIFQDFE ASNWTYDSVA GQYYWHRFYH HQPDLNFENP
AVHAALLHVL DFWLGMGVDG LRLDAVPYLY EEEGTNCENL PRTYQYLRDL RSYIDEKYPN
RMLLAEANQW PEDSAAYLGN GDMCHMNFHF PLMPRMYMAL ATEDRFPILD ILEQTPEIPE
SCQWASFLRN HDELTLEMVT DEERDYMRRV YANDPRARIN LGIRRRLAPL MSNDRRKIEL
MNIMLLSLPG TPVLYYGDEI GMGDNFYLGD RDGVRTPMQW NADRNAGFSR ANPQRLQLPV
IIDPEYHYEA VNVEVQESNI HSLLWWMRQT ISTAHRHKAF SRGTIEFLPV KNSKVLSFIR
QYEDETMLCV INLSKNAQAV TVDLSRFNGY TPEEVFSLNR FPKIRSTPYM LALGAYGYFW
LKLIKEEKEV DRHALLDGSV VSVNRWQSLF IGKNREKLET AVFSSYYMAA RWFGGKARTI
IRISITDTIP IANVANTKLL VTEVRYSSGE NENYQLPVTF VPLANLQPSD EYFSKQVIAR
ITVGDEEGYL CDATFTPAFL QELYSVATAK GSWQGKQGVV NGSSAPKLAA FLANVADAAP
ELMGAEQSNT SIRYADNLCL KLYRRIESGV SPEVEMCSAL SERTSFTNLP TYLGTVNYSR
SRSSRCSIGI LQTYVPNQGD AWQLSLDQAR RYFDAIHSAL PNALAMPALP ALSGNPAPLP
ELMQELIGGH YLGMIEKLAE RTAEMHLALA TLESDPAFAP EAFTSLYQRS IYQAMCEQVK
RSVILIRELL PSLNGEQQTL ATQFVQKQKQ ILQQFDPIRT EKIEALKIRI HGDYHLGQVL
FTGKDFTIID FEGEPARPLS ERKIKRSVFR DVAGMLRSFD YAAFSALRQI APTLRPDELP
MLDAWAERWS FYVGQHFINR YFEATNGSSI VPVEAPQREH LLRGYLMNKA IYELNYELNN
RPDWAAIPLR GILKLIEQ