Gene Cpha266_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1085 
Symbol 
ID4570029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1226746 
End bp1228695 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content48% 
IMG OID639765682 
Productalpha-amylase family protein 
Protein accessionYP_911550 
Protein GI119356906 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0151666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAA ACAGCACACC ACATCAAACA ACAAACCTTA AGCTCGTTCT TCATGCATTG 
GAGCAACTGC CTGAACTGAA ACCCGGCCAG CCATATTACA TTCCCGGACT CTGGAATGGG
GGTGATAGTG GTTTTGAACC GGTTCAGCCG GGAAAATACT TTGCCGATAT CATTACGAAC
ATGTTATCGG GCAACGCAAA CGAAATCCGC AACCCTTTGC AATCTTCAGG GAACTGGACG
CCGGATGCTG TCGTGTATAA CCTGTTTATA CGGCTCACCA CTGCATATGA TCACAATAAT
GATGGCATTG TCAATACCGA ACCTCTTGAA AATGGCTTCA GGGAAACAGG AACGCTCCTC
AAGGCAACAG GCCTGTTACC CTACATTAAA AAACTCGGTG TCAACACGCT TTACCTCCTT
CCGGTTACGT TGACCGGATC CGATGATAAA AAAGGCTCCC TCGGCTCCCC TTACGCAGTA
AAAGACCCTT TTGCCATAGA CCCTATGCTC GACGAACCGG CTCTCGGGCT TTCGGCTGAC
ATCCTCCTGA AAGCGTTTGT CGAAGCCGCT CATCTGCTCA ACATGCGTGT TCTTTTTGAG
TTCGTCTTTC GAACAGCATC AGTTGACAGC AACTGGATTC AGGATCATCC GGACTGGTTT
TACTGGATTT CAGAGACTGG GTGCGCAACG AAGTATGGTC CGCCGCACTT CAGCGATGAA
ATCCTTCACA CCATTTATGA AAAGGTCGAC AAGCATGACC TGAACAATCT GCCGGCTCCG
GAAGCCGATT ACAAAAAAAT GTTTACCTCC GTTCCTGTCG CCATCCACAA GAAGGATGAA
AAACTCAGAG GAATCACAAA ACAGGGAGAA GAGTGCAGAA TTGCCAGCGC ATTTTCAGAC
TGGCCGCCGG ATGACAGGCA GCCGCCATGG ACCGACGTAA CCTATCTGAA AATGCACAAC
CATCCGGAAT TCAACTATAT CGCGTACAAC ACCATCCGGA TGTATGACGT TGCTCTCGAT
AATCCCGAAT ACCGGAATAA TCTCCTCTGG GAAACAATAG AAGCGATCAT CCCCCATTTT
CAGGAGAACT ATTGCATTGA CGGCGCAATG ATCGACATGG GGCATGCGCT GCCATCAGCA
CTCAAACACT CCATCGTCAA ACGTGCGCGA CGCAACAATC CGAATTTCGC ATTCTGGGAT
GAAAATTTTG ACCCGACGCC ATCCGTCAGG GATGAAGGGT TCAATGCCGT ATTCGGCTCC
CTTCCCTTTG TTATTCACGA TCCTGTTTTT ATACGAGGAT TGCTCAATTA TCTCAATAAA
ACCGGTGTAG CACTTCCTTT TTTTGGTACG GGAGAAAACC ACAACACTCC GAGAGTATGT
CACGGCTATC CCGGTATGGA AACTGGCAGA AACAGGTCTT CCTTCATTTT CACCCTCTGC
TGTATTCTGC CGGCAATCCC GTTTCTACAC TCAGGCATGG AGCTCTGTGA ATGGCATCCG
GTTAATCTCG GACTGAACTT CACCGCTGAA GACAGGGAAC GCTTTCCGTC AGACAAGCTC
CCGCTTTTCA GTGCGTTCAG TTACGATTGG AAAACAGCCA ACGGCCTTAA GACTCTCAAC
AGCTACATCA GCAAGATCCT GTCCATAAGA GCCAACTATC GTGAACTTGT TCAGTGTATG
GATAAGGGTT CAATGATTCT TCCCTACATC ACCAATCCTG AACTTTTGGC CGTCATGAGA
AAAAACGGCG ACACAACGCT GCTCTTCATT GGAAACAGCA ATGGATCTGA AGCGCAGAAT
GGCATCATTG AATTCGGCAT CACTGACGCA ATCCTCTTTG ATCTTATCGA AGAAAAAGAG
TACCCTGTTT CAAATCACTC CCTGAGCTTG CATTTCAAGC CGGGTCAAAG TATGCTTTTC
GAACTTCCTG TAGAGGAAAA ACCATCGTAG
 
Protein sequence
MKQNSTPHQT TNLKLVLHAL EQLPELKPGQ PYYIPGLWNG GDSGFEPVQP GKYFADIITN 
MLSGNANEIR NPLQSSGNWT PDAVVYNLFI RLTTAYDHNN DGIVNTEPLE NGFRETGTLL
KATGLLPYIK KLGVNTLYLL PVTLTGSDDK KGSLGSPYAV KDPFAIDPML DEPALGLSAD
ILLKAFVEAA HLLNMRVLFE FVFRTASVDS NWIQDHPDWF YWISETGCAT KYGPPHFSDE
ILHTIYEKVD KHDLNNLPAP EADYKKMFTS VPVAIHKKDE KLRGITKQGE ECRIASAFSD
WPPDDRQPPW TDVTYLKMHN HPEFNYIAYN TIRMYDVALD NPEYRNNLLW ETIEAIIPHF
QENYCIDGAM IDMGHALPSA LKHSIVKRAR RNNPNFAFWD ENFDPTPSVR DEGFNAVFGS
LPFVIHDPVF IRGLLNYLNK TGVALPFFGT GENHNTPRVC HGYPGMETGR NRSSFIFTLC
CILPAIPFLH SGMELCEWHP VNLGLNFTAE DRERFPSDKL PLFSAFSYDW KTANGLKTLN
SYISKILSIR ANYRELVQCM DKGSMILPYI TNPELLAVMR KNGDTTLLFI GNSNGSEAQN
GIIEFGITDA ILFDLIEEKE YPVSNHSLSL HFKPGQSMLF ELPVEEKPS