Gene Cpha266_2537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2537 
Symbol 
ID4569727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2911348 
End bp2913426 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content51% 
IMG OID639767102 
Productalpha amylase, catalytic region 
Protein accessionYP_912949 
Protein GI119358305 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATAC ATGTTTTTTA TCCTTCAACC CTTTTCTCTG GTTTCCAATC TGAATTTCCG 
ATGATGAATG AAGTTTTTTT CCCCGCTACT CCTCACTATA TCAGCAAGGA TTTTGATGGT
CGCCGCCGTG TTGTTATTGA GCAGGTTTCC CCCGAGCTTG ATGGGGGAAA GTATGCTGTA
AAAGGCGTTG AGGGCGACCG AGTGGCTGTT GAGGCCGATA TTTTTACTGA TGGCGCCGAT
TCGATACGCG CAGAGCTTCT TTTTCGCCCG CTCGGCGAGT CTCTCTGGCA GCGGACGCCG
ATGGAGCCGA TTGGCAACGA TCGATGGACG GGGGCTTTTA TCGTTGGGGC TCCGGGTGGA
TATGTGTATA CCATCAAGGC GTGGATTGAT CATTTCAAGA CATGGAAGAG CGGTCTGAAG
AAAAAAATCG AGGCGGCTCA GGATGTTGCG CTTGATCTGA AAATCGGCGG CGCGCTGGTT
GAAAAGGGCG CTGCGAGGGC AGAAGGCGGC GATGCTGTTT TACTTGCCGC ATTTGCCGTT
ACCTTAAGCG GCGACGATGG TGAAAAAGCA CTTGATGCGG CTTTTGCTGT GGAACTTGAA
CAGCTTATGG ATCGAAATCC TGACGGCTCT ATGGCATCGA TGTATGATAA GGAGCTTCCG
GTTTCTGCTG AACCGAAAAG AGCGGCTTTC AGCTCATGGT ATGAGTTTTT TCCTCGTTCC
TGGGCTTTGG AACCGGGAAA GCACGGGACT TTCAGGGAGT GCCAACGGCT TTTGCCGCTG
ATTGCTGGTA TGGGGTTTGA TGTGATCTAT CTGCCGCCCA TTCATCCGAT AGGAAGGAGT
AAACGTAAAG GGAAGAACAA TGCGCTGGTT GCCGCACCTC TCGATCCCGG AAGTTGCTGG
GCCATAGGGA GCAGTGATGG CGGCCACAAG GCGGTGCATC CCGAGCTTGG CACGATTGAT
GATTTCAGGG TTTTTGTTGC CGAGGCTCAA AGCGAGGGTC TTGCCGTTGC GCTTGATATC
GCTTTCCAGT GTTCACCCGA TCATCCCTAT GTCAGGGAGC ATTCGCAGTG GTTCAAATGG
CGGCCGGACG GTACTGTGCA GTTTGCTGAA AATCCTCCGA AGCGTTATGA GGATATTCTT
CCTATCGATT TTGAGACTTC CGATTGGCAG AATCTCTGGA TAGAGCTGAG GAGTATTTTT
CTTTTCTGGA TAGAACAGGG TGTCAGTATT TTTCGCGTTG ATAATCCGCA TACCAAGGCG
TTTCCGTTCT GGGAGTGGGC TCTCGCTTCT ATCCGTTCGG AGCATCCCGA TACCATTTTT
CTTGCCGAGG CTTTTACCCG TCCCCGATTG ATGGAGCGTC TTGCAAAAGC AGGATATACC
CAGTCCTACA GCTATTTCAC CTGGAGAAAT ACCAAGCACG AGATCGAGGA GTACCTGGGA
GAGCTTGCCA GTGCTCCGCT GAAATATTAT ATGAGGCCGA ATTTCTGGCC TAATACGCCG
GATATTCTGC ATGAAGAGTT GCAGACCGGG GGTCGTCCAA AGTTTCTTAT TCGTCTGTTT
CTTGCGGCTA CGCTCTCGTC GAATTACGGG ATGTACGGCC CGGCTTATGA ACTCTGCGAG
CATCTGCCTG TTGGCGAAGG TCGGGAGGAG TATCTCGATT CGGAGAAATA CGAGATCAAG
CAGTGGGATA TTGATCGCCC AGGCAATATT CGTGCGGAAA TTACCCGAAT CAACCAGATC
AGAAAAGCAA ATCCGGCATT GCAGAGGACG GACAATATCA CTTTTGTGCG CGTTGAGGCT
TCCGCTGGCG TGGAGCACCA GAAAATGATC TGTTATGTGA AGCGTTCCCC CGATGATCGT
AACGTGATTC TTTCGGTGGT CAATCTTGAT GCTTCTTCGA CGCATGGCGG CTGGCTGCGT
TTTCCTCTTG AACTGTTCGG GCTTTCTCAT GATCACCATT TTATGGTGGA GGATCTGCTT
TCGGGTAAAT CGTTCAACTG GAACGGTGAA TGGAACTTTG TCGAGCTTAA TCCTCACGAT
ATGCCGGCGC ATGTGTTCAG GGTGGAGCTC TTCCTTTGA
 
Protein sequence
MPIHVFYPST LFSGFQSEFP MMNEVFFPAT PHYISKDFDG RRRVVIEQVS PELDGGKYAV 
KGVEGDRVAV EADIFTDGAD SIRAELLFRP LGESLWQRTP MEPIGNDRWT GAFIVGAPGG
YVYTIKAWID HFKTWKSGLK KKIEAAQDVA LDLKIGGALV EKGAARAEGG DAVLLAAFAV
TLSGDDGEKA LDAAFAVELE QLMDRNPDGS MASMYDKELP VSAEPKRAAF SSWYEFFPRS
WALEPGKHGT FRECQRLLPL IAGMGFDVIY LPPIHPIGRS KRKGKNNALV AAPLDPGSCW
AIGSSDGGHK AVHPELGTID DFRVFVAEAQ SEGLAVALDI AFQCSPDHPY VREHSQWFKW
RPDGTVQFAE NPPKRYEDIL PIDFETSDWQ NLWIELRSIF LFWIEQGVSI FRVDNPHTKA
FPFWEWALAS IRSEHPDTIF LAEAFTRPRL MERLAKAGYT QSYSYFTWRN TKHEIEEYLG
ELASAPLKYY MRPNFWPNTP DILHEELQTG GRPKFLIRLF LAATLSSNYG MYGPAYELCE
HLPVGEGREE YLDSEKYEIK QWDIDRPGNI RAEITRINQI RKANPALQRT DNITFVRVEA
SAGVEHQKMI CYVKRSPDDR NVILSVVNLD ASSTHGGWLR FPLELFGLSH DHHFMVEDLL
SGKSFNWNGE WNFVELNPHD MPAHVFRVEL FL