Gene Cpha266_0431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0431 
Symbol 
ID4569220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp477245 
End bp480748 
Gene Length3504 bp 
Protein Length1167 aa 
Translation table11 
GC content51% 
IMG OID639765031 
Productalpha amylase, catalytic region 
Protein accessionYP_910913 
Protein GI119356269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCA ATAACTATCA GCCCGCTCCG CCCGGAAAAG AACATTTTTA TCTGAACCGT 
CAGGCAAGAG AACTCTTCAA GCAGCGTGAC CCCGAATTTC TTGTTCTCTT CGACCGTTCG
GGAGATAGTC TGGCACATGC CAGAGAACAG ACCGCAATCA TCAACAGTTT CCTGAAGAAA
AAGAAAGGTG ATCAGGCAAA ACTGATACTC CCGGCACAGT TTCATGGCAT GAAACTGCTT
CACGAGGTAT TTCATGCACT TATTGCGCAA TCGGTTGCCA GTCGAAAGCC GGATTTTTTT
AAAACCGTGG ACGACAACTT CCATCGCGAA CTCTCACAAA ACCGGGCCAA TGAGTACGAA
AAAAGGTTTC TCGATGATTT CCCTCCCGAC GCTCTCTTTC GCGGCGTTCA GTCAACCGAA
GAGTACCTTT CGATTCCGGA AAAGAGAGAG CAACTGATCG AGGAGTCTTT TCTGGTATGG
TTGAATAATC AGAATCCCGC CCTTGGCCAG TTTGAAGATC TTATCACCGA CAGGAACCTT
GTAAAGGACG ATGCCTATCT CCAGCTTATC CGCTCTCTCA AGGGCTCAAT TCACGATATG
GGAGCAGTTG GCCCCGGCGA TATTGATCTC CTTGAACTTC TCTGGCTTCC GATCAGGCAT
TCGCCCGATT CGATCCTTGA ACAGCTCCGT TATATCAGGC TTCACTGGGC GGACTTGCTT
GAAGATTCGC CGTTCTGGTC GCTGCTTGCC GATGCCATTA TCCTGATCCA GGATGAGGAT
CGCTATATTT TTGTTGAGCA GCTTTCAAAA CATCAGGACT CCCGCCATCC GGATTTCTGG
ATGGAAAAAG AGACCCATGT GCCCGACTAC GGCGATCCCG GCGAGGCAGG AGCACACTAT
TCAGCAGATT CATCATGGAT GCCTGAGGTG GTCATGCTGG CAAAAAGTAC CTATGTATGG
CTCGACCAGC TCAGTAAAAT CTATCATCGC CATATCGCGA GGCTTCAGGA TATTCCCGAC
AGGGAACTTG ATCTTCTCAG TGAACGAGGA TTCACTGCGC TCTGGCTTAT CGGACTCTGG
CAGCGAAGCG AGGCATCTGA AACCATCAAG CGCATCCAGG GAAACCCTGA AGCAAAAGCT
TCGGCATATG CCCTTGAACA ATACAATATC TCACGGGATA TCGGCGGTTA CGAGGGTTAT
CTTGATCTCC GCAACCGGGC GATGCAACGA GGTATACGGC TTGCAAGCGA TATGGTTCCG
AACCATACCG GCATTGATTC GGAACTGGTA AGGAACAACC CTGACTGGTT CCTCTCTGCC
TCCTCGGCAC CTTATGTCAA CTACTCCTAT AACGGCCCTA ACCTTTCGAG TGATGCCCGC
TACGGCATTT TCCTTGAAGA TGGCTACTGG AACAGAAGCG ATGCCGCGGT CACCTTCAAG
CGTGTTGATT ATCTGAATGG CGATACCCGC TATATCTATC ATGGCAATGA CGGAACGACA
ATGCCATGGA ACGATACCGC TCAGCTTGAC TTTCTGAGTG CAGAAGTCCG CGAGGGTGTT
ATCCAGCAGA TTCTTCATGT CGCAAGGATG TTTCCGATTA TCCGCTTTGA TGCGGCAATG
GTTCTGGCTA AACGGCACAT TCAGCGTCTC TGGTACCCGC TCCACGGTCA AACCGCAGGG
GTGCCGTCGC GCTCTTCCTT TTCGATGAGC ATGGAGGAGT TCAACGCAGC CATACCTGAA
GAGTTCTGGC GGGAGGTTGT CGACCGCATT CAGGCGGAGG TTCCCGGTAC GTTGCTGCTT
GCCGAAGCCT TCTGGATGCT TGAGGGATAT TTTGTCAGAA CCCTCGGCAT GCACAGAGTA
TACAACAGCG CTTTCATGCA TATGCTGAAA AAAGAAGATA ACGCCAGTTA CCGCTACCTG
ATCAAAAATA CTCTCGAATT CGATGCGGAA ATTCTCAAGC GATATGTCAA CTTCATGAAC
AATCCGGACG AGGATACCGC CATTGCCCAG TTCGGCAGGG GCGACAAGTA TTTCGGAGTC
TGCATGATGA TGATTACAAT GCCGGGTCTT CCCATGCTCG GACACGGACA GGTTGAGGGC
TTTACCGAGA AGTACGGAAT GGAATATGCC AAAGCCTATT ATGACGAGCA GCCCGACCAG
CATCTCGTAG ATCGCCACTA CCGGGAAATT TTTCCCGTCA TGAAAAAGCG GCCGCTTTTT
GCCGAGGTTG AAAACTTCTT TCTCTACGAC GTCTACTCCC CTGAAGGGAG CGTCAACGAA
AATATTTTCG CATACTCAAA CCGGCTCGGC GAGGAAAAAG CACTTGTTAT CTTCAATAAT
TGCGCAGTAC AGGCTGCCGG CTGGGTCAGT ACTTCCGTCG GCTACCGGAA GGGGGATGAA
ATCCGCCAGA CCTCCCTTGC CGATGGCCTC TTTCTCAGCA GAGAGGATAA TCTGTATGTG
ATCTTCAGGG ATCAATGTTC AGGTATGGAG TTTATCCGCT CAAACAGGAT CATTGCAGAA
CAGGGGCTGT TTGTCGCTCT CGAAGGGTAC CGATACAATC TCTTTCTTGA TTTCAGAGAG
GTCAAACCCT CAAGACTCAC CCCCTACGAC AGGTTGTGCG CCGAACTCAA CGGAAACGGT
ACCGCGTCAA TAGAGCATGA CGTCCTCTTC ATGAGCCTTG AGCCGCTTCA CCGTCTCTTA
TCGGAATTTC TCGCGCCTGA CAACCTTGCT CCGTTGCACG GGACAACCGA AAACGAGTTA
ATCTTTCCGG TTTTCAAAAC CATGATTGCC ACACTGCTGC AAGAGGTTGC CGTGAAGTAT
GGCAACCTCC TGGAGAAACC GGTTGATGTG CCTGCGACTC TTGCGGACGA AACCGTATCA
CGTTTCCGTC ATGCCTGCCA GATGGTCGAA GAGATGGCAA ATCTGCCCGA AAGTGACCGA
TTCAATGCGG CCCTCGGGAC CACAGAGAGA GAGAAATCAG GCTTTTCCAC CATCCTGCTG
CACTGGCTTG CGCTCGACTC GCTGCAGGAG ATGCTTCGCG CCAACGAGCT CTTGAGCTCG
AATCTGATCG ATGACTGGCT GATGAGTAAC ACACTTCAGC AACTTTACAG TCAAAAGGGT
CTGAACGGCA TTCCCGGTGA TAACATCCCT GATCTCCTCT ACTGCCTGCT TTCAGAAAAA
CCTGCAACGC TGCCTGCCGA CGAAGAGTCT GCTCTCTACG CCCTGCTGGC GCTCCTCGAT
GCATCGCACA GTGAACATGT AGCCCGCATT CTTCAGTTTG AAAACCGGCA TGAAAAAACA
TGGTTCCGCG AACACCGGTT CAGCGTGCTT GCAGCATGGC TGTCGGTAGA AGCTATGCTG
AAAAAAGAGA ACCTGCCTGT TGAACAAGAG AGCGGGAAAA CAGAGCATCC TGTTGCCGGT
TGGGTAGCAG CCGCAAGAAA ACTCGATATG CAGGCCTTCC TTTCAGGCTA TGAAATGGGC
GCCTTGCTCC GGCAAAAAGC CTGA
 
Protein sequence
MTSNNYQPAP PGKEHFYLNR QARELFKQRD PEFLVLFDRS GDSLAHAREQ TAIINSFLKK 
KKGDQAKLIL PAQFHGMKLL HEVFHALIAQ SVASRKPDFF KTVDDNFHRE LSQNRANEYE
KRFLDDFPPD ALFRGVQSTE EYLSIPEKRE QLIEESFLVW LNNQNPALGQ FEDLITDRNL
VKDDAYLQLI RSLKGSIHDM GAVGPGDIDL LELLWLPIRH SPDSILEQLR YIRLHWADLL
EDSPFWSLLA DAIILIQDED RYIFVEQLSK HQDSRHPDFW MEKETHVPDY GDPGEAGAHY
SADSSWMPEV VMLAKSTYVW LDQLSKIYHR HIARLQDIPD RELDLLSERG FTALWLIGLW
QRSEASETIK RIQGNPEAKA SAYALEQYNI SRDIGGYEGY LDLRNRAMQR GIRLASDMVP
NHTGIDSELV RNNPDWFLSA SSAPYVNYSY NGPNLSSDAR YGIFLEDGYW NRSDAAVTFK
RVDYLNGDTR YIYHGNDGTT MPWNDTAQLD FLSAEVREGV IQQILHVARM FPIIRFDAAM
VLAKRHIQRL WYPLHGQTAG VPSRSSFSMS MEEFNAAIPE EFWREVVDRI QAEVPGTLLL
AEAFWMLEGY FVRTLGMHRV YNSAFMHMLK KEDNASYRYL IKNTLEFDAE ILKRYVNFMN
NPDEDTAIAQ FGRGDKYFGV CMMMITMPGL PMLGHGQVEG FTEKYGMEYA KAYYDEQPDQ
HLVDRHYREI FPVMKKRPLF AEVENFFLYD VYSPEGSVNE NIFAYSNRLG EEKALVIFNN
CAVQAAGWVS TSVGYRKGDE IRQTSLADGL FLSREDNLYV IFRDQCSGME FIRSNRIIAE
QGLFVALEGY RYNLFLDFRE VKPSRLTPYD RLCAELNGNG TASIEHDVLF MSLEPLHRLL
SEFLAPDNLA PLHGTTENEL IFPVFKTMIA TLLQEVAVKY GNLLEKPVDV PATLADETVS
RFRHACQMVE EMANLPESDR FNAALGTTER EKSGFSTILL HWLALDSLQE MLRANELLSS
NLIDDWLMSN TLQQLYSQKG LNGIPGDNIP DLLYCLLSEK PATLPADEES ALYALLALLD
ASHSEHVARI LQFENRHEKT WFREHRFSVL AAWLSVEAML KKENLPVEQE SGKTEHPVAG
WVAAARKLDM QAFLSGYEMG ALLRQKA