Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1085 |
Symbol | |
ID | 4570029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1226746 |
End bp | 1228695 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765682 |
Product | alpha-amylase family protein |
Protein accession | YP_911550 |
Protein GI | 119356906 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0151666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAA ACAGCACACC ACATCAAACA ACAAACCTTA AGCTCGTTCT TCATGCATTG GAGCAACTGC CTGAACTGAA ACCCGGCCAG CCATATTACA TTCCCGGACT CTGGAATGGG GGTGATAGTG GTTTTGAACC GGTTCAGCCG GGAAAATACT TTGCCGATAT CATTACGAAC ATGTTATCGG GCAACGCAAA CGAAATCCGC AACCCTTTGC AATCTTCAGG GAACTGGACG CCGGATGCTG TCGTGTATAA CCTGTTTATA CGGCTCACCA CTGCATATGA TCACAATAAT GATGGCATTG TCAATACCGA ACCTCTTGAA AATGGCTTCA GGGAAACAGG AACGCTCCTC AAGGCAACAG GCCTGTTACC CTACATTAAA AAACTCGGTG TCAACACGCT TTACCTCCTT CCGGTTACGT TGACCGGATC CGATGATAAA AAAGGCTCCC TCGGCTCCCC TTACGCAGTA AAAGACCCTT TTGCCATAGA CCCTATGCTC GACGAACCGG CTCTCGGGCT TTCGGCTGAC ATCCTCCTGA AAGCGTTTGT CGAAGCCGCT CATCTGCTCA ACATGCGTGT TCTTTTTGAG TTCGTCTTTC GAACAGCATC AGTTGACAGC AACTGGATTC AGGATCATCC GGACTGGTTT TACTGGATTT CAGAGACTGG GTGCGCAACG AAGTATGGTC CGCCGCACTT CAGCGATGAA ATCCTTCACA CCATTTATGA AAAGGTCGAC AAGCATGACC TGAACAATCT GCCGGCTCCG GAAGCCGATT ACAAAAAAAT GTTTACCTCC GTTCCTGTCG CCATCCACAA GAAGGATGAA AAACTCAGAG GAATCACAAA ACAGGGAGAA GAGTGCAGAA TTGCCAGCGC ATTTTCAGAC TGGCCGCCGG ATGACAGGCA GCCGCCATGG ACCGACGTAA CCTATCTGAA AATGCACAAC CATCCGGAAT TCAACTATAT CGCGTACAAC ACCATCCGGA TGTATGACGT TGCTCTCGAT AATCCCGAAT ACCGGAATAA TCTCCTCTGG GAAACAATAG AAGCGATCAT CCCCCATTTT CAGGAGAACT ATTGCATTGA CGGCGCAATG ATCGACATGG GGCATGCGCT GCCATCAGCA CTCAAACACT CCATCGTCAA ACGTGCGCGA CGCAACAATC CGAATTTCGC ATTCTGGGAT GAAAATTTTG ACCCGACGCC ATCCGTCAGG GATGAAGGGT TCAATGCCGT ATTCGGCTCC CTTCCCTTTG TTATTCACGA TCCTGTTTTT ATACGAGGAT TGCTCAATTA TCTCAATAAA ACCGGTGTAG CACTTCCTTT TTTTGGTACG GGAGAAAACC ACAACACTCC GAGAGTATGT CACGGCTATC CCGGTATGGA AACTGGCAGA AACAGGTCTT CCTTCATTTT CACCCTCTGC TGTATTCTGC CGGCAATCCC GTTTCTACAC TCAGGCATGG AGCTCTGTGA ATGGCATCCG GTTAATCTCG GACTGAACTT CACCGCTGAA GACAGGGAAC GCTTTCCGTC AGACAAGCTC CCGCTTTTCA GTGCGTTCAG TTACGATTGG AAAACAGCCA ACGGCCTTAA GACTCTCAAC AGCTACATCA GCAAGATCCT GTCCATAAGA GCCAACTATC GTGAACTTGT TCAGTGTATG GATAAGGGTT CAATGATTCT TCCCTACATC ACCAATCCTG AACTTTTGGC CGTCATGAGA AAAAACGGCG ACACAACGCT GCTCTTCATT GGAAACAGCA ATGGATCTGA AGCGCAGAAT GGCATCATTG AATTCGGCAT CACTGACGCA ATCCTCTTTG ATCTTATCGA AGAAAAAGAG TACCCTGTTT CAAATCACTC CCTGAGCTTG CATTTCAAGC CGGGTCAAAG TATGCTTTTC GAACTTCCTG TAGAGGAAAA ACCATCGTAG
|
Protein sequence | MKQNSTPHQT TNLKLVLHAL EQLPELKPGQ PYYIPGLWNG GDSGFEPVQP GKYFADIITN MLSGNANEIR NPLQSSGNWT PDAVVYNLFI RLTTAYDHNN DGIVNTEPLE NGFRETGTLL KATGLLPYIK KLGVNTLYLL PVTLTGSDDK KGSLGSPYAV KDPFAIDPML DEPALGLSAD ILLKAFVEAA HLLNMRVLFE FVFRTASVDS NWIQDHPDWF YWISETGCAT KYGPPHFSDE ILHTIYEKVD KHDLNNLPAP EADYKKMFTS VPVAIHKKDE KLRGITKQGE ECRIASAFSD WPPDDRQPPW TDVTYLKMHN HPEFNYIAYN TIRMYDVALD NPEYRNNLLW ETIEAIIPHF QENYCIDGAM IDMGHALPSA LKHSIVKRAR RNNPNFAFWD ENFDPTPSVR DEGFNAVFGS LPFVIHDPVF IRGLLNYLNK TGVALPFFGT GENHNTPRVC HGYPGMETGR NRSSFIFTLC CILPAIPFLH SGMELCEWHP VNLGLNFTAE DRERFPSDKL PLFSAFSYDW KTANGLKTLN SYISKILSIR ANYRELVQCM DKGSMILPYI TNPELLAVMR KNGDTTLLFI GNSNGSEAQN GIIEFGITDA ILFDLIEEKE YPVSNHSLSL HFKPGQSMLF ELPVEEKPS
|
| |