Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2764 |
Symbol | |
ID | 4072386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3272905 |
End bp | 3274458 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984781 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_591839 |
Protein GI | 94969791 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACGA AGCGAGCTTC CGGCATTCTT CTCCATCCAT CTTCCTTGCC GTCGCGTGGC GGCATTGGCG ATTTCGGCCC CGCTGCTTAT GAATTCGTAA ATTGGCTGGC GGAGGGCAAG CAAACGCTGT GGCAGATCCT GCCGCTCGGG CCTCCGGGGA TCGGGAACTC GCCGTATTCA TCGACGTCGG CATTTGCGGG AAACATTGTG CTCATCAGCC TCGAGCGGCT GGCGGAGCGC GGAATGCTCG ACAACGCTGC GCTGAAGAGC TTGCCGGAAA GCGATGGGAG CCGCGTTAAT TTTGAGAACG TATTGAAGGT CAAACTCCCG CTGTTGCGAC AGGCGGCGGA GAGCTTTTTG AAGAACGCTT ACGGAAATTC GCGCGAGCGG TTCAATCGCT TTTGCCGCGA CAACACCTGG TGGCTCGACG ATTTCGTGCT CTTCGACGCT CTGCGCGAAC GTTACCAGGG AGCGAGCTGG AATACGTGGC CGACCGAGAT TTCGCATTGC CAGCCGGAGG CAATTGCCAA GACGCGCAGC GAACTGGCGC ACGAGTTGGA GGTCGCAAAG TTCCTGCAGT TTGCGTTCTT CGAGCAGTGG GGCGCGCTGC GAAACTATTG TCATCAGCGG CGAATCCGGA TTGTGGGCGA CGTGGCGATA TTCGTCAGCT ACGACAGCGC TGACGTCTGG ACGCACCGCG ACATCTTCCG TCTGCGCGAC GTGGAACCCG AAGTCGTTGC CGGAGTGCCA CCGGATGCAT TCAGCGATAC CGGCCAGCGA TGGGGGAATC CGCTATATGA CTGGAACCGC CTGCGCGAGC GCGGCTACGA CTGGTGGGTG AGCCGCATGC GCTGGGCGCA CACCTGGTGC GACATCCTGC GCATCGATCA CTTCCGCGGC TTTGAATCGT ATTGGGAGAT TCCGGCGGAC GAACCGACGG CGATCCACGG TAGCTGGGCG AAAGGTCCCG CTGACGAGTT TTTCCACGTG ATCAACCGCG AACTCGGCGA GTTGCCATTC ATTGCCGAAG ACCTGGGAAT GATCACTCCG GAAGTGCATC AATTGCGCGA GCGGCTGAAG ATTCCCGGTA TGCGCGTGCT GCAGTTTGCG TTTGGCGATC GCGGCGCGCA CATGTACTTG CCGCACCGCT ACGACACAAA CACCGTGGTC TATACCGGCA CCCACGACAA CGACACCACG ATGGGTTGGT GGCAAGGTGA CGCCCAGCCG CATGAAAAGC GCGACGCCGC CGCAGCCTTT GGCGCGAACG ACCAGAACGT GCATTGGGCG TTCATTCGCG CGGCGCAGAC ATCGCTGGCA ACGCTGAGCG TGGTGCCGCT GCAGGACGTC TTCGGCCTTG ACAGTTCCGC GCGCATGAAT ACGCCGAGCC TTTCCGACGG CAACTGGGGC TGGCGCTACA AGCGCGGGCT GCTTACGCAA GATGCAGCGA AAACGCTGTC AGAACTAGCG GAGACGACGG ATCGCGACGA GTTGTTGTTA TCAGGCGGAA ACCAGCAGGG CAACGGGGAA CGACCGGAAG ATTTCGCTGC ATAA
|
Protein sequence | MFTKRASGIL LHPSSLPSRG GIGDFGPAAY EFVNWLAEGK QTLWQILPLG PPGIGNSPYS STSAFAGNIV LISLERLAER GMLDNAALKS LPESDGSRVN FENVLKVKLP LLRQAAESFL KNAYGNSRER FNRFCRDNTW WLDDFVLFDA LRERYQGASW NTWPTEISHC QPEAIAKTRS ELAHELEVAK FLQFAFFEQW GALRNYCHQR RIRIVGDVAI FVSYDSADVW THRDIFRLRD VEPEVVAGVP PDAFSDTGQR WGNPLYDWNR LRERGYDWWV SRMRWAHTWC DILRIDHFRG FESYWEIPAD EPTAIHGSWA KGPADEFFHV INRELGELPF IAEDLGMITP EVHQLRERLK IPGMRVLQFA FGDRGAHMYL PHRYDTNTVV YTGTHDNDTT MGWWQGDAQP HEKRDAAAAF GANDQNVHWA FIRAAQTSLA TLSVVPLQDV FGLDSSARMN TPSLSDGNWG WRYKRGLLTQ DAAKTLSELA ETTDRDELLL SGGNQQGNGE RPEDFAA
|
| |