Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1555 |
Symbol | |
ID | 5170303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1548790 |
End bp | 1550628 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640564081 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_001245138 |
Protein GI | 148270678 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.428291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAGG TTGGTTTGAT GAGAGGGGTG CTTTTCGTGC TGATGATTTC TTCCATGGCC TTTGGTTTGA TAGTCAACCC GGTGAAAAAC CTGTGCGAGG ATTTCATCTT TGGAATGGAT GTTTCTATGC TCTACGAGAT CGAGCAACTG GGTGGGAAAT ATTTCGAGAA TGGTGTGGAA AAAGATTGTC TTGAAATACT GAAGAATCAT GGAATAAACT GGATCAGGTT GAGGGTGTGG AATGATCCGA GAGACGAGAA TGGAAATCCT CTCGGAGGAG GAAACTGCGA TTACCTGAAG ATGACAGAAA TCGCTAAAAG GGCAAAGAAA CTCGGAATGA AAGTGCTTCT TGACTTCCAT TACAGCGACT GGTGGGCGGA TCCTGGAAAG CAGAACAAAC CAAAAGAGTG GGAATATCTT CATGGAGAAC TTCTGGAGAG GGCGGTTTAC TCCTACACGA AACTTGTACT GAACCACATG CGAAGAAACG GGGCACTACC AGATATGGTC CAGGTGGGGA ATGAGGTGAA CAACGGTTTT CTCTGGCCTG ACGGCAAGAT TTCTGGAGAA GGTGCAGGTG GTTTCGACGG ATTCACAAGA CTTTTGAAAG CTGCCATCAA GGCCGTTAGA GAGGTTGATC CGGATATAAA GATCGTTATT CATCTGGCGG AAGGTGGAAA CAACTCTCTC TTCAGATGGT TCTTCGACGA GATCACAAGA AGAAACGTGG ACTTCGATGT AATAGGTGTA TCTTACTACC CGTACTGGCA CGGAACCCTC GAGGATCTGA AAAACAACCT CTACGACATA GCCACAAGAT ATAACAAGGA TGTGCTCGTT GTTGAAACAG CTTACGCCTG GACACTCGAG GATGGAGATG GTTATCCCAA CATCTTCAAT GGTGAAGAAA TGGAACTAAC AGGTGGCTAC AAGGCAACCG TTCAGGGACA GGCAACATTT CTGAGAGATC TCATGGAAGT GGTAAACAGC GTTCCCAACG GCCATGGACT CGGGATTTTC TACTGGGAAG GAGATTGGAT CCCTGTGAGG GGGGCTGGAT GGAAAACCGG AGAAGGAAAC CCCTGGGACA ACCAAGCTAT GTTCGATTTC AGTGGGAACG CTCTCCCATC ACTGAATGTT TTCAAACTGG TGAAAACATC ATCGCCAGTG GAGATTGCGA TAAAAGAGAT CCTTCCTGTG GAGGTTACAA CCAACCTGGG AGAGGTTCCA AAATTTCCAG ATGCTGTGAA AGTTCTGTTC AGCGACGATT CCATCAGATC TTTACCTGTC GAATGGAACT TTGATTCTGC CCTTGTTGAA GAATCCGGTG TTTACAAAGT GGAAGGCTAC ATTAAAGACA TTGACCGGAA AATTTTCGCG ACACTCACCG TGAAGGGTAG CAGAAACTAT CTGAAAAATC CGGGCTTCGA AACAGGAGAA TTTTCGCCTT GGCAGGTCTC GGGAGACAAA AAAGCGGTGA AAGTTGTAAA AGTCAATCCT TCAAGCAATG CGCACCAGGG AGAGTACGCA GTGAATTTCT GGCTCGATGA ATCCTTCAGT TTCGAACTGT CACAAGAAGT GGAACTTCCA GCAGGTGTGT ACAGAGTAGG GTTCTGGACC CATGGAGAAA AAGGTGTGAA GATTGCTCTG AAGGTAAGTG ATTACGGAGG AGATGAACGA TCTGTAGAAG TTGAAACAAC GGGCTGGCTC GAATGGAAGA ACCCGGAGAT AAGGAACATA AAAGTTGAAA CAGGAAGAAT AAAGATTACC GTTTCTGTCG AGGGAAGGGC AGGTGACTGG GGGTTCATTG ATGATTTCTA TCTTTTCAGA GAAGAGTAA
|
Protein sequence | MKEVGLMRGV LFVLMISSMA FGLIVNPVKN LCEDFIFGMD VSMLYEIEQL GGKYFENGVE KDCLEILKNH GINWIRLRVW NDPRDENGNP LGGGNCDYLK MTEIAKRAKK LGMKVLLDFH YSDWWADPGK QNKPKEWEYL HGELLERAVY SYTKLVLNHM RRNGALPDMV QVGNEVNNGF LWPDGKISGE GAGGFDGFTR LLKAAIKAVR EVDPDIKIVI HLAEGGNNSL FRWFFDEITR RNVDFDVIGV SYYPYWHGTL EDLKNNLYDI ATRYNKDVLV VETAYAWTLE DGDGYPNIFN GEEMELTGGY KATVQGQATF LRDLMEVVNS VPNGHGLGIF YWEGDWIPVR GAGWKTGEGN PWDNQAMFDF SGNALPSLNV FKLVKTSSPV EIAIKEILPV EVTTNLGEVP KFPDAVKVLF SDDSIRSLPV EWNFDSALVE ESGVYKVEGY IKDIDRKIFA TLTVKGSRNY LKNPGFETGE FSPWQVSGDK KAVKVVKVNP SSNAHQGEYA VNFWLDESFS FELSQEVELP AGVYRVGFWT HGEKGVKIAL KVSDYGGDER SVEVETTGWL EWKNPEIRNI KVETGRIKIT VSVEGRAGDW GFIDDFYLFR EE
|
| |