Gene Tpet_1555 details


Gene Information

Locus tag: Tpet_1555
Symbol:
ID: 5170303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1548790
End bp: 1550628
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 46%
IMG OID: 640564081
Product: arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_001245138
Protein GI: 148270678
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:
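
The two length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with this record (1550628 - 1548790 + 1 = 1839), and that the last codon of the unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1548790, 1550628)
print(length, protein_length(length))  # 1839 612
```

Both values agree with the Gene Length and Protein Length fields of this record.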


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.428291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAGG TTGGTTTGAT GAGAGGGGTG CTTTTCGTGC TGATGATTTC TTCCATGGCC 
TTTGGTTTGA TAGTCAACCC GGTGAAAAAC CTGTGCGAGG ATTTCATCTT TGGAATGGAT
GTTTCTATGC TCTACGAGAT CGAGCAACTG GGTGGGAAAT ATTTCGAGAA TGGTGTGGAA
AAAGATTGTC TTGAAATACT GAAGAATCAT GGAATAAACT GGATCAGGTT GAGGGTGTGG
AATGATCCGA GAGACGAGAA TGGAAATCCT CTCGGAGGAG GAAACTGCGA TTACCTGAAG
ATGACAGAAA TCGCTAAAAG GGCAAAGAAA CTCGGAATGA AAGTGCTTCT TGACTTCCAT
TACAGCGACT GGTGGGCGGA TCCTGGAAAG CAGAACAAAC CAAAAGAGTG GGAATATCTT
CATGGAGAAC TTCTGGAGAG GGCGGTTTAC TCCTACACGA AACTTGTACT GAACCACATG
CGAAGAAACG GGGCACTACC AGATATGGTC CAGGTGGGGA ATGAGGTGAA CAACGGTTTT
CTCTGGCCTG ACGGCAAGAT TTCTGGAGAA GGTGCAGGTG GTTTCGACGG ATTCACAAGA
CTTTTGAAAG CTGCCATCAA GGCCGTTAGA GAGGTTGATC CGGATATAAA GATCGTTATT
CATCTGGCGG AAGGTGGAAA CAACTCTCTC TTCAGATGGT TCTTCGACGA GATCACAAGA
AGAAACGTGG ACTTCGATGT AATAGGTGTA TCTTACTACC CGTACTGGCA CGGAACCCTC
GAGGATCTGA AAAACAACCT CTACGACATA GCCACAAGAT ATAACAAGGA TGTGCTCGTT
GTTGAAACAG CTTACGCCTG GACACTCGAG GATGGAGATG GTTATCCCAA CATCTTCAAT
GGTGAAGAAA TGGAACTAAC AGGTGGCTAC AAGGCAACCG TTCAGGGACA GGCAACATTT
CTGAGAGATC TCATGGAAGT GGTAAACAGC GTTCCCAACG GCCATGGACT CGGGATTTTC
TACTGGGAAG GAGATTGGAT CCCTGTGAGG GGGGCTGGAT GGAAAACCGG AGAAGGAAAC
CCCTGGGACA ACCAAGCTAT GTTCGATTTC AGTGGGAACG CTCTCCCATC ACTGAATGTT
TTCAAACTGG TGAAAACATC ATCGCCAGTG GAGATTGCGA TAAAAGAGAT CCTTCCTGTG
GAGGTTACAA CCAACCTGGG AGAGGTTCCA AAATTTCCAG ATGCTGTGAA AGTTCTGTTC
AGCGACGATT CCATCAGATC TTTACCTGTC GAATGGAACT TTGATTCTGC CCTTGTTGAA
GAATCCGGTG TTTACAAAGT GGAAGGCTAC ATTAAAGACA TTGACCGGAA AATTTTCGCG
ACACTCACCG TGAAGGGTAG CAGAAACTAT CTGAAAAATC CGGGCTTCGA AACAGGAGAA
TTTTCGCCTT GGCAGGTCTC GGGAGACAAA AAAGCGGTGA AAGTTGTAAA AGTCAATCCT
TCAAGCAATG CGCACCAGGG AGAGTACGCA GTGAATTTCT GGCTCGATGA ATCCTTCAGT
TTCGAACTGT CACAAGAAGT GGAACTTCCA GCAGGTGTGT ACAGAGTAGG GTTCTGGACC
CATGGAGAAA AAGGTGTGAA GATTGCTCTG AAGGTAAGTG ATTACGGAGG AGATGAACGA
TCTGTAGAAG TTGAAACAAC GGGCTGGCTC GAATGGAAGA ACCCGGAGAT AAGGAACATA
AAAGTTGAAA CAGGAAGAAT AAAGATTACC GTTTCTGTCG AGGGAAGGGC AGGTGACTGG
GGGTTCATTG ATGATTTCTA TCTTTTCAGA GAAGAGTAA
 
Protein sequence
MKEVGLMRGV LFVLMISSMA FGLIVNPVKN LCEDFIFGMD VSMLYEIEQL GGKYFENGVE 
KDCLEILKNH GINWIRLRVW NDPRDENGNP LGGGNCDYLK MTEIAKRAKK LGMKVLLDFH
YSDWWADPGK QNKPKEWEYL HGELLERAVY SYTKLVLNHM RRNGALPDMV QVGNEVNNGF
LWPDGKISGE GAGGFDGFTR LLKAAIKAVR EVDPDIKIVI HLAEGGNNSL FRWFFDEITR
RNVDFDVIGV SYYPYWHGTL EDLKNNLYDI ATRYNKDVLV VETAYAWTLE DGDGYPNIFN
GEEMELTGGY KATVQGQATF LRDLMEVVNS VPNGHGLGIF YWEGDWIPVR GAGWKTGEGN
PWDNQAMFDF SGNALPSLNV FKLVKTSSPV EIAIKEILPV EVTTNLGEVP KFPDAVKVLF
SDDSIRSLPV EWNFDSALVE ESGVYKVEGY IKDIDRKIFA TLTVKGSRNY LKNPGFETGE
FSPWQVSGDK KAVKVVKVNP SSNAHQGEYA VNFWLDESFS FELSQEVELP AGVYRVGFWT
HGEKGVKIAL KVSDYGGDER SVEVETTGWL EWKNPEIRNI KVETGRIKIT VSVEGRAGDW
GFIDDFYLFR EE
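
Both sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that layout, translate the gene sequence, and compute a GC percentage. It uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The function names are hypothetical, and the code handles only unambiguous A, C, G, and T bases.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the ten-character block layout."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGAAGGAGG TTGGTTTGAT GAGAGGGGTG")
print(translate(first_blocks))  # MKEVGLMRGV
```

The ten residues printed match the start of the protein sequence in this record. The final three codons of the gene sequence (GAA GAG TAA) translate to EE followed by a stop, which matches the end of the protein sequence.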