Gene Tpet_0355 details

Gene Information

Locus tag: Tpet_0355
Symbol:
ID: 5170207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 338170
End bp: 339549
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 45%
IMG OID: 640562859
Product: major facilitator transporter
Protein accession: YP_001243960
Protein GI: 148269500
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
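
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(338170, 339549)
print(length)                  # 1380
print(protein_length(length))  # 459
```

Under these assumptions both results agree with the values listed in the record.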


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000720237
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGG CCTATCTCTT TCTCACACTC GAAGGTGTCT TCTCTCTGTT CTACGCGCTT 
CTAATACAGG GACCTGTGTT CACAGGTCTT GCGATGCTTT TCAATCTGGA CGAGTTTCTC
CTGAGTGTCG CCGCTGCAAT CCCTCCCATG ATGCAGTTTT TTCAGCTGTT TGCCTCTTTC
TTTGTTCAGA AATACAGAAA GAGGCGTTTT CTGGTCAACG TGTTCAACGC TTTCAGCAGA
TTCAGTTTCG CTTTTCTCAT CGTTTTTCTA CTGCTTGGGA AAACAGAACC GTTGATTTTC
ATCGTTGCGC TCGTGATTTC CCAGATCTTC GCGGCCCTCT CCAGCAGTAC GTGGAACTCA
TGGATGAGAG ATCTCGTTCC TCCAGAGGAA AGAGGAAGGG TGTTTGGAAA CAGAAACATG
TTTCTATCCA TAGGAAATGC TCTCATTATA TACCTCTATT CTTCTATAGT TGATCATTTC
TCTTCGGGTT TTGTTCTTGT GCTTCTCATA TCCACAGTGG GAACCGTCCT TTCTATCTTT
GCCATGAACA AAATACCGGA CGTTCCTGTT AGGGAAACAG TAACGGGAAT TCCTCTCAAA
GCGGTGTTCA AGGACGAAAA CTTTATGAAG TTTGTCTTCT TCACCTTTTA CTGGAACATG
GCGGTCACGT TCTCTTCTGC TTTCTATCAC TATCATCTTT TGAAGAACCT GGGTGTGGAT
TACACGTACA TCGCTTACAT GATGATCGTG AACAACTTTG TGGCGATGCT GGTTTACAGG
ATCCTTGGAA AAGTGTCTGA CAGGGTAGGA CACAAAACGA TAGCCGAGTT CGGTATCATC
CTTGCTTCTT TTGTCTCCGG TATGTGGTTT TTCATGAACA CCAATACTTA CAGAACCCTG
ATGGTAGCGG ACGCCTTCCT TTCTTCCATC GCGTGGTCCG CTATAAACCT GTCACTCGCC
ATCCTTCCTA TGGAGGTTGC TTTCGAGTCT GATCCCGTTT TCTTCGGTTT GAACGCGTCT
TTTGCCAGCG CTGGAAGTCT TATAGGCTCA TTTGCAGGTG GAATCACGGC GAAATTTCTC
TCTGATATTT ACGTAAATCT CCACGGTTTC GAAATTTTTG GACTTCAACT GCTCTTTTTG
ATGGCAGGAA TCTTCAGATT CTCCGCTGTG TTCTTTTTGA GAAAAGTCAA GGTGAAGAAG
CACATCCCGT TCAAAGCCTT CATGTTGAGT ACCCTTTCTG TCACTCTGAG ACGTCCCATA
GACAGAATGC TCGATGTGTA TCTACTTTTG AAAAGGGGGA ATGAACGTGT ACGAGAGCTT
GTTGGAAGAT CTAAGAGAAG GGAGAGTAAT ACTGACGAGC AAGGCGGGAA ACAAGGTTAA
 
Protein sequence
MKKAYLFLTL EGVFSLFYAL LIQGPVFTGL AMLFNLDEFL LSVAAAIPPM MQFFQLFASF 
FVQKYRKRRF LVNVFNAFSR FSFAFLIVFL LLGKTEPLIF IVALVISQIF AALSSSTWNS
WMRDLVPPEE RGRVFGNRNM FLSIGNALII YLYSSIVDHF SSGFVLVLLI STVGTVLSIF
AMNKIPDVPV RETVTGIPLK AVFKDENFMK FVFFTFYWNM AVTFSSAFYH YHLLKNLGVD
YTYIAYMMIV NNFVAMLVYR ILGKVSDRVG HKTIAEFGII LASFVSGMWF FMNTNTYRTL
MVADAFLSSI AWSAINLSLA ILPMEVAFES DPVFFGLNAS FASAGSLIGS FAGGITAKFL
SDIYVNLHGF EIFGLQLLFL MAGIFRFSAV FFLRKVKVKK HIPFKAFMLS TLSVTLRRPI
DRMLDVYLLL KRGNERVREL VGRSKRRESN TDEQGGKQG
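
Both sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC fraction and a translation check. The helper names are hypothetical. The codon table is built from the standard genetic code, which assigns the same amino acids as translation table 11 (the two tables differ only in which codons may act as start codons; that does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_start = clean("ATGAAGAAGG CCTATCTCTT TCTCACACTC")
protein_start = clean("MKKAYLFLTL")
print(translate(gene_start) == protein_start)  # True
```

Passing the full gene sequence block to clean() and then to gc_fraction() and translate() would allow the GC content and protein sequence in this record to be compared against recomputed values.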