Gene Tpen_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1149 
Symbol 
ID4600955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1087814 
End bp1088890 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content61% 
IMG OID639773925 
ProductABC transporter related 
Protein accessionYP_920550 
Protein GI119720055 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGTGG AGCTTAGAAA CGTAACGAAG AGGTTCGGGG AAGTAGCCGC GGTGTACAAC 
GTCAACCTGC GCGTAGAGAG CGGAGAGTTC TTCGTGCTAC TCGGCCCCTC GGGGTCGGGT
AAGTCTACGC TCCTACGCAT AATAGCGGGC CTCGAAGAGC CCGACGAGGG AGAGGTTATC
ATCGGGGGCA GAGTCGTGAA CGACGTAGAC CCGAGCGAGA GGGACATAGC TTTCGTGTTC
CAGAACTACG CCCTCTACCC GCACATGACG GTCTACGACA ACATAGCGTT CCCGCTCAGG
ATGAGGAAGG TCCCGAAGGA CCAGATCCAC GCGCGCGTGC TCGAAGTCGC CTCCATGCTA
GGGCTTACGA ACCACCTCTC CAAGTACCCC TACCAGCTCT CGGGCGGCGA GCAACAGAGA
GTAGCCCTGG CGAGGGCTAT CGTGAGGAAG CCCCGGGTAT TCCTCCTCGA CGAGCCTCTC
AGCAACCTGG ACGCCAAGCT CCGCGTAAAG CTAAGGTTCG AGTTGAGGAA GCTCCTCCAC
GACGAGCTGA AAACAACCAC GATCTACGTC ACCCACGACC AGGTCGAAGC AATGACGATG
GCGGATCGCG TAGGCGTTAT CAACAGAGGG CAACTGGTAC AGGTAGGCAC CCCCGACGAG
CTCTTCGAGA AGCCATCAAA CACCTTCGTC GCGGGATTCA TAGGAACACC GCCCATGAAC
TTCCTGCCAG CCAGGGTCGC TGAGAAGACG CTACGCCTCG GCAACCTCGC TATCCGCTCC
GAGGAGCTAG AGGGGCTAGC CGAGGGCGAG GTAATCCTCG GCATACGCCC CCAGCACCTC
GAGGTAGGCG AGGAGGGGCT CCCAGTGCGG GTCGTGGGCG TCGAGAGGCT CGGCACGGGC
TCGATACTCC ACGGAGTCTT CGAAGGCTTC GAGGTCACCG CGTACTCCGA GAAAAGGGAG
CACGCCCAGC CGGAGCTTAA CGCCGAGGTA CGCTTAAAGC CCGTGGGGCC CCTCTACCTC
TTCGACTCGA GGAGCGAGGA GCTACTCAGG GTTGTGCGGA GCTACAGGGT CGAGTAG
 
Protein sequence
MSVELRNVTK RFGEVAAVYN VNLRVESGEF FVLLGPSGSG KSTLLRIIAG LEEPDEGEVI 
IGGRVVNDVD PSERDIAFVF QNYALYPHMT VYDNIAFPLR MRKVPKDQIH ARVLEVASML
GLTNHLSKYP YQLSGGEQQR VALARAIVRK PRVFLLDEPL SNLDAKLRVK LRFELRKLLH
DELKTTTIYV THDQVEAMTM ADRVGVINRG QLVQVGTPDE LFEKPSNTFV AGFIGTPPMN
FLPARVAEKT LRLGNLAIRS EELEGLAEGE VILGIRPQHL EVGEEGLPVR VVGVERLGTG
SILHGVFEGF EVTAYSEKRE HAQPELNAEV RLKPVGPLYL FDSRSEELLR VVRSYRVE