Gene Tpen_1599 details


Gene Information

Locus tag: Tpen_1599
Symbol:
ID: 4601530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1548268
End bp: 1549533
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 58%
IMG OID: 639774372
Product: major facilitator transporter
Protein accession: YP_920997
Protein GI: 119720502
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
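
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1548268, 1549533)
    print(length, "bp")
    print(expected_protein_length(length), "aa")
```

Under these assumptions the sketch reproduces the listed values of 1266 bp and 421 aa.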


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAGGT TCAGCTACGG GAAGATATTC CTCCTGGGGT TCGGGTTCTT CGGGATAAGC 
ATCCTGTGGT CTATCTACAA CTCCTACGTC CCGATATTCC TGAAAGAGTT CGGGCTCGCA
TCGTGGCTCG TAGGGTTCAT AATGACTATC GACAACATAT TCGCGGTCGT GCTTCTCCCC
TACATAGGCG TGCTGAGCGA CGTAACTAGG ACGAGGATAG GCAGAAGGAA GCCCTACATA
ATCCTCGGGG CCCCTCCAGC CGCGCTGACA TTCGCGCTGA TACCGCTACT CAGAGGAGAC
TTCTACGCGA TGCTAGCCGT TATAGTCGTG ATGAACTTTT CGATGGCGTT GTTCAGGTCG
CCTGTAATAG CGTTCATGCC CGACATAACC CCCTCGGAGA AGAGAAGCCA GGCGAACGGC
ATAATAAACT TCATGGGCGG CGTTGGATCC CTCCTAGCGT TCTTCGTGGG CGCAAAGCTC
TACGAAATGA ACCCCTCCTA CCCCTTCGTA GCCGCGGCAG TCACGATGCT CCTGGCATCG
CTACTCGTTG TCCTACTCGT AGACGAGCCC GAGGAGTTCA AGGCGAGGGG AGGGTCCGTC
AGGCTGGGCG AGCTCCTGCG GGAGTCGTTT AGGAAGAGCT TCTCAGAGCT CTCGGCGAAC
CTCAGGGAGG CGTTCCTGGG CGAGGATAAA AGCCTCCTCT TCATGCTAGC CTCGATCTTC
CTGTGGTTCA TAGGCTACAA CGCGATAGAG ACTTTCTTCA CCAGCTACGC GAAGTGGTAC
CTGGGGATCG GGGAGGCGGC GGGCTCCCTG ATCCTGGGCT TCGTGGCGCT CGGGTTCCTC
GTGTTCTCCC TACCCGCGGG CTTTATCGGG GCGAGGCTCG GCAGGAGGAA GACCATGACG
CTCGGGCTCG CCCTGCTGGT AGTCCTGCTC GGCTTAGCGT TCTACGCCTC GACAGCCGTG
AAGACAGGGG CGGTGATCTA CGTTCTAGGC GCCATATTCT TCTTCGGAGG ATTCGCTTGG
GCCCTCGTCA ACGTCAACTC CCTTCCAACC GTGGTGGACA TGACCAGTAG GGAGAGGCTC
GGAGCATACA CGGGGCTCTA CTACTTCGCG TCCCAGAGCG CCGCTATAAC CGCCCCGCCG
CTGGCAGGCC TCTTCATAGA CGTGCTCGGC TACCAGGCGC TATTCCCCTA CTCGATAGTC
TTCCTCCTAG CGTCCGCGGT AACACTACAG TTCGTTAAGC GCGGGGAAGC CAGGAAAGGG
TTCTAA
 
Protein sequence
MERFSYGKIF LLGFGFFGIS ILWSIYNSYV PIFLKEFGLA SWLVGFIMTI DNIFAVVLLP 
YIGVLSDVTR TRIGRRKPYI ILGAPPAALT FALIPLLRGD FYAMLAVIVV MNFSMALFRS
PVIAFMPDIT PSEKRSQANG IINFMGGVGS LLAFFVGAKL YEMNPSYPFV AAAVTMLLAS
LLVVLLVDEP EEFKARGGSV RLGELLRESF RKSFSELSAN LREAFLGEDK SLLFMLASIF
LWFIGYNAIE TFFTSYAKWY LGIGEAAGSL ILGFVALGFL VFSLPAGFIG ARLGRRKTMT
LGLALLVVLL GLAFYASTAV KTGAVIYVLG AIFFFGGFAW ALVNVNSLPT VVDMTSRERL
GAYTGLYYFA SQSAAITAPP LAGLFIDVLG YQALFPYSIV FLLASAVTLQ FVKRGEARKG
F
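
The sequence blocks above are printed in groups of 10 characters, 60 per line. The following is a minimal sketch of how such a block could be cleaned, translated, and summarised by GC fraction. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model); all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed in the record.
    start = clean_sequence("ATGGAGAGGT TCAGCTACGG GAAGATATTC")
    print(translate(start))
```

Applied to the first 30 bases of the gene sequence, `translate` returns `MERFSYGKIF`, the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.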