Gene Tpen_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1831 
Symbol 
ID4602068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1773655 
End bp1774947 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content57% 
IMG OID639774604 
Productmajor facilitator transporter 
Protein accessionYP_921229 
Protein GI119720734 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.612991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTAACGA GAAAGCAGAG AGTAGCGTAC GGCTTTGCAC GTTTTGGATC CACAATATTC 
ATGGGGCTCT ACGACTTTGC GAGCTTCTAC ATCTACTGGC AGGTCTTCCG GCTCGACCCC
TTGCTGTCCG GCTACATGGG CGCGATAGGT AAGATAACCA TTATGGTTGC AAGCCTGCTG
GTAGGGTATC TCAGCGACTC CATATGGACC CGGTGGGGGC GGCGTAAGCC CTTCATAGCT
ACTGGGGCGC CCCTCCTGGC TTTCTCAGGG TTCCTTCACT TCACCCCTAT ATATTTCCTT
CAGGGAGCTA CACAGGAAGC CTTGTTCCTG TGGGGCGCCG CTACAAGCTC CATGTTCCAC
TTCTTCTACG CGTGGCTTTT GACACCCTTC CAGGCGTGGC TACCGGAGAT ATCCGAGCCC
TCCGAGAGGA TAGACATTTC GATGCTACAG AACGTGGCTA ATATACTCGG GAACATCGTC
AGCACGGTTA CGGGGTTCCT CGCAAAGATG CTGGTAGCAC GTGGCCTTCT AATGCCGCTA
GTAGGTGCGT ACGCTTTGGT TCTCGTAGCG CTCTTTACGC CACCGGTAGT CCTGCTACCT
GTTGAGAGGA GAGCGGAGCC CACGAGACCG TCCCTCAAAG ACCTTCTAAC CGTGTTCAGG
TACAGGGAGT ACATGAAGTG GATGGCTGTT CGAGGACTGA TGTCCTCCAG CGTTCAAATG
CTTACAATAA CGATAATCGC GTACATAGAG AAGGTGATCG GGGTAGAACA CTCGCTTGCC
TCGGCATCGT TCGGCGTTAT ACTCGTCGTC TTCGTAGCCG GCGCGTTCCC GCTATGGGGT
AGGCTGGGCA AGAGGGCGGG AAAGGGGAGG GCGCTCACGC TTTCAATGAC GTTGCTGGGA
GCTACGCTAC TCATGACCCC GCTCCCCCAC TTCGTCGACC CCGGTGTTTT CAGGGTGCTC
CTAGGCTACG TGCTGGTAGC CCTCGGCGCT GTCGCCGTCT CAGCGTACGT CCTATTCCCC
TACGCAGTGC TAGCTGACCT CGCGCACTGG TACGAGGTGC AGACCGGGGA AAAGCGGGCG
GGTCTCTTCA CGGGGTTCGA GGGGATACCT ATCAACGTGT TCGAGTCGCT GGCCTACCTA
GTCACGGGGT TCCTAATGAG CTTGCCGCAG GTGCCTGGCT CCGAGTACAC CTACGGGCTG
ATCCTGTGGG GACCCGTAGC TTCGCTGTTC ACACTGGTAG CTCTCGCCAT CCTAAGGAGA
ACGAACGTAG ACCCCTTCCT CGCCAAGAAG TAG
 
Protein sequence
MLTRKQRVAY GFARFGSTIF MGLYDFASFY IYWQVFRLDP LLSGYMGAIG KITIMVASLL 
VGYLSDSIWT RWGRRKPFIA TGAPLLAFSG FLHFTPIYFL QGATQEALFL WGAATSSMFH
FFYAWLLTPF QAWLPEISEP SERIDISMLQ NVANILGNIV STVTGFLAKM LVARGLLMPL
VGAYALVLVA LFTPPVVLLP VERRAEPTRP SLKDLLTVFR YREYMKWMAV RGLMSSSVQM
LTITIIAYIE KVIGVEHSLA SASFGVILVV FVAGAFPLWG RLGKRAGKGR ALTLSMTLLG
ATLLMTPLPH FVDPGVFRVL LGYVLVALGA VAVSAYVLFP YAVLADLAHW YEVQTGEKRA
GLFTGFEGIP INVFESLAYL VTGFLMSLPQ VPGSEYTYGL ILWGPVASLF TLVALAILRR
TNVDPFLAKK