Gene Tpen_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1203 
Symbol 
ID4600405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1141701 
End bp1143098 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content61% 
IMG OID639773979 
Productcitrate transporter 
Protein accessionYP_920604 
Protein GI119720109 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTGG AGCGCGCGGC CTCCCAGCAG TACGCCTTCA GGGTTCTCCT AGTGCTGGTA 
GCCGGTCTGG TCACCGCTCT CGCGTCCTCG CTCCTCGGGT TGGAGGCCCA GCAAGTGCTC
GCCCTGACAG CCTTCCTGAT GACGATATAC GCGACCCTGC TACTCTGGAC CTACAGGCTC
CCCTTCGCGT TCCTGGGGGT CTCCGCCCTC TTCCTGCTCG GCGTGCTCGA TGTAGAGTAC
TTTGTCGAGC ACTCGCACCT GGACGTGATA GCCTTCCTGA TAGCGATGAT GACTATCGTG
GGCTACCTGG AGGAGGACAG GTTCTTCGAG TTCATCGCCC AGGAGATCGT GAGGAGGGTC
GGTGTAAACT TCAGGGCAAC GTTCCTGGTG GTAGTCTTCC TGTCCGGCTT CCTGGCCCCG
CTGGTCGACG AGGTTACCTC GATACTCGTT ATGCTGTCCG TAGTGCTCCC GCTGAGCGGA
AAGATAGGCG TCGACCCCCT ACCGCTAGTC ATTGCCTCCA TCTTCGCGAC GAACATAGGT
AGCGCTATGA CCCCGCTCGG GAACCCTGTG GGCGTTCTCG TGGCGTTCGA GTCCGGGCTG
ACCTTCTCGG ACTTCCTGGC GCGGGCCGCG CCCGTCTCCG CGCTGTCCCT GGTGGTAGCG
GCGGCTATAC TCATGCATTT GTTTAGGGGG TACATCGAGG AGGGAAACGC CCTCGCCTCG
CAGAGGTTTA CCGATGGGTG GAGCGTGGCA TCCCTGGAAA GGAGAACCCT CTACAGGGAC
GCGTCCGTGT TCTCCGCTAC GATACTCTTC ATAGCCGCGC ACCACGTTTT AGAGGAGGCT
CTCGGCTTGC CGAAGAACTC CCTCCTCCTA GCAGCCCCAC TGATGGTGGC TGGGCTCATA
ATGTTGCTAG ACCCTTCGAG GGGGTTTCAC GCGCTGGAAA CTAAGGTGGA GTGGCCTACC
CTCGTATTCT TCTTGTTGCT CTTCGCATCG GTCGGAGCCT TGGAGAAAAC GGGCGTCGTA
GAGGTTCTGT CGAAGAGTCT AGGCTCTCTG TCCGCGTCGG GGGTAGGCGC CTTCATGGGA
GCGTTCACGC TTTCTTCCTC CCTTATGAGC GCCTTCATGG ACAACGTGAT CGCCGTCGCG
ATTCTATCCC GGGTTGTACA CGAGCTAGGC GCCCAGGGGT TCCACACAGA GCCGTTCTGG
TGGCTGACGC TATTCTCGGC CGTCTACGCC GGGAACCTTT CACCGATAGG TAGCACTGCG
AACATAGTGG CGCTCAGCGT CCTGGAGAAA AGGCTGGGCA GGTCCGCCGG GTTCAAGGAG
TGGCTGAGAG TCGGGCTACC GGTTACGGCG GCAACCCTCG CCCTGGGCTT CGCGGCTGTC
TACCTCCAGA TCCCGTAG
 
Protein sequence
MVVERAASQQ YAFRVLLVLV AGLVTALASS LLGLEAQQVL ALTAFLMTIY ATLLLWTYRL 
PFAFLGVSAL FLLGVLDVEY FVEHSHLDVI AFLIAMMTIV GYLEEDRFFE FIAQEIVRRV
GVNFRATFLV VVFLSGFLAP LVDEVTSILV MLSVVLPLSG KIGVDPLPLV IASIFATNIG
SAMTPLGNPV GVLVAFESGL TFSDFLARAA PVSALSLVVA AAILMHLFRG YIEEGNALAS
QRFTDGWSVA SLERRTLYRD ASVFSATILF IAAHHVLEEA LGLPKNSLLL AAPLMVAGLI
MLLDPSRGFH ALETKVEWPT LVFFLLLFAS VGALEKTGVV EVLSKSLGSL SASGVGAFMG
AFTLSSSLMS AFMDNVIAVA ILSRVVHELG AQGFHTEPFW WLTLFSAVYA GNLSPIGSTA
NIVALSVLEK RLGRSAGFKE WLRVGLPVTA ATLALGFAAV YLQIP