Gene Moth_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2118 
Symbol 
ID3833269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2214131 
End bp2215447 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content44% 
IMG OID637830043 
Producturacil-xanthine permease 
Protein accessionYP_430953 
Protein GI83590944 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2233] Xanthine/uracil permeases 
TIGRFAM ID[TIGR00801] uracil-xanthine permease
[TIGR03173] xanthine permease 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.398858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGGG GGAAAACGGT GAGTAGTCGT TTTATTGGTG TGGACGAAAA ACCTGCCCTC 
CCATACTTAA TTATGTATGG GATCCAGCAT GTCCTGGCTA TGTTTGCCGG TATCGTAGCT
GTACCCCTAA TGGTTGGCAC TGCATTAAAA TTACCCGGCG AACAGATTAC TATTCTCGTT
CAAGGTTCGC TTTTAACCAG TGGGATTGGG ACATTAGTCC AATCCCTTGG TATTGGTAGG
CTGGGTGCTC GCCTCCCTAT ATGTATGGGT ACAGCATTTG TCTTTATATC GCCATTTATA
AGCGTAGGTT CTCAATTGGG TATCCAGGCT ATTCTTGGAG CAGTTATTGT TGGTGGTATC
GTTGAATTTG TTTTTAGCTT TTTCGTTTGG CGTATTCAGA AATATGTACC TCCAGTTGTT
ACAGGGACCG TTGTTGTCTT GATTGGTATG GGATTGATGC CCCTAGGTTT TACTTGGCTA
GCAGGTGGGG AAAGCTCGCT GTTTGGTCAG CCAATAAGTT TTGCTATCGG TGGTTTAGTT
TTAGTTATAC TGATTTTGAT TAACCAATTT ACAAAAGGAT TTTGGCCTTC TATATCTGTT
GCTCTGGCCA TTGTTGTAGG TTACCTGGCT GCTGGTATCG CTGGTGTTTT GAATCTTGGG
TTGGTCAAGG AGGCAACATG GTTTGCTATA CCAAAAGTTT TTTCCTTTGG ACTGCCAAAG
TTTTCGTTTC CTGCTATTTT AGCGGTTCTG GTAGCACAAT TTGCCTCTAT GCTGGAGACT
ATAGGGGATA CTTATGCAAC AGGCTTAGTA GCTCATAAGG AAATAGGTCG GCGCGAGTTA
TCCGGAGCAA TCAGTGTTGA CGGTTTGCTG TCATCAATTG CAGTTTTATT CAATGGGTTA
TCAATCACTT CCTTTAGCCA GAACATTGGG GTCATAAGTA TTACAGGAGT GGCCAGTCGC
TTTGCTGTAG CTGTCAGTGG CATTATATTA TTGCTTATGG GCCTCGTACC AAAATTTGCG
GCTCTGATTG CAAGTATGCC CGCGCCAGTA CTTGGTGGTG CTGCCCTGGT AATGTTTGGC
GCAATTGCAG GATCAGGTAT TTTACAGTTC CGCGAAGCAA AAGTTTTTGG GGAACGGGAG
ATTTTCATTT TTGCTATCTC AGTTGCCCTG GGGATGGGTT TTGGGTTGCA TCCAGAAGGT
GCACTAGAGC ATTTACCATC TTACCTTACA GTTATCCTGG GATCCGGTGT TGCGGTAGGG
GGGATAACGG CCATTATACT TAATCAGCTA TTGCCACTCA GACAGAAAGA AGAATAA
 
Protein sequence
MKRGKTVSSR FIGVDEKPAL PYLIMYGIQH VLAMFAGIVA VPLMVGTALK LPGEQITILV 
QGSLLTSGIG TLVQSLGIGR LGARLPICMG TAFVFISPFI SVGSQLGIQA ILGAVIVGGI
VEFVFSFFVW RIQKYVPPVV TGTVVVLIGM GLMPLGFTWL AGGESSLFGQ PISFAIGGLV
LVILILINQF TKGFWPSISV ALAIVVGYLA AGIAGVLNLG LVKEATWFAI PKVFSFGLPK
FSFPAILAVL VAQFASMLET IGDTYATGLV AHKEIGRREL SGAISVDGLL SSIAVLFNGL
SITSFSQNIG VISITGVASR FAVAVSGIIL LLMGLVPKFA ALIASMPAPV LGGAALVMFG
AIAGSGILQF REAKVFGERE IFIFAISVAL GMGFGLHPEG ALEHLPSYLT VILGSGVAVG
GITAIILNQL LPLRQKEE