Gene Moth_2478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2478 
Symbol 
ID3831212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2584369 
End bp2585664 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content46% 
IMG OID637830397 
Producturacil-xanthine permease 
Protein accessionYP_431303 
Protein GI83591294 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2233] Xanthine/uracil permeases 
TIGRFAM ID[TIGR00801] uracil-xanthine permease
[TIGR03173] xanthine permease 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000102506 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00126456 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTACCGA CCGGACAACT TTTTCTGTAC GGGTTACAGC ATGTACTGGC TATGTATGCT 
GGAGCTGTCG CTGTACCGCT GATCATTGCC GGTGCTGCTC ACTTAACGAA AGCGGAAACG
GCGTTTTTAA TCAATGCCGA TATGTTTACC TGCGGTATTG CGACTTTGAT TCAAACCCTT
GGCTTCTGGA AACTGGGTAT CCGTTTACCA GTCATTCAAG GAGTTTCCTT TGCCGCTGTA
GCACCCATGG TTATTATCGC CAGGAGCATG GGGATGGAAG CTGTTTATGG TGCGGTAATA
GTTGCCGGTC TGATTGCGTT TTTCCTTGCG CCTTATTTTA GCAAGCTTTT GCATTTCTTT
CCCCCAGTTG TCACCGGCAG TGTAATAACT ATTATTGGAA TATCCTTGCT TCCGGTTGGT
GTGGAATGGG CTGCTGGAGG CACAGGTAAT GCAAATTACG GGGCGCTAAC TAATCTTTTT
ATAGCAGGAA TTGTCCTTCT GGCCATTCTT TTGATCCAAA AGTATTTTAA AGGCTTTATT
GCCAATATTT CCGTCTTATT AGGTCTATTT ATCGGTATGC TTATTGCAAT TCCCCTGGGC
TTGGTCAACT TTTCCGGCGT AACCACAGCC CCCTGGTTGG GAATCGACAG ACCATTTTAT
TTCGGATTCC CCAAGTTTGA TTGGGGAGCT ATTGGAGCCA TGATACTGGT TATGCTGGTA
ACGATGGTGG AATCCACAGG TGATTTCTTG GCCCTGGGGG AGATTGTTGG TAAGCAAATC
GATGAAGAAG ATTTAGCGCG GGGCTTAAGA GGCGATGGGT TTGCCACCAT GCTCGGTGGG
GTACTGAATG CCTTCCCCTA TACTGCCTTT GCACAAAACG TTGGACTGGT AGGTTTGAGT
GGCGTGAAAA GTCGTTTTGT AGTGGCAACA TCTGGTATTA TTCTGGCTGC CCTGGGATTA
TTTCCGAAAC TGGCCACTAT CATTGCCTCC ATACCTTATG CTGTTTTGGG GGGAGCCGGT
ATTGCCATGT TCGGGGTTGT GGCAGCCAAC GGGATAAAAA CCCTTTCTCG CGTTGATTTC
GAAAACAACC CCCATAATAT CTTCATCGTG GCCATTAGTA TAGGTGTTGG CCTGATCCCA
ATGGTTGCAC CAGACTTCTT CAAAATGTTC CCCGCCTGGA GTCAGATTAT ACTCCACAGT
GGTATTACTC TTGGCTCCTT AACTGCGATC ATTCTCAACA TTTTCTTTAA CTATCCCAAC
TCATTAACGT TGTATAAAAA GTCTTTCACT AATTGA
 
Protein sequence
MLPTGQLFLY GLQHVLAMYA GAVAVPLIIA GAAHLTKAET AFLINADMFT CGIATLIQTL 
GFWKLGIRLP VIQGVSFAAV APMVIIARSM GMEAVYGAVI VAGLIAFFLA PYFSKLLHFF
PPVVTGSVIT IIGISLLPVG VEWAAGGTGN ANYGALTNLF IAGIVLLAIL LIQKYFKGFI
ANISVLLGLF IGMLIAIPLG LVNFSGVTTA PWLGIDRPFY FGFPKFDWGA IGAMILVMLV
TMVESTGDFL ALGEIVGKQI DEEDLARGLR GDGFATMLGG VLNAFPYTAF AQNVGLVGLS
GVKSRFVVAT SGIILAALGL FPKLATIIAS IPYAVLGGAG IAMFGVVAAN GIKTLSRVDF
ENNPHNIFIV AISIGVGLIP MVAPDFFKMF PAWSQIILHS GITLGSLTAI ILNIFFNYPN
SLTLYKKSFT N