Gene Moth_1232 details


Gene Information

Locus tag: Moth_1232
Symbol:
ID: 3833173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1271178
End bp: 1272503
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 53%
IMG OID: 637829167
Product: xanthine/uracil/vitamin C permease
Protein accession: YP_430089
Protein GI: 83590080
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
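
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1271178
end_bp = 1272503
gene_length_bp = 1326
protein_length_aa = 441


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1326
print(protein_length(gene_length_bp))  # 441
```

Under these assumptions, 1272503 - 1271178 + 1 gives 1326 bp, and 1326 / 3 gives 442 codons, of which one is the stop codon, leaving 441 amino acids.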


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.444446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGAC AACAAGAAGG GGCGGGCTTC CTGGAAAAGA CCTTTAAGTT GAGGGCCAAC 
GGTACTGACG CCCGAACCGA AGTGCTGGCC GGCGTTACAA CCTTTATGAC CATGGCCTAC
ATTATCTTTG TAAATCCGAC GATTCTCAGT AGCACCGGCA TGGATTTCGG CGCGGTAATG
GTGGCTACCA TCCTTTCAGC GGCCATAGCC ACCCTGATCA TGAGTTTTAG CGCCAATTAC
CCTATTGCCA TTGCGCCCGG TATGGGTCTC AACGCCTTTT TTGCCTTTAC TATTGTAAAG
CAAATGCATT ACCCCTGGGA AGTAGCCCTG GCAGCAGTAT TTATGAGCGG CGTTATCTTT
ATCATCTTGA CCCTTACTAA AGCCCGGGAG GCTATTGTTA ACTCCATCCC CCTGTCCCTC
AAGCTGGCGA TCAGTGCCGG TATCGGCCTT TTTATCGCCC TGATCGGCCT GCAAAATGCC
GGCCTGGTAG TGCCCAATCC CGATACCCTG GTTCAACTGG GCGACTTGAG TAAGCCTCCC
GTCCTGCTGG CGGCCATGGG CCTGGTAATT ACGGCCCTCC TGGTGGCCCT CCGGGTCCGG
GGGGCACTGC TCCTGAGTAT CATTATCATC ACTATAATCG GCATCCCCAT GGGAGTTACC
AAAATCGACA GTTTCAAGCT CCTGAGCCTG CCGCCCAGCC TGGCTCCTAC ATTCGGGGCC
TTTACCAGGG GCCTGCCGGG CCTATGGGCC ACCGGTCTCA TTCCCATAAT TTTTACCTTT
ACCTTTGTCG ACCTCTTCGA TACCATCGGT ACCTTGATCG GCGTCAGTAG TAAAGCTAAC
TTACTGGATG AAAACGGCAA CCTGCCCAGG GCCGGCAAAG CCCTGATCTC CGACGCTGTA
GGTACCACCC TGGGTGCCAT CCTGGGAACC AGTACCCTGA CAGCCTATAT CGAGAGTGCT
GCCGGCGTAG CCGAAGGTGG GCGTACAGGA TTGACCAGTC TGGTAGTTGC TATTTTATTC
CTGGCTTGTT TGTTTATCTC GCCCCTGGTG GGCATCGTAC CGGCGGTAGC TACCGCGCCC
ATCCTGATCA TCGTCGGTAT TTTTATGATG GAACCGATCA TGAAAATCGA TTTTAGCAAT
TTCCTGGAAG CAGCCCCGGC CTTTTTAACC ATTGCTATGA TGCCCTTTAC CTATAATATT
GCCGAGGGTA TCGTGTGGGG CGTCCTGGCC TACGTCTTCC TGCACCTGGT TACCGGCAAT
ACGAAAAAGA TCAGCATTAC CATGTGGATT CTGGCGGTTC TCTTTATCAT TCGTTTCTTT
GCTTAG
 
Protein sequence
MSGQQEGAGF LEKTFKLRAN GTDARTEVLA GVTTFMTMAY IIFVNPTILS STGMDFGAVM 
VATILSAAIA TLIMSFSANY PIAIAPGMGL NAFFAFTIVK QMHYPWEVAL AAVFMSGVIF
IILTLTKARE AIVNSIPLSL KLAISAGIGL FIALIGLQNA GLVVPNPDTL VQLGDLSKPP
VLLAAMGLVI TALLVALRVR GALLLSIIII TIIGIPMGVT KIDSFKLLSL PPSLAPTFGA
FTRGLPGLWA TGLIPIIFTF TFVDLFDTIG TLIGVSSKAN LLDENGNLPR AGKALISDAV
GTTLGAILGT STLTAYIESA AGVAEGGRTG LTSLVVAILF LACLFISPLV GIVPAVATAP
ILIIVGIFMM EPIMKIDFSN FLEAAPAFLT IAMMPFTYNI AEGIVWGVLA YVFLHLVTGN
TKKISITMWI LAVLFIIRFF A
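
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship on the first row of the gene sequence only. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as starts; since this gene begins with ATG, alternative start codons are not handled here. The `translate` helper and constant names are hypothetical, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record (60 bases, 20 codons).
first_row = "ATGTCTGGAC AACAAGAAGG GGCGGGCTTC CTGGAAAAGA CCTTTAAGTT GAGGGCCAAC"
print(translate(first_row))  # MSGQQEGAGFLEKTFKLRAN
```

The output matches the first two blocks of the protein sequence above. Passing the full gene sequence to the same function should stop at the final TAG codon.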