Gene Moth_0699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0699 
Symbol 
ID3832700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp729145 
End bp730221 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content48% 
IMG OID637828631 
ProductABC transporter substrate-binding protein 
Protein accessionYP_429561 
Protein GI83589552 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000602744 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000163268 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCATTT ATGAAAAAGG TATGAATCGG AGGGAGTTTA TTGTCAAAAG TCTGATGGCG 
AGTGGGTTGG TAGCAGGTAG CTCTCTGTTG CTGAGCGGGT GCTCATCGGG CTCAACGGGT
ACTACTGCAA AAGAAGGCAA ACGCTTGAAG GCGGCATTCT CCAATGCTGG ACTCCAAGCC
ACCTGGTGCG CCCAGGGCAA GGATACTGTG GAACGTTGGG GTAAATGGTT GGGAGTAGAT
ATTACCTGGT ATGACGGCGC TCTTAGTGTA GACAAGCAGC GTGCGGCCGT AGAAGATATG
GCAACTAAAG ACTGGGATTT TGTCGCTATC CAACCATTGG GTATCGGTAC TTTAAATGAG
CCGGTCAAAA AGATGCTGGA ACGCGGCATC CCCGTAATCG ATATGGATAC CATGATCGCT
CAACCGGGTG AATTGCCTAT CACCTGCTTT ATCGCTCCGG ATAATGTGTG GGGAGCTGAA
CAGGTAACCG AAGCCCTGAT GCAGGCTATT GGTGGGAAAG GTAATGTGGT AATGACTCAA
GGGTCTTTAG GGCATACAGG AGCACAAGGC CGCGCCCAAG GCTTCCATAA TGTCATAAAA
CGTTATCCTG ATGTTAAAGT GATAGATGAA ACGCCAGCCG ACTTCGACGT CAATAAAGTT
GCCCAGATTT GGGAAAATTT GCTTAATCGT TACGATAAAA TTGATGCCGC GTATTTTCAT
AACGACGATA TGGCCCTGGC AGCCTACCAG GTTATTAAAA ATGCCGGCAG GGAAAAAGAA
ATTAAAATTG GTGGTAATGA TGGTATGCAA CCGGCAGTAG AAGCCGTCCA AAAGGGTATT
ATGGTTGCTA CGGCCCGCAA TTCGGCACCA CGTATTCACT GGGGTGCATT GATGATTGGG
TATTATGCTG CTACTGAAAA AGATGCGAAT AAAAAAATTC CACCCTTTAT TCTGGCTGAT
GGTCCAATTA TCACCTACAA TGTAGACCAG AGTAACAAGC AACCTTGGCT CAACAAGGGC
TACGGCCAGT CTCTCGCTCC TGGCCTGATC TGGCAAGAAG ATCACTTTAT GGTCTAA
 
Protein sequence
MSIYEKGMNR REFIVKSLMA SGLVAGSSLL LSGCSSGSTG TTAKEGKRLK AAFSNAGLQA 
TWCAQGKDTV ERWGKWLGVD ITWYDGALSV DKQRAAVEDM ATKDWDFVAI QPLGIGTLNE
PVKKMLERGI PVIDMDTMIA QPGELPITCF IAPDNVWGAE QVTEALMQAI GGKGNVVMTQ
GSLGHTGAQG RAQGFHNVIK RYPDVKVIDE TPADFDVNKV AQIWENLLNR YDKIDAAYFH
NDDMALAAYQ VIKNAGREKE IKIGGNDGMQ PAVEAVQKGI MVATARNSAP RIHWGALMIG
YYAATEKDAN KKIPPFILAD GPIITYNVDQ SNKQPWLNKG YGQSLAPGLI WQEDHFMV