Gene Moth_2311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2311 
Symbol 
ID3831425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2431273 
End bp2432184 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content57% 
IMG OID637830235 
Producthypothetical protein 
Protein accessionYP_431141 
Protein GI83591132 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000443541 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAGCCCA AACACATTTA CGCCCTGATC CTGGTAGCCT TGATCTGGGG GCTGACCTTC 
CCGGCCATGA AGATCGGTAG CTTCTACCTG CCGCCCTTAT CCTTTGCCGC CTGGCGGTTC
TTCCTGGGGG CCCTCTGCCT GCTTCCCCTG GCCGGCAGGC GCCAGGGGAA ACTATGGCAT
GCCGGCCGGG ATTTCTGGCC TTTATTTCTT CTCGGCCTCC TGCAGACGGC CATTATGGGC
GGCGCTCTGC ACCTGGGTAT TAGCATGGTA AAGAGCGGTA TAACCTCGGT GGTTCTTTAT
AGCTACCCCT TCTTCTTTAC CTTCCTGGCT TTTTTATTAC TCCGGGAACC CCTGACAGGG
AAGCAAATGG CAGGCTTAAT TATCGGTTTC GCAGGTTTAA TCCTGGTCGT AGATCCCTGG
AAGATGCATC CGACCCATGC TGAATTTATC GGGATCCTGG TCCTCCTGGG GGGGTCCATC
GGCTGGGGCT TGGCCAGCGT TTACCTGAAG GCCGCCTTTA AAACCAGGGA TAAGCTGGAG
GTTACCACCT ACCAGATGTT CTACGGGTCC CTGGTGTTGA TGCTGGTAGC GGCCTTCGCC
GACCACGGCC TGCGCTTTTC CTGGACCGCC CCCGGTCTGG GCATCATGTT ATATACTGCC
CTTCTAACCT CGGCCCTGGG TTTCGTTATC CTCCTGACCA TCCAGGCCCG CTATCCGGCC
AGCCAGACCA GCGTTTATCT CTTCCTGGTC CCAGTCTTCG GCGTCCTCTT TAGTTCACTC
CTACTGGGAG AGAAATTAAC CCTCAACCTT TTGCTGGGCC TGGCTCTGGT AGCAGCTGGT
ATTATCACCG TCAACCTGGG GGCTCCGGTT CAGGCGCGGG AGAAGACCGA TTGTGTCGCA
TCCGGCAAAT AG
 
Protein sequence
MQPKHIYALI LVALIWGLTF PAMKIGSFYL PPLSFAAWRF FLGALCLLPL AGRRQGKLWH 
AGRDFWPLFL LGLLQTAIMG GALHLGISMV KSGITSVVLY SYPFFFTFLA FLLLREPLTG
KQMAGLIIGF AGLILVVDPW KMHPTHAEFI GILVLLGGSI GWGLASVYLK AAFKTRDKLE
VTTYQMFYGS LVLMLVAAFA DHGLRFSWTA PGLGIMLYTA LLTSALGFVI LLTIQARYPA
SQTSVYLFLV PVFGVLFSSL LLGEKLTLNL LLGLALVAAG IITVNLGAPV QAREKTDCVA
SGK