Gene Moth_2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2344 
Symbol 
ID3832062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2464683 
End bp2465741 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content44% 
IMG OID637830267 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_431173 
Protein GI83591164 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.198374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAATT ACCATGAGAT TTTAGCAACA GCAATGTTCA GACCCATAAA TCCCCATGTT 
CCCGTTGTAT TATGGGCTAT AGGGCAGACT TATGCCCCCT TTGCTAAAAT ACCGGATAAT
GAGTACTATG CCGATCCGGC GAAAATGCTG GAAGCCCAGG TGAAGTTTTA CGAACGTTTC
CCTGATGTAT TAACCATTCC CGGAATTTGG CCGGATCTTG GTCTGATGGC GGAACTGGGG
GCTTTAGGAG CAGAGTTAGA GTTTCCGGAT GATGCCCCTC CTCAGTCCAG GGGTGGGGCC
TTTGAAGATA TCAGGGAAGT AGAAAATTGG GAAGTCCCCG ATCCTAAAAA GGCGGATTAT
ACCTCTCAGA CGCTAGATTA CCTTAAGTAT TTCTGTAAGC ATTTACCGGA AGAACCCAGG
AAGAAGTGGG GATTCTTAGA TGGGCACATT TTCTGCGGTG GACCTGGTGA GATCTCAGGG
TTATTACTCG GGTATGATAA ATTTTCTTAT GCAATGTATG ATTATCCGCA GCTCGTGCAT
ATTCTGCTGC GTAAGGTAAC TGATTTTATT AAAAGCTATA TTGATGCCCA AATAGAAATA
GTTGGGGAAC CTAAAAGGGT AATCATATGG GATCATATTC CCGGTATGTT GTCAAGAGAG
CTTTTTGATG AATTTATCCA TCCTTATATG AAAGAGGTGT TTACCCATGT AGAAAAGGCG
ACCCTGAGGA TATACCATAA CGAGAATAAT TACCCCCATC TCCTTGATTT AATGCGGGAT
ATCCCGGCCA ACGTCTGCCA TATCGGCCCC AAACACGATC TGGTCGAGAG CAAAAGGGTT
TTAAAAAAAT GTGTAATGGG AAACGTTCAT CCTATCCAGG AATTATTGCT GGGTACAAAT
GAGGAAATTG AAGCAAAGTG TAAAACCATA ATTGAAACTG CAGGAAGAGG TGGCGGTTTA
TGGCTTTCAA CCGGAGGCGG TATGGCCCCG GAAACACCGG TGGAGAAAAT GCAGGTCCTC
ATTGACTGTA CTAAAAAATA TCTGCCACCT TCGCTGTAA
 
Protein sequence
MLNYHEILAT AMFRPINPHV PVVLWAIGQT YAPFAKIPDN EYYADPAKML EAQVKFYERF 
PDVLTIPGIW PDLGLMAELG ALGAELEFPD DAPPQSRGGA FEDIREVENW EVPDPKKADY
TSQTLDYLKY FCKHLPEEPR KKWGFLDGHI FCGGPGEISG LLLGYDKFSY AMYDYPQLVH
ILLRKVTDFI KSYIDAQIEI VGEPKRVIIW DHIPGMLSRE LFDEFIHPYM KEVFTHVEKA
TLRIYHNENN YPHLLDLMRD IPANVCHIGP KHDLVESKRV LKKCVMGNVH PIQELLLGTN
EEIEAKCKTI IETAGRGGGL WLSTGGGMAP ETPVEKMQVL IDCTKKYLPP SL