Gene Moth_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0199 
Symbol 
ID3832272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp195342 
End bp196490 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content66% 
IMG OID637828135 
Productaminotransferase, class V 
Protein accessionYP_429077 
Protein GI83589068 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCATTA TTTACCTCGA TAACAGCGCC ACCACAGCGG CTTTACCCGA AGTAGCAATC 
GCCGTGAAGG AAATGCTGAC GGAAAACTAC GGCAACCCCT CTTCCCTCCA CGGCCTGGGG
ATAAAGGCGG AAAAGGCCCT GGGCGAAGCC CGGCGCCAGG TGGCCGGCCT CATCGGCGCC
CGGCCCACGG AGATCTACTT CACCTCCGGC GGCACCGAGG CCAACAACTG GGCCCTGCTG
GGGATAGCCC GGGCACGGCG GCGCCAGGGC AGGCACCTGA TCACCACGGC CATCGAACAC
CCCTCCATCC TGGCCACCTG TCGGCGGCTG GAGGCCGACG GCTTTGAAGT AACCTACCTG
CCGGCAGACG CCCGGGGGGT CATCCGCCTG GCCGACCTGG AAGCGGCCCT GCGGGAGGAC
ACCATCCTGG TGAGCGTCAT GAGCGTTAAC AACGAGGTGG GTTCCCGGCA ACCTGTAGCT
GACATCGCCC GCCTGGTCCA CAGCCGCAGC CGGGCCGTCC TGCACGTCGA TCACATCCAG
GGCTACGGCA AGATACCCTT GAACTGCCAT GAAGCCGGCA TCGACCTGAT GTCCTTAAGC
GGCCATAAAA TTCACGGGCC CAAGGGCGTG GGCGCCCTGT ACATAAAGGA AGGTTTGCGG
CTGGAGCCCC TGCTGACCGG CGGCGGTCAG GAGGCCGGCC AGCGCTCTGG TACCGAGAAC
ACTGCCGGCA TCGCCGGTTT CGGCGTCGCC GCCCAACTGG CCGCAGCCGA CTTTGCCCGG
CGGACTGCCA GGATGCAGGA GATAAAGCTC GAACTCGCCC GGCGGCTGGT GGCCGAGATC
CCCGGCGCCG TCATCAACGG CCCGCCCCCC GAGGAAGGCG CCCCTAACAT CATAAACGTC
TCCTTCCCGG GGGTGCGGGC CGAGGTCCTG GTCCACATGC TGGAGCAGCG GGGCATCTAC
GTCTCCACCG GCTCGGCCTG CCACTCCCGC AGGGAGAGCG CCAGCCACGT CCTCCAGGCC
CTGCACCTGG AACGCTGGCG CCAGGACGGC GCCATCCGCA TCAGCCTGGG GGCCTTGAAC
CGGCTGGAAG AGGTGGAACC TACCGTGGAG GCCTTTAAGG AATGCGTGCA AGAATTGTGG
TCGTTATAG
 
Protein sequence
MAIIYLDNSA TTAALPEVAI AVKEMLTENY GNPSSLHGLG IKAEKALGEA RRQVAGLIGA 
RPTEIYFTSG GTEANNWALL GIARARRRQG RHLITTAIEH PSILATCRRL EADGFEVTYL
PADARGVIRL ADLEAALRED TILVSVMSVN NEVGSRQPVA DIARLVHSRS RAVLHVDHIQ
GYGKIPLNCH EAGIDLMSLS GHKIHGPKGV GALYIKEGLR LEPLLTGGGQ EAGQRSGTEN
TAGIAGFGVA AQLAAADFAR RTARMQEIKL ELARRLVAEI PGAVINGPPP EEGAPNIINV
SFPGVRAEVL VHMLEQRGIY VSTGSACHSR RESASHVLQA LHLERWRQDG AIRISLGALN
RLEEVEPTVE AFKECVQELW SL