Gene Moth_1652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1652 
Symbol 
ID3830940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1686589 
End bp1687767 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content59% 
IMG OID637829577 
Productaminotransferase, class V 
Protein accessionYP_430497 
Protein GI83590488 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.143138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00767612 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCAGGG TTTACCTTGA TCATAGCGCC ACAACTCCGG TAAGGCCCGA AGTCCTGGAG 
GCCATGTTAC CCTTTTTGAA GGATGAGGCC TTTGGTAATC CTTCCACCGT TTACAGCTAC
GGCCGGGAAG CGAAAAAGGC CCTGGAGGAA GCCCGGGAAA AGGTGGCCAA CCTCATCGGC
GCCCGGCCGG AGGAGATCTT CTTTACCAGC GGCGGCACGG AAGCCGACAA CCTGGCCCTT
ATCGGTACGG CTGCGGCCAA TGAAAAGAAG GGCCGTCACA TTATTACCTC CAGCATCGAA
CACCATGCCG TCCTGCACAC GGCCCAGTAC CTCCTGCGCC ACGGCTTTAA GGTAACCTTC
CTGCCGGTGA CCCCGGAGGG CCTGGTGCGG GTGGAGGACG TCGAAAAGGC CATTACCGAT
GAAACCATCC TCATCAGCGT CATGCATGTT AACAACGAAG TGGGTACCAT CCAACCCATC
AAAGAAATAG GGAAACTGGC CCGGGAACGG GGGATCATCT TCCATACCGA CGCCGTCCAG
AGCGTTGGCA AGCTCCCCGT TAATGTCGAC GAGCTGGGGG TGGACCTGCT GTCGGCCTCC
GGGCACAAGA TTTATGGCCC CAAGGGCATC GGCTGCCTTT ATATCCGCAA GGGGACGAAG
ATCAACCCCA TCCTTTACGG CGGTGCCCAG GAGCGTAAAC GTCGGCCTGG GACGGAGAAC
ATGCCCGGTA TTGTCGGCTT TGGCCGGGCA GCCGAACTGG CCGGCCAGGA ACTGGAGAGC
GAAATGGAGC GCCTGCAGGC CCTGCGGGAC AAGCTAATTG ACGGTATCTT GACACGTATT
GAAGACGTCC AGCTGAACGG TGATCCGCGG CAGCGGGTGG CCACCAATGC CAACTTCAGC
TTCCGCTATT GTGAGGGCGA ATCCATACTC CTGAGCCTGG ACATGAAGGG TATCTGCGCT
TCCAGTGGTT CGGCCTGTAC CTCCGGTTCC CTGGACCCGT CCCACGTCCT CCTGGCCATG
GGTATCCCCC ACGAAGTAGC CCATGGTTCG GTACGTATGA CCCTGGGCCG CGAAAATACA
GAAGAAGATA TTGACTACGT CCTGGAAGTC ATGCCGGAGA TAATAGCCCG GTTGCGTTCC
ATGTCACCCC TCTATGAGGA GGCCGCAGGG AAGAGGTAG
 
Protein sequence
MRRVYLDHSA TTPVRPEVLE AMLPFLKDEA FGNPSTVYSY GREAKKALEE AREKVANLIG 
ARPEEIFFTS GGTEADNLAL IGTAAANEKK GRHIITSSIE HHAVLHTAQY LLRHGFKVTF
LPVTPEGLVR VEDVEKAITD ETILISVMHV NNEVGTIQPI KEIGKLARER GIIFHTDAVQ
SVGKLPVNVD ELGVDLLSAS GHKIYGPKGI GCLYIRKGTK INPILYGGAQ ERKRRPGTEN
MPGIVGFGRA AELAGQELES EMERLQALRD KLIDGILTRI EDVQLNGDPR QRVATNANFS
FRYCEGESIL LSLDMKGICA SSGSACTSGS LDPSHVLLAM GIPHEVAHGS VRMTLGRENT
EEDIDYVLEV MPEIIARLRS MSPLYEEAAG KR