Gene Moth_1637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1637 
Symbol 
ID3831266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1672942 
End bp1674000 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID637829562 
Productaminodeoxychorismate lyase 
Protein accessionYP_430482 
Protein GI83590473 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.521882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGAAG AACTCGAAAC AAGTCATAAT TACATCATAG CCTTTGACCG GCCCTGGCGG 
CACCGGGTCA CACTATTACT TGCTTTAACA GCCCTGCTGG TGGGCCTGGG CTGGTACTTT
ACGACCCTTC TGGCTCCCAG GCACCCCGGG GGAGCGGCAA TCGAGGTGTC TATTCCGCCC
GGAGCGAGTT CGGCCACCAT TGCCGCCACC CTGGCGGAGA AGGGCCTTAT CCGCAGCCCC
CTGGCCTTCC GGCTGGTGGC CATGGCCCAG GGAGTGGACA GGCAGTTAAA GTCGGGCAGT
TATTTAATTT CTCCCGGCCT GCCCCTGCCG GCTATAACCC GCCTGCTGGC CAGCGGCAAC
ACCCTGGAGA TCGAGTTTAC CGTTCCGGAA GGGTATACCG TCCGCCAGAT TGCGTCCCTG
TTGCAACAAA AGGGACTGGT TAAGGAAGAG GATTTCTTAA AAGCCGCCGC CGGGGATTAT
CCTTTTTCTT TTCTCCAGGG CCTGCCTTCG GGGCCGGAAC ATGTGCAGGG TTTTCTCTTC
CCCGATACCT ACCAGGTAGC GCCGGGTACG CCAGCCCGGG AAATTATCAT GATGATGCTG
AACCGCTTTA ACCAGGTCTA CCAGGAGATT GCCCCGCAAA AGGATAAAGA CGTGGAATTC
AATATCCGTC AGATTGTAAC CCTGGCTTCC ATAGTCGAAA GGGAAGCCAA ACTCGACAAT
GAACGCCCCC TTATCGCCGG GGTATTTATC AACCGCCTCC GGCGGGGTAT GAGGCTGGAA
TCCTGTGCCA CGGTGGAATA TCTCCTGCCG GCGCCCAAAC CGGTCCTTTC CTACCAGGAC
CTGCAAATTG ATTCTCCTTA CAACACCTAC CGGGTAAAGG GCCTGCCTCC AGGCCCCATT
GCCAATCCGG GGCGGGCCTC ACTCCTGGCA GTCCTAAAGC CGAATCAAAC GGATTATCTC
TATTTTGTTG CCAAACCGGA CGGCTCCCAC TACTTCAGCA GCACCCTGGC CGAGCATAAC
CAGGCTACAG CCCGATACGA GTCAGGTAAC GGAAATTAG
 
Protein sequence
MGEELETSHN YIIAFDRPWR HRVTLLLALT ALLVGLGWYF TTLLAPRHPG GAAIEVSIPP 
GASSATIAAT LAEKGLIRSP LAFRLVAMAQ GVDRQLKSGS YLISPGLPLP AITRLLASGN
TLEIEFTVPE GYTVRQIASL LQQKGLVKEE DFLKAAAGDY PFSFLQGLPS GPEHVQGFLF
PDTYQVAPGT PAREIIMMML NRFNQVYQEI APQKDKDVEF NIRQIVTLAS IVEREAKLDN
ERPLIAGVFI NRLRRGMRLE SCATVEYLLP APKPVLSYQD LQIDSPYNTY RVKGLPPGPI
ANPGRASLLA VLKPNQTDYL YFVAKPDGSH YFSSTLAEHN QATARYESGN GN