Gene Moth_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2108 
Symbol 
ID3832475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2201319 
End bp2202719 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content64% 
IMG OID637830033 
Productaminodeoxychorismate synthase, subunit I 
Protein accessionYP_430943 
Protein GI83590934 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
[TIGR01824] aminodeoxychorismate synthase, component I, clade 2 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGCCCG TTGTTATCGA GGTCAAAGCG CCTGCCGACC CGGAGGCCCT CTATGCTGCC 
CTCGCTGGCG GCCGGGAAAA TAGCTTCCTA TTGGAAAGCG CCCTCTTTCA CCCCCGCCTG
GGTCGCTATT CTTTCCTTGG TAGCGCACCG TTCCTGGTCC TAAAAACCAG GGGCAACGAG
GCCCGGCTTT TCCGGCCCGG CGCAATGGTT GAAAGGATTC CCGGCAATCC CTGGGAGATC
ATCCGCTCCC TCCTGTCCCG TTACCGGTTA CCGCGGCACG ATCAACCTGT ACCTTTTACC
GGTGGGGCTG TAGGTTACCT GGCCTACGAC CTGGGCCGTT ACATCGAAAA GCTGCCGTCT
CTGGCCGCCG ATGACCTGCC CTTCCCGGAG GGTTACCTGG CCTTCTATGA CACCATCGTG
GCCATTGACC ACCAGGAAGG GAGGGTCTAT GTCGCTGCCA GCGGTTTCCC GGCATCCAGC
CCGGCCGCTG AAAAGGAAGC CCTGGCCCGG GCAAGGGAAA CAGCCGCCCT CCTGGCAAAG
GCTCGCCCCC TGCCGCCCCC TTCCCCCGCC AGCGGCGGAC GGGCGCCCAT CAGCTCCCTG
TTTACCCGGG AAGCCTACTG CCGGGCCGTC GACAGGGCCA GGGAGTATAT TGCCGCCGGG
GATATCTTTG AAGTCAACCT GTCCCAGCGC CTGCAGGCGC CCCCGGGCCT GGAGCCCTGG
GAGCTGTACC GCCGCCTGCG GCGGGTCAAC CCGGCTCCCT TTGCCGCTTA CCTGCCGCTC
AGGGAAGGGA CCATTGTTAG TGCCTCACCG GAAAGGTTTT TGCGGGTCAG CCGGGGCCAG
GTGGAGACCC GGCCGATTAA GGGTACCCGG CGCCGGGGGC ATACCCCGGC AGAGGATGAC
GCCCTGCGCC GGGAATTGTG GCAGAGCGCC AAGGACCGGG CGGAACTGGT GATGATCATC
GACCTGGAAA GGAACGACCT GGGCCGGGTC TGCCGGGCCG GGTCGGTACG GGTGCCGGAA
CTCTTTGTTC TGGAGGAGTA TGCCACCGTC TTTCACCTGG TCAGCACCGT AACCGGCACC
TTGGAACCGG GAAAGGATAT GGTGGATCTC TGGCGGGCCA CCTTTCCCGG CGGCTCCATC
ACCGGCGCCC CCAAGGTCCG CTCCATGGAG ATCATCGAAG AACTGGAGCC GGTGCGGCGT
AGCGTCTATA CCGGCGCCAT CGGCTACCTG GGTTTTGACG GCGAAGCCGA CTGGAATATC
GTTATCCGCA CCTTTCTCCT GGCAAACAAC CAGGCCTATT TCCAGGTCGG GGGCGCCGTC
ACAGCCGACT CCGATCCCGG GGGAGAGTAC GAGGAAACCC TGGACAAAGC CCGGGGCCTC
ATCCAGGCGC TGGAGTTATA A
 
Protein sequence
MLPVVIEVKA PADPEALYAA LAGGRENSFL LESALFHPRL GRYSFLGSAP FLVLKTRGNE 
ARLFRPGAMV ERIPGNPWEI IRSLLSRYRL PRHDQPVPFT GGAVGYLAYD LGRYIEKLPS
LAADDLPFPE GYLAFYDTIV AIDHQEGRVY VAASGFPASS PAAEKEALAR ARETAALLAK
ARPLPPPSPA SGGRAPISSL FTREAYCRAV DRAREYIAAG DIFEVNLSQR LQAPPGLEPW
ELYRRLRRVN PAPFAAYLPL REGTIVSASP ERFLRVSRGQ VETRPIKGTR RRGHTPAEDD
ALRRELWQSA KDRAELVMII DLERNDLGRV CRAGSVRVPE LFVLEEYATV FHLVSTVTGT
LEPGKDMVDL WRATFPGGSI TGAPKVRSME IIEELEPVRR SVYTGAIGYL GFDGEADWNI
VIRTFLLANN QAYFQVGGAV TADSDPGGEY EETLDKARGL IQALEL