Gene GWCH70_2143 details


Gene Information

Locus tag: GWCH70_2143
Symbol:
ID: 7976953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2209318
End bp: 2210844
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 44%
IMG OID: 644798959
Product: anthranilate synthase component I
Protein accession: YP_002950119
Protein GI: 239827495
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
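
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 1527 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(2209318, 2210844)
print(length)                  # 1527
print(protein_length(length))  # 508
```

Both values agree with the Gene Length and Protein Length fields of this record.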


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.105763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACT GCACAATCAC CACTTTTTTG GAAGATGCGG TTCATTTCCA GACCATTCCG 
ATTGTACGCC GCTTTTTTGC AGATGTGTTT GAACCAGTGA AAATATTTGA AAATTTAAAA
AGTGAAGCGG TGTTTTTGCT AGAAAGCAAA GATGATCAAT CGCCATGGGC CCGCTATTCC
TTTATCGGTT TATCGCCGTT TTTAACGATT GAGAGTGAAA CCGGCCATAC GTTTTCTGTG
ATGGATGAGC GCGGCGAAGA AATGATGAAA GCTTCTTCGT TAAAAGAAGC ATTTCTGTTC
ATTGAGCAGA AGCTGCGCGT CAAGCAGCTT GCAAGCGAGA TACCATTTAC CGGCGGCGCG
GTCGGTTTTT TAGGATATGA TTTCATTTCC GCAATCGAAA AAGTCCCAAT TCATTCCAAT
CGCGATATTT CCTTGAAGAC GGGTTATTTT ACGTTTTGTG AATCGTTGAT CGTGTTTGAT
CATCATAAGC GTGAGATTGT GTTTATTCAT TATGTCCGCA TATCCGACAA GGACACAGAA
GAAGCGAAGA AGAGAAAGTA CGGGGAAGGG TTAAAGCGGA TCGAAACATT GATGAAAAAA
GCAGCAAGCG GAAGGGAAGA GCCATTGTTA TTGCTTCATC ACGATGACGA AGAAACGCGT
GTTTCCTTTC AAGGTGTCAT ATCCAATTAT GACAAAGCAT CATTTATGAG GGATGTAGAA
ACCATTAAAA GCTATATCGC CAACGGAGAT GTTTTTCAGG CGGTATTGTC CCAGCGATTT
ACCGTTCCGA TTCAAGTGAG CGGGTTTCAC ATTTATCGAA TGCTGCGTCA TATTAATCCG
TCGCCATATA TGTTTTATTT TCAGCTAGAC GGTATCGAAA TTGTCGGAAG CTCTCCGGAA
AAGTTAATTC AAGTGCATAA CCGTCATATG GAAATCCACC CGATCGCCGG GACTAGAAGA
AGAGGACGGT CGGCAGAAGA AGATGAGCAT CTTCAAAGGG AGCTTTACAA TGACCCGAAA
GAGAGAGCCG AACATTATAT GTTAGTAGAT TTAGCCCGCA ACGATATCGG AAAAGTAGCG
AAATATGGTA CAGTCGAGAC GCCAGTGTTA ATGGAAATTG GAAAGTTCTC CCATGTCATG
CATCTGATTT CGAAAGTAAC GGGTGTGTTA AAGGAAGGAA TTCACCCGAT TGACGCGCTG
TTAGCGGCCT TTCCAGCCGG AACGGTAAGC GGTGCGCCGA AAGTAAGGGC GATGCAAATT
TTACAGGAGC TAGAACCGAC GGCAAGAAAT TTATATGCTG GAACGATTGC CTACATCGGT
TTTGACGGCA ATATTGATTC ATGTATCGCG ATTCGCACTG CGATTGTAAA AGACGGTTAT
GCTTACGTGC AAGCAGGCGC GGGAATTGTC GCGGATTCCG TTCCAGAATT GGAGTGGAAA
GAAACGCGCA ATAAAGCGAG CGCCTTAATC AAAGCGATGG AACGTGCCGA ACGATTGTTT
GCGAAAGGAG AGAATATATA TGTTTAA
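
The gene sequence is printed in groups of 10 bases, so it has to be joined before any calculation. The following is a minimal sketch of how the block could be cleaned and how a GC percentage could be computed from it. The helper names are hypothetical, and only the first row of the sequence is used as sample input; the full block can be passed the same way.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAAAACT GCACAATCAC CACTTTTTTG GAAGATGCGG TTCATTTCCA GACCATTCCG"
seq = clean_sequence(first_row)
print(len(seq))                   # 60 bases in this row
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Applied to the complete sequence, the result can be compared with the GC content field of the record.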
 
Protein sequence
MKNCTITTFL EDAVHFQTIP IVRRFFADVF EPVKIFENLK SEAVFLLESK DDQSPWARYS 
FIGLSPFLTI ESETGHTFSV MDERGEEMMK ASSLKEAFLF IEQKLRVKQL ASEIPFTGGA
VGFLGYDFIS AIEKVPIHSN RDISLKTGYF TFCESLIVFD HHKREIVFIH YVRISDKDTE
EAKKRKYGEG LKRIETLMKK AASGREEPLL LLHHDDEETR VSFQGVISNY DKASFMRDVE
TIKSYIANGD VFQAVLSQRF TVPIQVSGFH IYRMLRHINP SPYMFYFQLD GIEIVGSSPE
KLIQVHNRHM EIHPIAGTRR RGRSAEEDEH LQRELYNDPK ERAEHYMLVD LARNDIGKVA
KYGTVETPVL MEIGKFSHVM HLISKVTGVL KEGIHPIDAL LAAFPAGTVS GAPKVRAMQI
LQELEPTARN LYAGTIAYIG FDGNIDSCIA IRTAIVKDGY AYVQAGAGIV ADSVPELEWK
ETRNKASALI KAMERAERLF AKGENIYV
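
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds a codon table by hand rather than using a library; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the standard mapping is used here. The function name is hypothetical, and only the first and last rows of the gene sequence are translated as a spot check.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGAAAAACT GCACAATCAC CACTTTTTTG GAAGATGCGG TTCATTTCCA GACCATTCCG"
last_row = "GCGAAAGGAG AGAATATATA TGTTTAA"
print(translate(first_row))  # MKNCTITTFLEDAVHFQTIP
print(translate(last_row))   # AKGENIYV
```

Both results match the start and end of the protein sequence listed in this record. The last row happens to begin on a codon boundary, which is why it can be translated on its own.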