Gene Moth_1342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1342 
Symbol 
ID3831899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1387653 
End bp1389152 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content60% 
IMG OID637829278 
Productanthranilate synthase, component I 
Protein accessionYP_430198 
Protein GI83590189 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00981315 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCACC CTGACCGGGC AACCTACCTG GACCTGGCCC GGAAGGCCCC TTATATTCCT 
GTCTGGACAG AGATTCTTGC CGATACAGAA ACTCCCATTT CCCTGTACTT AAAATTCGCC
GGCGGTCCGG ACAGCTTCTT ACTGGAGAGT GTCGAAGGCG GGGAAAACCT GGGCCGCTAT
TCCCTCATTG GCTTCGATCC CCTGCTGACC TTCACCGCCA GGAGTGGTAA GGCTTATTTA
AGCACGAATA ACGCCACTGC CAGGTTGCTG GAAGAAAAAC CCTTCAAGGC TTTAAACAAT
TTAATGGCCA GCCTGTCCCT ACCACCTAGT GAGGGACCAC GCTTTCAGGG CGGCCTGGTA
GGCTACCTGG GATATGATAT GGTCCGGGAA CTGGAGGCTT TGCCCCCGGG GCCCGGTAAT
GACCTGCAAA TACCAGATAC CCACCTGACC CTCCACCGTT GTTACCTGGT TTACGACCAT
ATCCTGCGGA CAGTGAGGAT AACCTGCCTG GGGCGAGGCG GGGAGAATGC CCTTGCCGGG
TATGAAGAAG CTGTAGCCGG GGTCAAGGGA ATTCTGGAAA AACTCGGGCG ATCTTCTGGC
GGGTACCGGA ATGGGTATCC GCCGGCAGCA GGGCAACTGG TCCCGGAAGG GGTCTCCTGG
CAGGCCAGCG TGACCCGACA GGAATTTACC GGGATGGTTA CAAAAGCTAA GGAGTATATC
GCCGCCGGCG ATATCTTCCA GGTGGTCCTC TCCCAGAGAT TGAGCCTGCC CTTCAGGGAG
GACGCCCTGG TCGTTTACCG GCACTTGCGA GCCCTTAACC CTTCGCCGTA TATGTTTTAC
CTTAACTTCC CGGAGGTGCA ACTGGTAGGC GCATCGCCGG AAATGCTGGT GCGGGTGGAA
AGGGGAACAA TCGATTATCG CCCCATCGCC GGTACCAGGC GCCGGGGACG GACTGCTGCC
GAGGACAGGG CCCTGGCCGC GGAACTCCTG GCCAGCGAGA AGGAGCGTGC CGAACACCTG
ATGCTCCTGG ACCTGGGGCG GAACGACGTG GGAAGGATAG CCGTCCCGGG CAGCCTGCAG
GTGACCCGGC AGATGGTGGT GGAGTATTAT TCCCACGTCA TGCACCTGGT TTCCAGCATT
ACCGCCCGCC TGGCTCCTGG CAGGAGTGCC CTGGACGCCC TTCTGGCCTG TTTCCCCGCC
GGTACGGTGA CCGGGGCTCC CAAGGTACGG GCCATGGAGA TCATCACCGA ACTGGAGCCG
GTCAACCGGG GACCCTATGC CGGCGCCGTA GGTTACCTGG GTCTCCATGG CAACCTGGAT
ACCTGCATTG CCATCCGCAC CATAGTTTTT GCCCGGGGTC GGGCCTTCAT CCAGGCCGGG
GCGGGCATTG TCGCCGACTC CGACCCGGAA GCCGAATATG AAGAGACCCT GAACAAAGCC
CGGGCGCTGT TGCAGGTATT AAAAAAGCCG GAGGTGGGCA CCCGTGCTGC TAATGATTGA
 
Protein sequence
MFHPDRATYL DLARKAPYIP VWTEILADTE TPISLYLKFA GGPDSFLLES VEGGENLGRY 
SLIGFDPLLT FTARSGKAYL STNNATARLL EEKPFKALNN LMASLSLPPS EGPRFQGGLV
GYLGYDMVRE LEALPPGPGN DLQIPDTHLT LHRCYLVYDH ILRTVRITCL GRGGENALAG
YEEAVAGVKG ILEKLGRSSG GYRNGYPPAA GQLVPEGVSW QASVTRQEFT GMVTKAKEYI
AAGDIFQVVL SQRLSLPFRE DALVVYRHLR ALNPSPYMFY LNFPEVQLVG ASPEMLVRVE
RGTIDYRPIA GTRRRGRTAA EDRALAAELL ASEKERAEHL MLLDLGRNDV GRIAVPGSLQ
VTRQMVVEYY SHVMHLVSSI TARLAPGRSA LDALLACFPA GTVTGAPKVR AMEIITELEP
VNRGPYAGAV GYLGLHGNLD TCIAIRTIVF ARGRAFIQAG AGIVADSDPE AEYEETLNKA
RALLQVLKKP EVGTRAAND