Gene Moth_1340 details

Gene Information

Locus tag: Moth_1340
Symbol:
ID: 3831897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1386005
End bp: 1387024
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 63%
IMG OID: 637829276
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_430196
Protein GI: 83590187
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
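
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1386005, 1387024)
print(length, protein_length_aa(length))  # 1020 339
```

Under these assumptions the listed gene length (1020 bp) and protein length (339 aa) agree with the start and end positions.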


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.728969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.834843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGAAAG CCCAGATTAG CAAAGTAGTA GCCGGCCAGC ACTTGAGCGA AGCCGAGGCC 
GCAGAAGCCA TGGACATAAT CATGGCGGGC GAAGCCACCC CGGCCCAGAT TGCCGCTTTC
CTCACCGCCC TGCGCCTTAA AGGGGAAATG GTAGACGAGA TAACCGGTTT TGCCCGCAGC
ATGCGCCGCC GGGCTATTCC CCTTACCACT AGCCACCCGG TATTTGTGGA CACCTGTGGT
ACCGGGGGAG ACGGCCGCCA AACCTTTAAC ATTTCCACCA CCGCGGCCTT CGTCGTGGCC
GGCGCCGGGG TAGCAGTAGC CAAACACGGC AACCGTTCAG TTTCCAGCCG TTGCGGTAGC
GCCGACATGC TGGAAGCCCT GGGGATCAAA GTCGACCTGC CTCCGGACGC CGTTGCCCGC
TGCCTGGATG AAGTGGGCAT GGCCTTCCTC TTTGCTCCCG TTTTTCATGG TGCAATGAAA
TACGCCGCCG GACCGCGGCG GGAGATCGGC ATTCGTACAG CCTTCAACCT CCTGGGGCCC
CTGACTAACC CGGCGGGCGC TCCCTGCCAG CTGGTGGGAG TTTACGACCC GGATTTAACG
GAAACAGTCG CGGCCGTCCT GGGGCGCCTG GGCAGCCGCC GGGCCTATGT AGTCCACGGT
AGCGATGGAC TGGACGAAGT AACTATCACC GGACCCAGCA AGATAACCTG CCTCGATAAA
GGCGCGATCA GGACGTATAC CTTTACCCCG GAAGATGTCG GCCTGCCACG GGCGAACCTT
GCCGACCTGG CCGGGGGTAC AGCCACCGAC AATGCCGCCA TTGCCCGCGC CGTCCTTTCC
GGTACCAGGG GCCCGGCCCG GGACGTCGTC CTCATTAATG CCGCCTTCGC CCTCCTGGCA
GCCGGTGCCG CTGACACCCT CCAGCAAGCC CTGGCCCTGG CCGAATCAAG TATCGATTCC
GGCGCCGCGG CGGCAAAACT CCAGGCTATG GTGGCCTGGG TGGAAAGCTG GGCTGCCTGA
 
Protein sequence
MLKAQISKVV AGQHLSEAEA AEAMDIIMAG EATPAQIAAF LTALRLKGEM VDEITGFARS 
MRRRAIPLTT SHPVFVDTCG TGGDGRQTFN ISTTAAFVVA GAGVAVAKHG NRSVSSRCGS
ADMLEALGIK VDLPPDAVAR CLDEVGMAFL FAPVFHGAMK YAAGPRREIG IRTAFNLLGP
LTNPAGAPCQ LVGVYDPDLT ETVAAVLGRL GSRRAYVVHG SDGLDEVTIT GPSKITCLDK
GAIRTYTFTP EDVGLPRANL ADLAGGTATD NAAIARAVLS GTRGPARDVV LINAAFALLA
AGAADTLQQA LALAESSIDS GAAAAKLQAM VAWVESWAA
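
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a hedged sketch rather than the database's own method: it assumes the blocks are plain nucleotide text grouped in tens, uses the codon assignments of NCBI translation table 11 (identical to the standard code for amino acids), and translates an alternative start codon such as GTG as methionine at the first position. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments of
# table 11 match the standard code, only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}  # table 11


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "GTGTTGAAAG CCCAGATTAG CAAAGTAGTA GCCGGCCAGC ACTTGAGCGA AGCCGAGGCC"
print(translate_cds(first_line))  # MLKAQISKVVAGQHLSEAEA
```

The 20 residues produced from the first 60 nucleotides match the start of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.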