Gene Lferr_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_2844 
SymboltrpD 
ID6878848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp2825241 
End bp2826257 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content64% 
IMG OID642790698 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_002221236 
Protein GI198284915 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.118515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTCC GGGACATACT GGAGCAAATC GTGGCGGGGC AGGACTTGTC GCGCAAAGAG 
ACGGAAACGG TCTTCGCCGC TATCATGGCC GGAGCATGGA CTCCCGCGCA GATCGGTGCT
CTGCTCATGG GACTCCGCAT GAAGGGGCAG CGGGTCGAGG AACTGGTGGG CGCCACCCAG
GCTTTGCGCG CCTGTATGAC GCGGGTAGAA GTCTCTACCG ATCACCTGCT GGATACCTGC
GGTACCGGCG GTGACGCACT GAGCACCTTC AACATATCGA CGGTGTCTGC GGTGGTGGCG
GCGGCCGGCG GGGCGCGGGT GGCCAAACAC GGCAACCGTT CCATGGTCAG CCGCAGCGGC
AGCGCTGACG TGCTGGAAGC CGCGGGTCTG CGCATGGACA TGAGCCCCGC AGAAGTCGCC
GACAGTATCG AGCGCATCGG TATCGGTTTT CTATTCGCGC CGGCGCACCA TGGCGCCATG
CGTTATGCCG TTGGTCCGCG CAAGGAGCTC GCCATCCGTT CGTTGTTCAA CCTCATGGGA
CCACTGAGCA ACCCGGCGGG GGCGCCGCAT CAGGTACTCG GCGTTTATGC CGAGCGCTGG
CTGATTCCCA TGGCCGAAGC CGCCCGGGAA CTGGGATCAC GCCATGTGCT GGTGGTACAT
GGGCACGATG GCCTGGATGA GATCAGCCTG TCCGGGCCAT CAGACATAGC GGAGTTAAAG
GACGGGATGA TCAGTCGCAG CCGGATTCAG CCGGAGGACT TCGGGCTGTC ATCAGCACCG
CTGGCGACCC TGCAAATCGA CAGCGTGGCG GCCGCTCTGG CGGCGGCGGA AGAAGTATTG
CAGAATCGCC CCGGCCCGCG TCGTGACGTA GTTCTGCTCA ATGCCGGGGC CGCCCTCTAT
GCGGCGGACG TGGTCCCCGA TATGGCGGTG GGCGTGGTGG TCGCCCGGGC TGTGCTCAAA
TCCGGCGCCG CCTGGGATAA GTGGCAGGCT TTGTTGGGCA GGACTTCACA GGGATAA
 
Protein sequence
MIVRDILEQI VAGQDLSRKE TETVFAAIMA GAWTPAQIGA LLMGLRMKGQ RVEELVGATQ 
ALRACMTRVE VSTDHLLDTC GTGGDALSTF NISTVSAVVA AAGGARVAKH GNRSMVSRSG
SADVLEAAGL RMDMSPAEVA DSIERIGIGF LFAPAHHGAM RYAVGPRKEL AIRSLFNLMG
PLSNPAGAPH QVLGVYAERW LIPMAEAARE LGSRHVLVVH GHDGLDEISL SGPSDIAELK
DGMISRSRIQ PEDFGLSSAP LATLQIDSVA AALAAAEEVL QNRPGPRRDV VLLNAGAALY
AADVVPDMAV GVVVARAVLK SGAAWDKWQA LLGRTSQG