Gene Lferr_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1687 
Symbol 
ID6877668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1649824 
End bp1651005 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content62% 
IMG OID642789555 
Productaminotransferase class I and II 
Protein accessionYP_002220116 
Protein GI198283795 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATCC GTCTTTCCCG CCGCGTCAAT GCGGTGCGCC CATCCCCCAC CCTTGCGGTC 
ACCGCCCGTG CCCAGCAACT GCGCCGCGAA GGAAAGGATA TCGTCAGCCT TGGTGCCGGC
GAGCCGGATT TCGACACCCC GGAGTACATC AAGGAGGCAG CCATTGCCGC CATTCGCCAG
GGCTTCACCA AATATACCGC CGTCGGCGGC ACACCGGAAC TGAAGGCCGC CATCATCGGC
AAATTCGCGC ACGACAACCA TCTGTCATAC CGCCCCGATG AAATTCTCGT TTCCGTCGGC
GGCAAGCAAA GCTTCTTCAA TCTTTGCCAG GCCCTTCTGG ATGCCGGCGA TGAGGTCATC
ATTCCCGCGC CCTACTGGGT ATCCTATCCG GACATCGTGC TTCTGGCCGA AGCGCGGCCC
GTCATCATCG ATACCGGCGC CAACCAGCGT TTCAAGATCA GTCCGGAGCA GCTGGAGGAA
GCGATCACGC CCAACACCCG CCTGCTGGTC ATCAACAGCC CCTCCAATCC CTCCGGCATG
ACCTACAGCC GCCCGGAATT GGAAGCCCTG GGTGAGGTCC TCCGCCGTTA TCCCCATATC
CTCATCGCCA GCGATGACAT GTACGAAAAA ATCCGCTTCC ACGATGAAGA GTTCGTCAAC
ATCGCCAACG CCTGCCCGGA TCTGGCTCCA CGCTGCATCG TCATGAATGG CGTGTCCAAG
GCCTATGCCA TGACCGGGTG GCGCATCGGC TACTGCGCCG GCCCCAAGAC GCTGATCACC
GCAATGAATA CCGTACAGTC CCAGAGCACC TCCAATCCCA CCTCCATCGC TCAGGTGGCC
GCCCAGGCGG CACTGGAAGG CGGCGACAGC GCCATCCACG AAATGGTGCT GGCTTTCAAG
CGGCGCCACA CGTATGTCTA CAACCGCCTG AAAGTGCTGC CCGGCGTTGC TGCCATGCCC
TCCGATGGTA CCTTTTACAG CTTTCCGGGA TTTCGCGAAG TCATGGCGGC GAAAGGCCTG
CGGGATGATC TTGCCCTGGC CGAGGCCTTG CTGGGAGCCG GAGTGGCCGT CGTACCGGGC
TCGGCCTTCG GCACTCCTGG CCACATCCGC CTGTCCTTCG CGACCAGCGA CAAGAACCTG
GAGATGGCCC TGGACCGCAT CAGCGCTTTC GTCAACGCCT GA
 
Protein sequence
MDIRLSRRVN AVRPSPTLAV TARAQQLRRE GKDIVSLGAG EPDFDTPEYI KEAAIAAIRQ 
GFTKYTAVGG TPELKAAIIG KFAHDNHLSY RPDEILVSVG GKQSFFNLCQ ALLDAGDEVI
IPAPYWVSYP DIVLLAEARP VIIDTGANQR FKISPEQLEE AITPNTRLLV INSPSNPSGM
TYSRPELEAL GEVLRRYPHI LIASDDMYEK IRFHDEEFVN IANACPDLAP RCIVMNGVSK
AYAMTGWRIG YCAGPKTLIT AMNTVQSQST SNPTSIAQVA AQAALEGGDS AIHEMVLAFK
RRHTYVYNRL KVLPGVAAMP SDGTFYSFPG FREVMAAKGL RDDLALAEAL LGAGVAVVPG
SAFGTPGHIR LSFATSDKNL EMALDRISAF VNA