Gene Lferr_0203 details


Gene Information

Locus tag: Lferr_0203
Symbol:
ID: 6876153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 193249
End bp: 194439
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 59%
IMG OID: 642788080
Product: hypothetical protein
Protein accession: YP_002218669
Protein GI: 198282348
COG category:
COG ID:
TIGRFAM ID:
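
The coordinate and length fields above are consistent with one another, which a short calculation can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any tool behind this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon (not translated).
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(193249, 194439)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Under these assumptions, the 1191 bp gene corresponds to 397 codons, of which 396 encode amino acids.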


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACGT TCGCCACCAC AAAACATCCC CATACAAAGA GCAAACAGTT TTTAACAACC 
CTTGCTGTCG CGATTTTGTC CACACTGGCC CTGGTATCAG ACGCCGCTCA AGCCGCGCCC
CTGCCGATGC CAGCGATGAC CGGACCGCTG CAGCCACCGT CTCCTTTTCA GTTCAATGCC
GGGCCGCTCG GAAAACTGGA CGTCACCGGC GTGATGAGCG GCATGGGCGT CTGGCAGGAC
AACCGGGTTC CCGGCGACCG GTTCACCCGC GCCGATATCA GCAATGGCCA GATCTTCATC
CAGAAAACGC ACGGGCTGAT CCAGTTCTTT CTCCAGGCCG GCGCCTACAA TATGCCCGCC
CTGGGCACAC CCTTCCTCTC TACGGGGGCC ACCACCAGCG ATTATTATGG CGCATTGCCG
CAGGCCTATC TGAAAATCGC GCCCACCAAA GACTTCTCGG TGCTGATCGG CAAGCTACCC
ACCCTGATCG GCGCGGAATA TACCTTCACC TTCGAAAACA TGAACATTGA GCGCGGCTTG
CTGTGGAACC AGGAAAACGC CGTCACGCGC GGCGTCCAAG TCAACTACAG TGCGGGGCCA
TTGAGCGCCT CCCTGTCGTG GAACGATGGG TTCTACTCCA ACCGCTTCAA CTGGCTCTCC
GGCGACCTGT CCTACACCAT CAACTCCGCC AACACCGTGA GCTTCGTGGG CATGGGCAAT
GCGGGGCAGA CGGGTTATTC CAGTCTCGCT ACCCCGGTCT ATCAAAACAA CAGCGACATC
TACAACCTGA TCTATACCTA TAGCTCGGGT CCATGGATGA TTCAGCCCTA CCTCCAGTAC
ACCCATGTGT CGGCCAATCC CGCTATCGGT GTGGGACGCG GCACGGCGAC CAAAGGCGCG
GCCATCCTGG CGAGTTATAG TCTCACCCCC CACATTACCT TGGCGGCCCG CGCCGAGTAT
ATCGCAAGCA CCGGCAATGC CACGGATGGC GCGGTCAACC TGATGTATGG ACCCGGTAGC
AAGGCATGGT CGATCACCGT GACGCCCACC TACCAGGATC ACGATTTCTT CGCCCGCGCC
GATTTTTCCT ATGTACAGGC GAGCAGCTAC ACCCAGGGCG ATGTCTTTGG CCCACAGGGA
ACCAATCCTA CACAGGCCAG AGCCCTCGTC GAAACCGGAT TCCTATTTTA A
 
Protein sequence
MSTFATTKHP HTKSKQFLTT LAVAILSTLA LVSDAAQAAP LPMPAMTGPL QPPSPFQFNA 
GPLGKLDVTG VMSGMGVWQD NRVPGDRFTR ADISNGQIFI QKTHGLIQFF LQAGAYNMPA
LGTPFLSTGA TTSDYYGALP QAYLKIAPTK DFSVLIGKLP TLIGAEYTFT FENMNIERGL
LWNQENAVTR GVQVNYSAGP LSASLSWNDG FYSNRFNWLS GDLSYTINSA NTVSFVGMGN
AGQTGYSSLA TPVYQNNSDI YNLIYTYSSG PWMIQPYLQY THVSANPAIG VGRGTATKGA
AILASYSLTP HITLAARAEY IASTGNATDG AVNLMYGPGS KAWSITVTPT YQDHDFFARA
DFSYVQASSY TQGDVFGPQG TNPTQARALV ETGFLF
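
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, checked for GC content, and translated. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and it assumes the input contains only A, C, G and T. All function names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGTACGT TCGCCACCAC AAAACATCCC CATACAAAGA GCAAACAGTT TTTAACAACC"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MSTFATTKHPHTKSKQFLTT
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.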