Gene Lferr_0859 details

Gene Information

Locus tag: Lferr_0859
Symbol:
ID: 6876824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 815676
End bp: 816872
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 63%
IMG OID: 642788741
Product: NHL repeat containing protein
Protein accession: YP_002219316
Protein GI: 198282995
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAT ATACCCATCC CCTGAGTGCC GCTATCCGGA ACCGGCCAGC CGCCATCCCT 
TCGGAGCAGT TGCTGGTTGG GGCGGGGGCA AGGGTCATCC TGGGGGAGCA GGTACGGCCC
AGGGGGGTCG TCGTACCTGT CGTTCCTTCC GCGCAGACGC TTTTCGGCCC GCGTGGTGCC
AGCCTCATGG CTGACGGATC CCTTTGGGTG GCTGATACCG GCCATCATCG CTTGCTGGGG
TGGCCCACGC TACCCGAAGC CGATGGCCAA CCTGCCACCT GGCTCATTGG TCAGCCAGAT
TTTGAACGGG ACGGGGGGCG CAACGCCCAC GGCCCTGTCG GCGCGGCATC ACTGAATGTT
CCCACCGGGA TCTGTCCGGT TGGCAATGGG ATGGCGGTGG CGGACGTCTG GAACCATCGG
GTGCTGATCT GGTATGAAGT ACCTCATGAA AGCCATGTTC CGGCGGATCT GGTTCTCGGC
CAAACCGATT TTGTGTCGGC GGAAATCAAT CGCGGTGCGC CCCAACCATC CGCGTCCACC
TTATACTGGC CTTATGGGGT CTTTTGGGAT GGTGCCCGGC TCTATGTCGC GGATTCGGGT
AACCGCCGCG TCCTCTGGTG GGAGGGCATT CCCACGGAAA AAGGACAACC CGCGGACGGG
GTCCTGGGCC AGGCGGATTT TCATTGCCGG GACGAGAACG GAGGTCACGA AGCCGACGCC
ATGAGCATGC GCTGGCCCCA TGCGGTAACC CATTTCTGGG ATTGGCTGGT CGTGGGGGAT
GCGGGGAACA ACCGGGTGCT GCTCTGGCGT GGGGCGCCAC AGCGCAATGG TCAGGCGGCC
GATATGGTTC TCGGACAGCC TGATTTTGCT CAGAACGCCC ACAATCGCGG TAATTATTTC
CCCAATGCGG CCTGCTTCAA TATGCCCTAT GGGGTGACCG CCACGGGAAA CTGGCTGATC
GTGGCGGATA CGGCCAACAG CCGGCTGCTG GGCTGGCAGG CGGACGATCT ATTGACGGGC
GCTTCGGCAC GCACCCTCGC CGGTCAGGAT GGTTTCCAGC ACAAGGGGGA CAACCGCTGG
GGCGTGGTGG GGCGCAATAC GCTGTGCTGG CCTTATGGGA TTTCTGCTGC GGGAAGGAGC
GTGATCATCG CCGACTCGGG TAACAACCGC GTGCTGCTTT GGGACAGGCG GCCATGA
 
Protein sequence
MTEYTHPLSA AIRNRPAAIP SEQLLVGAGA RVILGEQVRP RGVVVPVVPS AQTLFGPRGA 
SLMADGSLWV ADTGHHRLLG WPTLPEADGQ PATWLIGQPD FERDGGRNAH GPVGAASLNV
PTGICPVGNG MAVADVWNHR VLIWYEVPHE SHVPADLVLG QTDFVSAEIN RGAPQPSAST
LYWPYGVFWD GARLYVADSG NRRVLWWEGI PTEKGQPADG VLGQADFHCR DENGGHEADA
MSMRWPHAVT HFWDWLVVGD AGNNRVLLWR GAPQRNGQAA DMVLGQPDFA QNAHNRGNYF
PNAACFNMPY GVTATGNWLI VADTANSRLL GWQADDLLTG ASARTLAGQD GFQHKGDNRW
GVVGRNTLCW PYGISAAGRS VIIADSGNNR VLLWDRRP
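
The derived fields in this record (gene length, protein length, GC content) can be cross-checked against the sequences themselves. The following is a minimal sketch in plain Python, not the database's own pipeline. The helper names clean, gc_content and translate are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs mainly in which codons may act as starts, and that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
gene_start = "ATGACTGAAT ATACCCATCC CCTGAGTGCC"
print(translate(gene_start))   # MTEYTHPLSA, the first 10 residues of the protein

# Length check: 1197 bp minus one 3-bp stop codon, divided into codons.
print((1197 - 3) // 3)         # 398
```

Pasting the full gene sequence into translate and gc_content would allow the same comparison against the complete protein sequence and the reported GC content.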