Gene Lferr_0287 details


Gene Information

Locus tag: Lferr_0287
Symbol:
ID: 6876237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 270589
End bp: 272049
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 53%
IMG OID: 642788159
Product: hypothetical protein
Protein accession: YP_002218748
Protein GI: 198282427
COG category:
COG ID:
TIGRFAM ID:
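
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions that happen to match the values in this record, not documented properties of the database. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(270589, 272049)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both results agree with the Gene Length and Protein Length fields listed above.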


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.753964
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGTTGT CCAAATCTCG CGTTATAGCG CATCTCCAGT GCCCCCAAAG ACTCTGGCTG 
CAAACCTATC ATCCTGAACT TGCCGAGGTG GATGGGGGTT CGCAGATGCG GATGAACGTG
GGCACAACCG TCGGAGAGAT TGCACGAACC CTCTTTCCAG ATGGTCACCT GGTGGAAACG
AATGATTTGC GTATAGCCCT TGCAGAGACT GCGTCATTAC TCCAATCTCC GAGCGGCCCA
ATTTTTGAAG CCACTTTGCA GGCAGCGGGA ACACTGGTGC GTGTGGATAT TCTGCTGCCC
CAGGCACATG GGTATCATCT GATCGAGGTG AAATCGGCTA CCCATATCAA GCCCTACTAC
GCTCAGGATA CGGCCATTCA GACCTGGATA AGCCGCGAGG CAGGGGTGCC GGTCTTGCAG
AGCAGCATCG CGACTATCAA CAATCAATTT GTTTACCCCG GCAACAGTAA TTACCAGGAA
CTCTTCACAA TCGAGTCAGT GGATGCACTC ATTGCGCCGC TACTTCCTGA AGTACCGCAA
TGGATCAGCG CTGCCCAGGC GACATTGGCG GGTGGAGAAC CAGATATAGT GCCTGGGAAT
CAATGTGAAA CGCCATTCTC CTGTCCTTTT ATAGGATATT GCGAAAAAGA CCTGCCGAAT
CCCCCAGAAT ATCCCGTTGC ATTACTGCCG CACGGTAGTA CCACTGCAGC GATTTTGAGC
GCGGAAGGGT ATGAGGATTT GCGCCAGGTA CCGAAAAACC GGTTAAGCCA TCCCCTGTAT
CAGCGTATCC GTCAGGCAAG TATGGATAAC AAAGCGTTCC TTGGCCCAGA AGCAGCAGAG
ATTCTGCAAG CCTTACCCTA TCCACGATTT TATATAGATT TTGAAACCAT CAATCCGGCG
ATTCCTATCT GGGCCAACTC ACGGCCCTAC CAGCAAATTC CCTTTCAGTG GTCCTGCCAC
CGCGAAGACG CGGATGGCAG CATTACGCAT GACGCTTATC TTGCGGATGG ACAGGACGAC
CCTAGACCTC ACTTTTTAAG CACACTACTG CTGGCGTTGA GGGATACGGG ACCCATATTG
GTCTATAATG CCGGGTTCGA GAATGCCCGA TTGCGCGAAC TGGCAGAGCA ACTTCCGCAA
CAATCACAGG CTGTACAATC CGTTCTGGCG CGGGTGGTAG ATCTACTGCC CATCGCTCGC
GCTTATTACT ACCATCCCGC CATGAAAGGC TCTTGGTCCA TCAAGGCGGT TCTCCCGACT
ATTGCTCCAG AGCTTGCCTA CGATGATCTT GAGGTAGCAA ATGGGGGTAT GGCACAGGAA
GCCTTTATGG AGATGTTGAG CCCAGATGCA GACCCTCTGC GCAGAGAAGA GGTCCGTGCG
GCACTGCTCA CTTATTGTGA ACGGGACACA CTGGCTATGA TCCACATTGC CCGGCACTTC
AAGGGGGAAG CACATGGGTA G
 
Protein sequence
MGLSKSRVIA HLQCPQRLWL QTYHPELAEV DGGSQMRMNV GTTVGEIART LFPDGHLVET 
NDLRIALAET ASLLQSPSGP IFEATLQAAG TLVRVDILLP QAHGYHLIEV KSATHIKPYY
AQDTAIQTWI SREAGVPVLQ SSIATINNQF VYPGNSNYQE LFTIESVDAL IAPLLPEVPQ
WISAAQATLA GGEPDIVPGN QCETPFSCPF IGYCEKDLPN PPEYPVALLP HGSTTAAILS
AEGYEDLRQV PKNRLSHPLY QRIRQASMDN KAFLGPEAAE ILQALPYPRF YIDFETINPA
IPIWANSRPY QQIPFQWSCH REDADGSITH DAYLADGQDD PRPHFLSTLL LALRDTGPIL
VYNAGFENAR LRELAEQLPQ QSQAVQSVLA RVVDLLPIAR AYYYHPAMKG SWSIKAVLPT
IAPELAYDDL EVANGGMAQE AFMEMLSPDA DPLRREEVRA ALLTYCERDT LAMIHIARHF
KGEAHG
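
The sequences above are printed in blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace from the first line of the gene sequence and translates it. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 is assumed to use the same codon-to-amino-acid assignments and to differ mainly in which start codons are permitted, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; assumed to match table 11 for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGGGTTGT CCAAATCTCG CGTTATAGCG CATCTCCAGT GCCCCCAAAG ACTCTGGCTG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MGLSKSRVIAHLQCPQRLWL
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.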