Gene Lferr_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_2009 
Symbol 
ID6878000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp2010327 
End bp2011598 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content53% 
IMG OID642789881 
Productrestriction modification system DNA specificity domain 
Protein accessionYP_002220433 
Protein GI198284112 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.999718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.91224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCA AAGCACCAAG CACTGTTTTG ACAGCAGAAA CAAAGCCCGC GCGAGTGCCG 
AAGCTGCGGT TTCCGGCGTT TCGGGGGGCG GATGGGTGGA AGCTCGCGCC GCTGAGTCAA
TTGGCAACTC GCACCAAACA GAAAAACCGC GATGAAAAAA TTACTCGCGT GCTGACGAAC
TCAGCCGAGT TTGGCGTGAT GGACCAGCGC GACTTCTTCG ACAAAGAAAT TGCCACGCAA
GGCAATCTCG AGAGCTATTT CGTAGTTGAG CTGGGCAGCT ATGTTTACAA CCCGCGCATC
TCTGCAACAG CACCTGTTGG CCCCATTTCT AAGAACAAAG TTGGCACTGG CGTCATGTCG
CCGCTCTACA CCGTCTTCAA GTTCAAAGAC GGTGGCAATG ACTTTTATGA GCACTACTTC
AAGACAACCG GCTGGCACAC CTACATGCGG CAGGCATCCA GTACAGGCGC GCGGCATGAT
CGAATGGCCA TCTCCAGCGA CGATTTCATG GCCATGCCTT TGCCTGTTCC GACACCGAAG
GAACAACAAA AAATCGCCGA GTGCCTGAGT TCGGTGGACG CGCTGATGGC CGCGCAAGCG
CGGAAAGTGG ACGCGCTCAA GACCCATAAA AAAGGGCTGA TGCAGCAGCT TTTCCCCACG
GAGGGCGAAA CCCAACCCCG CCTGCGCTTC CCCGAATTCC AAAACGCCGG GGAGTGGAAC
AAGACGACCT TGGGTGAAGC AGCGACATTC TTCAACGGCC GAGCATACAA ACAGGAAGAA
CTGCTTGAAT CCGGAAAGTA TCCAGTTCTC CGCGTTGGAA ATTTCTTCAC CAACAACAAT
TGGTATTACT CAGACCTTGA ACTGGATGAG ACAAAGTATT GTGACAAAGG CGATTTGCTT
TACGCATGGT CGGCGTCGTT CGGGCCGCGT ATGTGGCACG GGGTAAAAGT GATTTATCAC
TACCACATCT GGAAGGTCGA ACAACACAGT GGAATAGACC GACAGTTCCT TTTCATCACA
CTCGAAAATG AAACTGAGAG GATGAAATCC AACTCAGCAA ATGGATTGGG ACTTCTGCAC
ATTACGAAGG GAACCATCGA GGGCTGGGAC ACTGCATTCC CATCACCGCC AGAACAACAC
CGCATCGCCT CCTGCCTGAG CAGCCTCGAC GCCCTGATCA CCCTGGAGAC CCAAAAGCTC
GAAGCCCTCA AGACCCACAA GAAAGGGCTG ATGCAGCAGC TTTTCCCAGT ACTCAACGAG
GTGCAGGGAT GA
 
Protein sequence
MNPKAPSTVL TAETKPARVP KLRFPAFRGA DGWKLAPLSQ LATRTKQKNR DEKITRVLTN 
SAEFGVMDQR DFFDKEIATQ GNLESYFVVE LGSYVYNPRI SATAPVGPIS KNKVGTGVMS
PLYTVFKFKD GGNDFYEHYF KTTGWHTYMR QASSTGARHD RMAISSDDFM AMPLPVPTPK
EQQKIAECLS SVDALMAAQA RKVDALKTHK KGLMQQLFPT EGETQPRLRF PEFQNAGEWN
KTTLGEAATF FNGRAYKQEE LLESGKYPVL RVGNFFTNNN WYYSDLELDE TKYCDKGDLL
YAWSASFGPR MWHGVKVIYH YHIWKVEQHS GIDRQFLFIT LENETERMKS NSANGLGLLH
ITKGTIEGWD TAFPSPPEQH RIASCLSSLD ALITLETQKL EALKTHKKGL MQQLFPVLNE
VQG