Gene Lferr_0967 details


Gene Information

Locus tag: Lferr_0967
Symbol:
ID: 6876932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 936311
End bp: 937591
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 55%
IMG OID: 642788845
Product: restriction modification system DNA specificity domain
Protein accession: YP_002219420
Protein GI: 198283099
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
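
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both assumptions agree with the reported values but are not stated in the record). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 936311
end_bp = 937591
gene_length_bp = 1281
protein_length_aa = 426


def span_length(start, end):
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Both comparisons hold for this record: 937591 - 936311 + 1 = 1281 bp, and 1281 / 3 - 1 = 426 aa.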


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.800586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.821104
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGAGG TGCGGAGTGA GCGCCAAACG GAGCCAACGG ATCGGGGTAT GAGCGACTGG 
CAGAAAACTA CAGTTGGCGA GATCGCCTCC GGATTCCTCA GCGGTGGAAC GCCATCGACA
TCGCGGGCTG ATTTCTGGGA AGGAGAGAAT CCTTGGATCA CGAGTAAATG GCTCGGAGAC
AAACTTGAGC TCACTACCGG AGAGAAATTC GTCTCCGAAG GAGCAGTCAA AAAGACTGCA
ACCAAGATCG TCCCCAAGGA CAGCATCATC TTCGCAACGC GGGTTGGCGT CGGCAAAGTC
GGCATCAACC GGATTGACCT CGCCATCAAC CAGGACCTCG CAGGCGTCCT GATCGACAAC
GAGCGCTACG ACATCAAGTT TCTCGCATAC CAATTGGGGA TCGACTCGAT TCAGCAATAC
GTCGCGATGA ACAAGCGCGG CGCGACGATC AAAGGGATCA CGCGAGATTG TCTCGAACAG
ATTCGGCTCA ACCTCCCCCC ACTCCCCGAG CAGAAGAAGA TCGCGCACAT CCTTTCGACG
GTGCAGCGGG CGATCGAAGC GCAAGAGCGG ATCATTCAGA CCACCACCGA GCTGAAAAAG
GCCCTCATGC ACAAGCTCTT CACCGAAGGC CTCCGCAACG AACCCCAAAA ACAAACCGAA
ATCGGCCCCA TCCCCGAAAG CTGGGAGGTG GTGGAGATAG GTGACCTCGG AAAATGCATC
ACTGGCTCCA CGCCAAAAAC GAAGGTTGAC TCGTTCTACG ATCCACCTAC CGAAGACTTC
ATAGCCCCTG CCGACCTCGG AGCACGTCGC TACGTATACG ATTCGGAGAA GAAGATTTCT
CCCGAGGGTA TGGCCACCAT CCGGCCAATT CCTAGGAACG CAGTGATGTG CGTCTGTATC
GGATCAAGCA TTGGCAAGGT AGGCATGAGC TATCGGGAAG AGTCTGCGAC GAATCAGCAA
ATCAATTCGA TCATCTGCGG TGAAGGTCGC GACCCCGAAT TCGTCTACTG CCTCCTCAGC
TACCGCTCTG ATTACTGGAA AAGCTTCGCC ACGTTTGGCC CAGTGCCAAT CCTCAGCAAA
GGGCGCTTTT CCACAATCGG CGTTCCGATC CCTTCGTCAC TTGACGAGCA GCAAGCCATC
GCCAAGCCGC TGGTCGCGCT CGAAACAAAA GTTGAAGTCG CCGAGAAGAA AGTCACGGTA
CTGAAAGACC TTTTCCGCAC CCTCCTTCAC GAACTGATGA CCGCGAAGAC CCGCGTTAAC
GAACTAGAGA TTTCCACATG A
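
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be flattened before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from this record as sample input.
first_row = "ATGATGGAGG TGCGGAGTGA GCGCCAAACG GAGCCAACGG ATCGGGGTAT GAGCGACTGG"
flat = clean_sequence(first_row)
print(len(flat), round(gc_percent(flat), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` in the same way gives a figure to compare against the GC content field.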
 
Protein sequence
MMEVRSERQT EPTDRGMSDW QKTTVGEIAS GFLSGGTPST SRADFWEGEN PWITSKWLGD 
KLELTTGEKF VSEGAVKKTA TKIVPKDSII FATRVGVGKV GINRIDLAIN QDLAGVLIDN
ERYDIKFLAY QLGIDSIQQY VAMNKRGATI KGITRDCLEQ IRLNLPPLPE QKKIAHILST
VQRAIEAQER IIQTTTELKK ALMHKLFTEG LRNEPQKQTE IGPIPESWEV VEIGDLGKCI
TGSTPKTKVD SFYDPPTEDF IAPADLGARR YVYDSEKKIS PEGMATIRPI PRNAVMCVCI
GSSIGKVGMS YREESATNQQ INSIICGEGR DPEFVYCLLS YRSDYWKSFA TFGPVPILSK
GRFSTIGVPI PSSLDEQQAI AKPLVALETK VEVAEKKVTV LKDLFRTLLH ELMTAKTRVN
ELEIST
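
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11, as listed in the Gene Information section. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G) and translates the first and last rows of the gene sequence. It is an illustration with hypothetical function names, not the database's own pipeline. Table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (first, second, third base over "TCAG").
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def translate(seq):
    """Translate a cleaned DNA sequence codon by codon; '*' marks a stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGATGGAGG TGCGGAGTGA GCGCCAAACG GAGCCAACGG ATCGGGGTAT GAGCGACTGG"
last_row = "GAACTAGAGA TTTCCACATG A"

print(translate(clean_sequence(first_row)))  # MMEVRSERQTEPTDRGMSDW
print(translate(clean_sequence(last_row)))   # ELEIST*
```

The two outputs match the start (MMEVRSERQT EPTDRGMSDW) and end (ELEIST) of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.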