Gene Lferr_1970 details


Gene Information

Locus tag: Lferr_1970
Symbol:
ID: 6877958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1966393
End bp: 1968288
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 55%
IMG OID: 642789839
Product: RNA polymerase, sigma 70 subunit, RpoD
Protein accession: YP_002220394
Protein GI: 198284073
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
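
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions are consistent with the listed 1896 bp and 631 aa but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1966393, 1968288)
print(length, protein_length(length))  # prints: 1896 631
```

Here 1896 bp corresponds to 632 codons, that is, 631 residues plus one stop codon.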


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.53576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGACG GTGAGAGTGT AGTGGATAAT GAATCTCAGC GGTCTGAGCT GAAGCGGCTT 
ATTGCGCGTG GTAAAGAGCA AGGCTACCTG ACCTATCGCG AGATCAATGA TCATTTGCCG
GAAGAGGTTT TCGACCCCGA GCAGATGGAA AATGTCATCT CCATGATCAA TGACATGGGT
ATTGAAGTCT TTGAGGAGGC TCCGGACGAC GATACCCTGC TGATGGATGG GGAAGGTGGT
ACCGTAGTCG CCGCTCAGGA AGCGGAAGAA GCGGCAGAGG AAGCCCTGGC CGTGGTCGAG
GCGGATATCG GGCGAACCAG TGATCCGGTG CGCATGTACA TGCGCGAGAT GGGCAGCGTG
GAACTGCTCA CGCGTGAAGG CGAAATCGAA ATTGCCCGGC GTATCGAAGA TGGTCTGATG
CAGGTATTGC GCGCTGTATC GACCTGCCCG ACGACCATCT CTCTTCTTCT GGACGCCGCG
GCTCGGGTGG AGCGCGGCGA GACGCGCCTG GATGAAGTGG TCGACGCCTT TATTGATCTT
AGCGCGCTGG ATGCAGAAAC CGTCAGTGAG GCAGAAGAAA GCGTGCTGGT GGAAGGCGAC
GATCTGGACG TCGATGAGGA TGAGGAAGAA TCCGAAGACG GTGATATCGA AGTCGTAGAC
AAGGGGCCAC AGCTCGAAGA TGCACTGGAA CGTTTTGCGG TCATCCGTGC TGCGTATACC
GCGCTGCTGG CCAGCCATGC CGAGGGGGAG ATGCATGGTG AAAGTTATCA GCGGCAGCGC
CGGGAGCTCG CCGAACGTTT CCTGGAAATC AAACTCAACG GTCGTCAGAT TGACGTCATG
ACCGACGCCC TGCGCGGGCT AACGGAAGAG GTGCGTCAGT GCGAACGGGA GTTGATGGAA
CTGTGCATTG AACGTGCGCG CTTCCCGCGC AAAGAATTTG TGCGCAGTTA TCCCGGTCAT
GAGGGTGATA TCGGCTGGAT TGATCAGCAA ATCGCCGCTG GTCATGCCTA TAGTACCCGT
TTGGTTGAGT TTCGTGATGA TATCGTGGCG ATCATGAAAC GCTTGGCGAA TATTGAGAAG
CGTGCCGGTT TGCCGATTGC CGAGATCAAA GAGGCCAGTC GCCTGATGTC CATTGGTGAG
GCCAAGGCGC GGCGTGCCAA GAAGGAAATG GTGGAGGCCA ATCTGCGTTT GGTGATCTCT
ATCGCCAAGA AGTATACCAA TCGTGGCCTG CAGTTCCTGG ATCTGATTCA GGAAGGAAAT
ATCGGTCTGA TGAAAGCGGT GGACAAGTTC GAATACCGAC GTGGCTACAA ATTCTCTACT
TATGCGACTT GGTGGATACG CCAGGCCATT ACGCGGAGTA TTGCGGATCA GGCGCGGACC
ATTCGTATCC CGGTACATAT GATCGAAACC ATAAACAAAC TCAATCGAAT CAGTCGCCAG
ATGCTCCAGG AAATGGGACG TGAACCGTCG CCTGAAGAGT TGGCAGAACG GATGGAAATG
CCCGAGGATA AAATCCGCAA GGTGCTCAAA ATTGCCAAGG AGCCTATCTC CATGGAGACG
CCCATCGGTG ACGATGAAGA TTCGCACCTG GGCGACTTCA TCGAAGACCG GAATGTGACG
GCGCCAGCGG ATTCTGCCGT TAACGCCGCC ATTCGCGAAG TGGTCGAGGA GTTGCTTGAT
AATGGCCTGA CAGCGCGGGA AGCCAAGGTG TTGCGTATGC GTTTTGGCAT CGGGATGAAT
ACCGACCACA CGCTGGAGGA GGTCGGCAAG CAGTTTGACG TTACTCGTGA GCGTATCCGC
CAGATTGAAG CCAAGGCGCT GCGCAAACTG CGTCATCCCT CTCGATCGGA GCGGTTACGT
AGTTTTGTGG ACGGTGAAGT GACGATTCCA TCGTGA
 
Protein sequence
MRDGESVVDN ESQRSELKRL IARGKEQGYL TYREINDHLP EEVFDPEQME NVISMINDMG 
IEVFEEAPDD DTLLMDGEGG TVVAAQEAEE AAEEALAVVE ADIGRTSDPV RMYMREMGSV
ELLTREGEIE IARRIEDGLM QVLRAVSTCP TTISLLLDAA ARVERGETRL DEVVDAFIDL
SALDAETVSE AEESVLVEGD DLDVDEDEEE SEDGDIEVVD KGPQLEDALE RFAVIRAAYT
ALLASHAEGE MHGESYQRQR RELAERFLEI KLNGRQIDVM TDALRGLTEE VRQCERELME
LCIERARFPR KEFVRSYPGH EGDIGWIDQQ IAAGHAYSTR LVEFRDDIVA IMKRLANIEK
RAGLPIAEIK EASRLMSIGE AKARRAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN
IGLMKAVDKF EYRRGYKFST YATWWIRQAI TRSIADQART IRIPVHMIET INKLNRISRQ
MLQEMGREPS PEELAERMEM PEDKIRKVLK IAKEPISMET PIGDDEDSHL GDFIEDRNVT
APADSAVNAA IREVVEELLD NGLTAREAKV LRMRFGIGMN TDHTLEEVGK QFDVTRERIR
QIEAKALRKL RHPSRSERLR SFVDGEVTIP S
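
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate a coding sequence. It is illustrative only: the helper names are hypothetical, the codon table is built by hand from the amino-acid assignments of NCBI translation table 11 (which are the same as the standard code), and only the first 18 bases of the gene are used as input rather than the full record.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


fragment = clean("ATGAGAGACG GTGAGAGT")  # first 18 bases of the gene sequence
print(translate(fragment))  # prints: MRDGES
```

The translated fragment matches the first six residues of the protein sequence listed above. The same functions could be applied to the full gene sequence to compare against the reported length and GC content.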