Gene Lferr_0554 details


Gene Information

Locus tag: Lferr_0554
Symbol:
ID: 6876516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 517416
End bp: 518897
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 60%
IMG OID: 642788437
Product: NusA antitermination factor
Protein accession: YP_002219015
Protein GI: 198282694
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
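
The coordinates and lengths listed above agree with each other, which a few lines of arithmetic can confirm. The sketch below assumes inclusive 1-based coordinates and that the gene length includes the stop codon; both are assumptions about how this record counts, not something the page states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 517416
END_BP = 518897
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare derived values against the values in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 518897 - 517416 + 1 gives 1482 bp, and 1482 / 3 - 1 gives 493 aa, matching the listed gene and protein lengths.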


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.557873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.218927
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGTG AACTTCTTTA TCTGGCGGAT GCCGTCGCCC ACGAGAAGGA TGTGGACCGG 
GAAGTCATTT TCCTGGCCCT GGAGGCATCT CTGGTCTCCG CATCCAAAAA GAAGTATGGG
CAGGACTGGC ATATCGCGGT GGATGTGGAT CGTAAGACCG GAGATTATGT AACCCGTCGG
CTGTGGGAGG TGGTTGCGGA TGATGTCGCG GATTATGACG TGGATCAGCA GATCCGTCTG
AGCGATGCCC GGAAAACCCG CCCCGAAGCG GAGCCGGGTG ACTACCTCGA AGAGGTGTTG
CCACCCGTCG AGTTCGGGCG GATCGCGGCA CAAACGGCCA AACAGGTAAT CGTGCAGAAA
GTGCGGGATG CCGAGCGCGA CCGGATCGTA TCGGACTTTG CGATACGCAA GGGGGATATC
GTCAGTGGTC TGGTCAAACG CATGGAAAAA GGCAACGCCA TCGTCGACAT GGGGCGCGCC
GAGGCCATTC TGCCAAAAGA GGAGATGATG CCGCGCGAGG CCATCCGCCC CGGTGACCGG
GTGAGAGCAC ATCTCCAGGA TGTACGTCGC GTGCAGCGGG GGCCGCAGCT TTTTCTTTCG
CGGACCAGTC CTGAGTTGCT GATCAAGCTG TTCGCCCAGG AAGTGCCGGA AATCGGGAAC
GGGATGATCG AAATCATGGG TGCGGCGCGT GATCCCGGCC TGCGGGCGAA GCTGGCCGTG
CGTTCCAACG ACCCGCGCGT GGACCCCGTG GGGGCTTGTG TGGGCCTGCG CGGTAACCGG
GTACAGACGG TTATCAACGA GTTGAAAGGC GAGCGGATTG ACATTGTGAT CTGGGCAGCC
GATCCGGCCA GCTATGTGAT CAACGCCCTT TCACCCGCGG AAGTGTCCAG CATCGTGGTC
GACGAGAACA CCCACAGTAT GGATGTGGTG GTCGGACCGG AGCACTTGTC CCAGGCCATC
GGGCGGGGCG GTCAGAATGT ACGGCTGGCG ACTCAGTTGA CGGGCTGGAC CATCAACATT
CTGACCGAGG AAGAGGCTCA GGCCAAGCGG GAAGAGGAAG AGTCGACCTT TCTCAACCAC
TTCATCCAGG ATCTGGGTGT GGATGAGGAT CTGGCCGCCC TGCTGGTCAG CGAGGGTTTT
ACCTCCATTG AGGAGGTGGC CTATGTTCCG GTTGCCGAAA TGATGGAAAT CGATGGTCTG
GACGAGAACC TCGTCGGCGA ATTGCGGCGC CGTGCGCGTG ACGTCCTGCT CAACAAGGCC
ATTGCCCAGG AAGAACAGGT GGCGCTCAGT GAACCCGCGG AAGATTTGTT GTCCCTGAAA
GGTATGGATA AGGGTTTAGC GCACTTACTG GCCAGTAAAG GTGTTGTCAC TTCCGAGGAC
CTGGCGGAAC TGGCTGCGAG CGAGCTATGC GAGATGGTCG GTGTGGATGA AGAGCGGGCC
AAGGCTCTCA TTCTGGAGGC GCGTGCGCCC TGGTTTGCTT GA
 
Protein sequence
MSRELLYLAD AVAHEKDVDR EVIFLALEAS LVSASKKKYG QDWHIAVDVD RKTGDYVTRR 
LWEVVADDVA DYDVDQQIRL SDARKTRPEA EPGDYLEEVL PPVEFGRIAA QTAKQVIVQK
VRDAERDRIV SDFAIRKGDI VSGLVKRMEK GNAIVDMGRA EAILPKEEMM PREAIRPGDR
VRAHLQDVRR VQRGPQLFLS RTSPELLIKL FAQEVPEIGN GMIEIMGAAR DPGLRAKLAV
RSNDPRVDPV GACVGLRGNR VQTVINELKG ERIDIVIWAA DPASYVINAL SPAEVSSIVV
DENTHSMDVV VGPEHLSQAI GRGGQNVRLA TQLTGWTINI LTEEEAQAKR EEEESTFLNH
FIQDLGVDED LAALLVSEGF TSIEEVAYVP VAEMMEIDGL DENLVGELRR RARDVLLNKA
IAQEEQVALS EPAEDLLSLK GMDKGLAHLL ASKGVVTSED LAELAASELC EMVGVDEERA
KALILEARAP WFA
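
The sequences above are printed in space-separated blocks of ten characters, so they need whitespace stripped before use. The sketch below is one possible way to do that and to translate the gene sequence into protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAGTCGTG AACTTCTTTA TCTGGCGGAT"))  # MSRELLYLAD
    # Last two blocks, ending in the TGA stop codon.
    print(translate("TGGTTTGCTT GA"))  # WFA
```

The two printed fragments match the first block and the final three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content given in the record.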