Gene RoseRS_3749 details


Gene Information

Locus tag: RoseRS_3749
Symbol:
ID: 5210731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4690549
End bp: 4691877
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 58%
IMG OID: 640597345
Product: NusA antitermination factor
Protein accession: YP_001278053
Protein GI: 148657848
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
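
The gene and protein lengths in this record follow from the start and end coordinates. A minimal Python sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4690549, 4691877)
print(length, protein_length_aa(length))  # prints: 1329 442
```

Both values agree with the Gene Length and Protein Length fields of this record.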


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00409142
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000152852
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAAGCG ACTTTTATGC GGCAATCTCT CAGATTGCCT CCGAACGCGG TATTCCAAAG 
GAAGCCATCG TCGAAGTGAT GGAAAAAGCG CTGGCGACTG CCTATCGTCG GACACTGGGA
CCGAACCCGC CGCCAATGGA GATCTCGGTC CGGCTCGACC CTGTCACCGG TATGGCGCGT
GTCTATGCCG AAAAACAGGT CGTCGACGAT GTCTTCGATG AGCGCTTCGA GATCGACCTG
GAGAGCGCGC GCAAGATCAA GCCCGACGTC GAATTAGGCG AGTCGGTTGT GGTCGAGGCA
ACGCCCAGGG ATTTCGGGCG GATTGCCGCA CAAACGGCGA AGCAGGTCAT TCTTCAGGGA
ATCAAAGAAG TCGAGCGCGA GCATATCTAC GGCGAGTATA TGGATCGCGA GGGCGAACTG
GTGACCGCGA CGGTGCAGCG GATAGCGAAG GGCAACGTCA TCCTCGAAAT GGGGAAAGCC
GAGGCTATTC TGCCGCCGAA AGAGCAGGTC GAGACCGATC GGTACTACCA CGGGCAGCGC
CTGAAAGTCT ATCTGATGGA AATCCGGCGT GAGGATCGCG GTCCGAAATT GATCGCGTCG
CGTGCGCACA AAAATCTGAT CACGCGCCTG TTCGAGATGG AGGTGCCGGA GATTTATAAT
GGCGCGGTTG AGATTAAGTC GATAGCGCGC GAGCCGGGCA TCCGCACAAA AGTGGCAGTC
GCCGCGCGCC AGGAGGGAAT CGACCCGGTC GGCTCTTGCG TCGGTATGCG CGGGATCCGG
ATCCAGAACA TTGTCAATGA ACTAAATGGC GAGAAAATCG ATGTGGTTCA GTGGTCGTCC
AATCCGAAAG AGTTCATTGC CAATGCACTG TCGCCAGCGC AGGTGGTTGA GGTGCAACTG
CGCGATGATG AACACGCGGC GACGGTGATT GTGCCGGATA AGCAACTTTC GCTGGCGATC
GGCAAAGAGG GGCAGAATGT GCGCCTGGCA GCAAAATTGA CCGGATGGCG CATCGATATC
AAGAGTGCAT CGGCGTTGCT CGATGAAGAG CGCGCTGCTG CCGAAGCACG CGATGCTGCG
GAAGCCGAGG CGCTGGCGAC TGAAGCAGCG CTGGCGACGG CGAAAGTCGA GACGCGCAAA
GTCTATGCCG ATGGCACGAT CGTCTATCGC AAGCACCGCT ATGGTCCGCT CGGCGATGAT
CTGGTCGGTG AAACGGTTCA GTTGCGCGCG ACGCCGCAGA AACTGTACAT CTATCGCGGT
GACCGCCTCG TGGCCTCCTA TATTCTGGTT GAAGAAGACG ACGACGAAGA GGATGAAGAC
GAGGAGTAG
 
Protein sequence
MKSDFYAAIS QIASERGIPK EAIVEVMEKA LATAYRRTLG PNPPPMEISV RLDPVTGMAR 
VYAEKQVVDD VFDERFEIDL ESARKIKPDV ELGESVVVEA TPRDFGRIAA QTAKQVILQG
IKEVEREHIY GEYMDREGEL VTATVQRIAK GNVILEMGKA EAILPPKEQV ETDRYYHGQR
LKVYLMEIRR EDRGPKLIAS RAHKNLITRL FEMEVPEIYN GAVEIKSIAR EPGIRTKVAV
AARQEGIDPV GSCVGMRGIR IQNIVNELNG EKIDVVQWSS NPKEFIANAL SPAQVVEVQL
RDDEHAATVI VPDKQLSLAI GKEGQNVRLA AKLTGWRIDI KSASALLDEE RAAAEARDAA
EAEALATEAA LATAKVETRK VYADGTIVYR KHRYGPLGDD LVGETVQLRA TPQKLYIYRG
DRLVASYILV EEDDDEEDED EE
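
The protein sequence can be derived from the gene sequence by reading codons in frame until the stop codon. The sketch below shows one way to do this in plain Python, together with a GC-content helper. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares, and it does not handle alternative start codons such as GTG or TTG (this gene begins with ATG, so that case does not arise here). The names `translate` and `gc_percent` are hypothetical helpers, not part of the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAGCG ACTTTTATGC GGCAATCTCT"))  # prints: MKSDFYAAIS
```

The first ten residues match the start of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.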