Gene Rcas_1134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1134 
Symbol 
ID5538600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1468453 
End bp1469781 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content58% 
IMG OID640893268 
ProductNusA antitermination factor 
Protein accessionYP_001431251 
Protein GI156741122 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000746814 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.939177 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCG ATTTTTATGC AGCAATCTCA CAGATTGCGT CCGAACGCGG CATTCCCAAG 
GAAGCCATCG TCGAGGTCAT GGAAAAGGCG CTGGCGACAG CGTATCGCCG AACGCTCGGT
CCTAACCCAC CGCCAATGGA GATATCGGTC CGGCTGGACC CGCTGACGGG CGCGGCGCGC
GTCTATTCCG AAAAACAGGT CGTTGACGAT GTCTACGATG AGCGCTTTGA GATCGATCTG
GAGAGCGCCC GCAAGATCAA GCCCGATGTC GAACTGGGCG AGTCGGTCGT CGTCGAGACG
ACGCCGAAGG ATTTTGGACG AATCGCAGCG CAAACGGCGA AACAGGTTAT CCTTCAGGGA
ATCAAGGAAG TCGAGCGCGA ACACATCTAT GGCGAATACA TGGATCGCGA GGGCGAACTG
GTCACGGCGA CCGTACAGCG CATGGCAAAG GGCAACGTCA TTCTCGAAAT GGGAAAAGCC
GAAGCCGTCT TGCCGCCGAA GGAACAGGTC GAGACCGATC GCTACTACCA CGGGCAGCGC
CTGAAGGTCT ACCTGATGGA AATCCGCCGT GAGGAACGCG GACCGAAACT GATCGCATCG
CGCGCGCACA AAAATCTGAT TACGCGCCTC TTCGAGATGG AAGTGCCGGA GATTTATAAT
GGCGCCGTCG AGATCAAGTC GATCGCGCGC GAGCCGGGTA TCCGTACCAA AGTAGCAGTC
GCGGCGCGGC AGGAGGGCAT CGATCCGGTT GGTTCGTGCG TCGGTATGCG CGGCATTCGC
ATTCAGAACA TCGTCAACGA ACTGAATGGC GAAAAGATCG ATGTTGTGCA GTGGTCGTCA
AATCCAAAAG AGTTCATTGC GAATGCACTG TCGCCAGCAC AGGTCGTTGA GGTGCAGTTG
CGCGACGATG AACACGCTGC GACGGTCATT GTGCCGGATA AGCAACTCTC GCTGGCGATC
GGTAAAGAAG GGCAGAATGT GCGCCTGGCG GCAAAACTGA CGGGATGGCG GATCGATATC
AAGAGCGCGT CGGCGTTGCT CGACGAAGAG CGCGCAGCGG CGGAGGCGCG CGATGCGGCA
GAAGCGGAGG CGCTGGCGAC TGAGGCGGCG CTGGCGACGG CAAAGGTCGA GATGCGCAAA
GTGTACGCCG ATGGAACGAT CGTCTATCGG AAGCATCGCT ATGGTCCACT CGGCGACGAC
CTGGTCGGCG AAACGGTGCA ACTGCGCGCG ACGCCGCAAA AACTGTACAT CTATCGCGGT
GATCGCCTGG TGGCATCGTA TATGCTCGTT GGCGACGATG AAGAGGATGC GATCGAGGGC
GACGAGTAA
 
Protein sequence
MKSDFYAAIS QIASERGIPK EAIVEVMEKA LATAYRRTLG PNPPPMEISV RLDPLTGAAR 
VYSEKQVVDD VYDERFEIDL ESARKIKPDV ELGESVVVET TPKDFGRIAA QTAKQVILQG
IKEVEREHIY GEYMDREGEL VTATVQRMAK GNVILEMGKA EAVLPPKEQV ETDRYYHGQR
LKVYLMEIRR EERGPKLIAS RAHKNLITRL FEMEVPEIYN GAVEIKSIAR EPGIRTKVAV
AARQEGIDPV GSCVGMRGIR IQNIVNELNG EKIDVVQWSS NPKEFIANAL SPAQVVEVQL
RDDEHAATVI VPDKQLSLAI GKEGQNVRLA AKLTGWRIDI KSASALLDEE RAAAEARDAA
EAEALATEAA LATAKVEMRK VYADGTIVYR KHRYGPLGDD LVGETVQLRA TPQKLYIYRG
DRLVASYMLV GDDEEDAIEG DE