Gene Rcas_3710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3710 
Symbolrho 
ID5541212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4862737 
End bp4864074 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content59% 
IMG OID640895821 
Producttranscription termination factor Rho 
Protein accessionYP_001433768 
Protein GI156743639 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.238599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000242256 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCAAGAA TGAGAACGAA AAATACCACC ACCAACAATG GCGGCGAACC AGTCGCTGAG 
TACACGGAAC CCGTTCGCGA CAAGCCCATG ATTAATGTTG CCGAACTGGA AAGCAAAACG
CTCAATGAGT TGCGCGAAAT GGCGAAAGCG ATCGGTGTGA CCGGTGTGAG CACCCTCAAG
AAGCAGGATT TGATTTTCAA ACTGCTCCAG ATGCAAACCG AAGAAGCGGG TCATACCCTG
TCCGACGGCA TTCTCGATAT TGTCGCCGAT GGGTATGGAT TCCTGCGCGG TGAGCGGATG
CTGCCGGGAC CGGATGATGT CTATGTGTCG CAGTCGCAAA TCCGGCGCTT CGGCTTGCGC
ACCGGCGATC GCGTCTGGGG TCAGATCCGT CCGCCAAAGG AGAGTGAGCG CTACTACTCG
CTCCTGCGGG TTGAAATGGT CAATGGGATG GACCCGGAGA CGGCGCGGAA GCGTCCGTCG
TTCGATCAGT TGACGCCGAT CTTTCCCAAT GAGCAGATTA AACTCGAAAC CGAACCGCAC
CTGCTCGCTA CCCGCCTGGT TGATCTGGTG GCGCCGATCG GTCGCGGGCA GCGCGGTCTG
ATCGTGTCGC CGCCCAAAGC CGGCAAGACG TTGCTGCTCA AGGCAATCGC CAACGGCATT
ACGACCAACT ATCAGGATAT TCACCTGATG GTGCTGCTCA TCGGTGAGCG CCCGGAAGAG
GTTACCGATA TGCGGCGTTC GGTCAAGGGG GATGTCATTT CGTCAACCTT CGATGAGCCG
GTCGAGGACC ATACGAAGGT CGCCGAGATG ACGCTGGAAC GCGCCAAACG CCTGGTCGAA
GGCGGGCAGG ATGTGGTGAT CCTGATGGAC TCGATCACCC GCCTGGCGCG CGCTTACAAC
CTGGATATGC CGCCGAGCGG ACGCACGCTC TCCGGCGGTA TCGATCCGGT GGCGCTTTAT
CCGCCCAAAC GCTTCTTCGG CGCTGCGCGC AATATCGAGA ATGGCGGGTC GCTTACCATT
ATTGCGACCT GTCTGGTCGA TACCGGTAGC CGTATGGACG ATGTGATCTA CGAGGAGTTC
AAAGGCACCG GCAATATGGA ATTGCACCTC GACCGCAAAC TGGCGGAGAA GCGCGTCTTC
CCGGCAATCG ATATTACCCG CAGCGGTACG CGCCGTGAGG AACTGCTGCT GTCGCCCGAT
GTGTTGCGTC AGGTGTGGAC GCTGCGCCGT ATGGTCGGTA TGCTCGGTGA GGGTGAAGGC
ACCGAACTGG TGCTGACGCG CATGGCGAAG ACGCGCAATA ATGCTGAGTT CCTGGCGACG
CTGAGTAAGG CGAATTAG
 
Protein sequence
MPRMRTKNTT TNNGGEPVAE YTEPVRDKPM INVAELESKT LNELREMAKA IGVTGVSTLK 
KQDLIFKLLQ MQTEEAGHTL SDGILDIVAD GYGFLRGERM LPGPDDVYVS QSQIRRFGLR
TGDRVWGQIR PPKESERYYS LLRVEMVNGM DPETARKRPS FDQLTPIFPN EQIKLETEPH
LLATRLVDLV APIGRGQRGL IVSPPKAGKT LLLKAIANGI TTNYQDIHLM VLLIGERPEE
VTDMRRSVKG DVISSTFDEP VEDHTKVAEM TLERAKRLVE GGQDVVILMD SITRLARAYN
LDMPPSGRTL SGGIDPVALY PPKRFFGAAR NIENGGSLTI IATCLVDTGS RMDDVIYEEF
KGTGNMELHL DRKLAEKRVF PAIDITRSGT RREELLLSPD VLRQVWTLRR MVGMLGEGEG
TELVLTRMAK TRNNAEFLAT LSKAN