Gene Saro_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0120 
Symbolrho 
ID3916006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp121800 
End bp123056 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content61% 
IMG OID640442845 
Producttranscription termination factor Rho 
Protein accessionYP_495403 
Protein GI87198146 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.342043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTCA AAGACCTCAA GAAGAAGACC CCCGCCGAGC TGGTCCAGAT GGCCGAAGAG 
CTCGAGGTCG AAGGCGCCAG CACCATGCGT CGCCAGGACC TGATGTTCGC TATCCTCAAG
GAAATGGCCG AAGACGGCGA GGAAATCCTC GGCATCGGCA CGATCGAGGT TCTTCCCGAC
GGTTTCGGCT TCCTGCGGAG CCCCGAAGCG AACTATCTCG CCGGACCCGA CGATATCTAC
GTCTCGCCGA ACCAGGTCCG CAAATGGGGC CTGCGCACCG GCGACACGGT GGAAGGCGAA
GTCCGCGCGC CCAAGGACGG GGAGCGCTAT TTCGCGATCA CCCGTCTGAT CAAGGTGAAC
TTCGACGATC CCGAGGCCGT GCGCCACCGT GTCAACTTCG ACAACCTGAC CCCGCTCTAT
CCGAACGAGC GACTGAAGCT CGACACGCTC GACCCGACGG TCAAGGACAA GTCGGCTCGT
GTGATCGATC TCGTTTCGCC ACAGGGCAAG GGCCAGCGCG CGCTGATCGT CGCCCCTCCG
CGCACCGGCA AGACCGTGTT GCTGCAGAAC ATGGCCAAGG CGATCACAGA CAACCATCCG
GAAGTCTTCC TGATCGTGCT TCTGGTTGAC GAACGTCCCG AAGAAGTCAC CGACATGCAG
CGTTCGGTGA AGGGCGAGGT CATTTCCTCG ACCTTTGACG AACCAGCCTC GCGCCACGTC
CAGGTCGCTG AAATGGTCAT CGAGAAGGCC AAGCGTCTTG TCGAGCACAA GCGCGACGTG
GTGATCCTGC TCGACTCGAT CACACGTCTC GGCCGTGCGT ACAACACCGT CGTGCCCTCG
TCGGGCAAGG TGCTGACCGG CGGTGTCGAT GCCAACGCCC TGCAGCGTCC CAAGCGCTTC
TTCGGCGCGG CGCGCAACAT CGAGGAAGGC GGTTCGCTTT CGATCATTGC CACGGCGCTG
ATCGATACCG GCAGCCGCAT GGACGAAGTG ATCTTCGAAG AGTTCAAGGG CACCGGCAAC
TCGGAAATCG TGCTGGACCG CAAGGTTGCG GACAAGCGCA TCTTCCCGGC GCTGGACGTG
GGCAAGAGCG GTACCCGCAA GGAAGAACTG CTCGTACCGA AGGATCAGCT CTCGAAGATG
TGGGTCCTGC GCCGCATCTT GATGCAGATG GGCACTGTCG ATGCGATGGA GTTCCTGCTC
GACAAGATGA AGGATTCGAA AACCAACGAA GACTTCTTCG CGACGATGAA CCAGTAA
 
Protein sequence
MHLKDLKKKT PAELVQMAEE LEVEGASTMR RQDLMFAILK EMAEDGEEIL GIGTIEVLPD 
GFGFLRSPEA NYLAGPDDIY VSPNQVRKWG LRTGDTVEGE VRAPKDGERY FAITRLIKVN
FDDPEAVRHR VNFDNLTPLY PNERLKLDTL DPTVKDKSAR VIDLVSPQGK GQRALIVAPP
RTGKTVLLQN MAKAITDNHP EVFLIVLLVD ERPEEVTDMQ RSVKGEVISS TFDEPASRHV
QVAEMVIEKA KRLVEHKRDV VILLDSITRL GRAYNTVVPS SGKVLTGGVD ANALQRPKRF
FGAARNIEEG GSLSIIATAL IDTGSRMDEV IFEEFKGTGN SEIVLDRKVA DKRIFPALDV
GKSGTRKEEL LVPKDQLSKM WVLRRILMQM GTVDAMEFLL DKMKDSKTNE DFFATMNQ