Gene Sala_2845 details

Gene Information

Locus tag: Sala_2845
Symbol: rho
ID: 4080638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2996954
End bp: 2998210
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 61%
IMG OID: 638011229
Product: transcription termination factor Rho
Protein accession: YP_617883
Protein GI: 103488322
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
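
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the one that reproduces the listed gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2996954
end_bp = 2998210
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is the stop codon
# and does not encode an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1257 0 418
```

Under these assumptions the span matches the listed gene length, and 419 codons minus one stop codon gives the listed 418 aa.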


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.495163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0341841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTTGA AAGAACTCAA AACAAAATCG CCCGCGCAGC TCGTCGAAAT GGCCGAGGAA 
CTGGGCGTCG AGGGCGCATC GACGATGCGC AAGCAGGATC TGATGTTCTC GATCCTGAAG
GAGCTCGCCG AAGAGGGCGA GGAAATCATC GGGTCGGGGA CGATCGAGGT ACTGCCCGAC
AGCTTCGGGT TCCTGCGCTC GTCCGAGGCC AATTATCTCG CCGGTCCCGA CGACATCTAT
GTCTCGCCGA ACCAGGTCCG CAAATATGGC CTGCGTACCG GCGACACCGT CGAGGGCGAG
ATTCGTGCGC CGCGCGATGG CGAACGCTAT TTCGCGCTCA CCAAGCTGAT CAGCGTCAAT
TTCGACGATC CCGACGTCGT GCGCCACCGC GTCAATTTCG ATAACCTCAC ACCGCTCTAT
CCGAACCAGA AGCTGTCGCT CGACACCGTC GATCCGACGG TCAAGGACAA GTCGGCGCGC
GTCATCGACC TTGTCAGCCC GCAGGGCAAG GGGCAGCGCG CGCTGATCGT CGCGCCGCCG
CGCACCGGCA AGACGGTGCT CTTGCAGAAT ATCGCCAAGG CGATCACCGA CAACCACCCG
GAAGTTTACC TCATCGTCCT CCTCGTCGAC GAACGGCCCG AGGAAGTCAC CGATATGCAG
CGCAGCGTGA AGGGCGAGGT CGTTTCCTCC ACCTTCGACG AACCCGCGAC GCGCCACGTC
CAGGTCGCCG AAATGGTGAT CGAAAAGGCC AAGCGTCTCG TCGAGCACAA GAAGGATGTC
GTCATCCTGC TCGATTCGAT CACGCGCCTC GGTCGTGCCT ACAACACCGT CGTCCCTTCG
TCGGGCAAGG TGCTGACCGG CGGCGTCGAC GCCAATGCGC TGCAGCGGCC GAAGCGTTTC
TTCGGCGCCG CGCGCAACAT CGAAGAGGGC GGTTCGCTGT CGATCATCGC CACGGCGCTG
ATCGATACCG GCAGCCGCAT GGACGAGGTC ATCTTCGAAG AGTTCAAGGG CACGGGTAAC
AGCGAAATCG TGCTCGACCG CAAGGTTGCC GACAAGCGCA TCTTCCCGGC GCTCGACGTC
GGCAAGTCGG GCACGCGCAA GGAAGAATTG CTCGTTGAAA AGGACAAGCT CTCGAAAATG
TGGGTGCTGC GCCGCATCCT CATGCAGATG GGCACCGTCG ACGCAATGGA GTTCCTGCTC
GACAAGATCA AGGATTCGAA GACAAACGAG GATTTCTTCG ATTCGATGAA CCAATAG
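
The record lists a GC content figure for this gene. A minimal sketch of how such a figure is commonly computed follows, assuming the usual definition (G plus C as a percentage of all A/C/G/T bases); the record does not state the exact method or rounding used. The function name gc_percent is hypothetical, and the sequence must be pasted in by the reader.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace (the 10-base grouping and line breaks used above) and any
    other non-base characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first two 10-base groups of the gene sequence above.
print(gc_percent("ATGCACTTGA AAGAACTCAA"))
```

Passing the full gene sequence to the same function gives a value that can be compared with the listed GC content.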
 
Protein sequence
MHLKELKTKS PAQLVEMAEE LGVEGASTMR KQDLMFSILK ELAEEGEEII GSGTIEVLPD 
SFGFLRSSEA NYLAGPDDIY VSPNQVRKYG LRTGDTVEGE IRAPRDGERY FALTKLISVN
FDDPDVVRHR VNFDNLTPLY PNQKLSLDTV DPTVKDKSAR VIDLVSPQGK GQRALIVAPP
RTGKTVLLQN IAKAITDNHP EVYLIVLLVD ERPEEVTDMQ RSVKGEVVSS TFDEPATRHV
QVAEMVIEKA KRLVEHKKDV VILLDSITRL GRAYNTVVPS SGKVLTGGVD ANALQRPKRF
FGAARNIEEG GSLSIIATAL IDTGSRMDEV IFEEFKGTGN SEIVLDRKVA DKRIFPALDV
GKSGTRKEEL LVEKDKLSKM WVLRRILMQM GTVDAMEFLL DKIKDSKTNE DFFDSMNQ
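
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a plain-Python illustration, not the database's own tooling: it builds the codon table from the standard amino acid assignments (table 11 uses the same assignments as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG). The names CODON_TABLE and translate are hypothetical.

```python
# Codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spacing between groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCACTTGA AAGAACTCAA AACAAAATCG"))  # prints: MHLKELKTKS
```

The output matches the first ten residues of the protein sequence listed above.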