Gene SNSL254_A4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4197 
Symbolrho 
ID6484905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4091894 
End bp4093153 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content53% 
IMG OID642739451 
Producttranscription termination factor Rho 
Protein accessionYP_002043154 
Protein GI194444899 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAGT 
ATGGGGCTGG AAAACCTGGC CCGTATGCGC AAGCAGGACA TTATTTTTGC CATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGCGACGGTG TGCTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC TGCAGACAGC TCCTACCTCG CCGGTCCTGA TGATATCTAC
GTTTCCCCCA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT TTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGT TGAAAGTTAA CGAAGTTAAC
TACGACAAAC CGGAAAACGC CCGTAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAGCGTGGT AACGGTTCTA CCGAAGACTT AACGGCGCGC
GTTTTGGATC TGGCTTCGCC GATCGGTCGC GGCCAGCGCG GTCTGATTGT CGCGCCGCCG
AAAGCGGGTA AAACCATGCT GCTGCAGAAC ATCGCGCAGA GCATCGCGTA TAACCACCCA
GACTGCGTGC TGATGGTGCT GCTGATTGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG
CGTCTGGTGA AAGGCGAAGT GGTTGCGTCT ACCTTTGACG AACCGGCATC CCGCCACGTT
CAGGTTGCCG AAATGGTTAT CGAGAAGGCG AAACGTCTGG TTGAACACAA GAAAGACGTT
ATCATCCTGC TCGACTCCAT CACCCGTCTG GCGCGTGCCT ACAACACCGT GGTGCCGGCT
TCCGGTAAAG TACTGACCGG TGGTGTGGAC GCTAACGCCC TGCATCGTCC GAAGCGTTTC
TTCGGTGCGG CGCGTAACGT GGAAGAGGGC GGTAGCCTGA CTATCATCGC GACGGCGCTG
ATCGATACCG GTTCCAAGAT GGACGAAGTT ATCTACGAAG AGTTTAAAGG CACCGGTAAC
ATGGAGCTGC ATCTCTCGCG TAAGATTGCT GAAAAACGTG TCTTCCCGGC TATCGACTAC
AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACCACTC AGGAAGAGCT GCAGAAAATG
TGGATCCTGC GTAAAATCAT CCATCCGATG GGTGAGATTG ACGCGATGGA ATTCCTCATT
AACAAACTGG CGATGACCAA AACTAATGAC GACTTTTTCG AGATGATGAA GCGCTCATAA
 
Protein sequence
MNLTELKNTP VSELITLGES MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
YDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS