Gene TM1040_2863 details

Gene Information

Locus tag: TM1040_2863
Symbol: rho
ID: 4076397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3031417
End bp: 3032688
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 54%
IMG OID: 638008192
Product: transcription termination factor Rho
Protein accession: YP_614857
Protein GI: 99082703
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
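
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the CDS ends in a single untranslated stop codon. The variable names are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 3031417
end_bp = 3032688
gene_length_bp = 1272
protein_length_aa = 423

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span)                     # 1272
print(span % 3)                 # 0 (whole number of codons)
print(expected_protein_length)  # 423
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.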


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.799607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.377015
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGC AATCTCTGAA CCTTTCTGAT CTCAAGGCGC AAAGCCCGAA GGCCCTGTTG 
GCTATGGCTG AGGAGCTGGA GATCGAAAAC GCCTCCACTA TGCGGAAAGG CGAGATGATG
TTCCAAATCC TGCGTGAACG CGCGGATGAA GGCTGGAACA TCGGAGGGGA CGGGGTTCTC
GAGGTCCTGC AGGACGGATT TGGCTTCTTG CGCTCGCCCG AAGCAAACTA TCTGCCGGGT
CCAGACGATA TCTATGTCTC TCCTGACATG ATCCGCCAGT TTTCGCTACG CACTGGGGAT
ACGGTAGAGG GGCAGATCAA GGCACCACTG GAAAATGAGC GCTACTTTGC GCTGACCACG
GTGACAAAGA TCAACTTTGA AGAGCCCGAA AAGGCCCGTC ACAAGATTGC ATTCGACAAT
CTGACACCGC TCTATCCGGA CGAGCGTCTG CAGATGGAAA TCGAGGATCC GACGGTTAAG
GATCGCTCAG CACGCGTGAT TGACCTTGTC TCGCCGATCG GCAAGGGCCA GCGTTCGCTT
ATTGTAGCGC CGCCGCGAAC CGGTAAGACA GTTCTACTGC AAAACATCGC CCATTCGATC
GAGCAGAACC ACCCTGAGTG CTATCTGATC GTGCTTCTGA TCGACGAGCG CCCTGAAGAA
GTCACCGACA TGCAGCGCTC CGTCAAAGGC GAGGTCGTGT CCTCCACGTT TGACGAACCC
GCAGCGCGCC ACGTGGCGGT TTCCGAGATG GTCATTGAAA AAGCCAAGCG TCTGGTCGAA
CATAAACGTG ACGTTGTGAT CTTGCTCGAC TCGATCACGC GTCTCGGCCG TGCTTTCAAC
ACGGTTGTTC CATCGTCAGG CAAGGTTCTG ACTGGTGGTG TGGATGCAAA CGCTCTGCAA
CGTCCCAAGC GCTTCTTTGG GGCTGCGCGG AATATCGAAG AAGGTGGCTC TTTGACGATC
ATCGCGACCG CGCTTATCGA CACGGGCAGC CGTATGGACG AGGTTATCTT TGAAGAATTC
AAGGGTACGG GTAACTCTGA GATCGTTCTG GATCGCAAGA TTGCAGACAA GAGGGTCTTC
CCTGCGATCG ACATTCTCAA GTCGGGAACG CGGAAGGAAG ACCTCCTGGT CGACAAAGGC
GATCTGGCAA AAACCTTTGT GTTGCGTCGC ATCCTGAATC CGATGGGGAC CACCGACGCA
ATTGAGTTCC TGCTGTCTAA GTTGAAGCAA ACAAAGACAA ACTCAGAGTT TTTTGACTCG
ATGAACACCT AA
 
Protein sequence
MTEQSLNLSD LKAQSPKALL AMAEELEIEN ASTMRKGEMM FQILRERADE GWNIGGDGVL 
EVLQDGFGFL RSPEANYLPG PDDIYVSPDM IRQFSLRTGD TVEGQIKAPL ENERYFALTT
VTKINFEEPE KARHKIAFDN LTPLYPDERL QMEIEDPTVK DRSARVIDLV SPIGKGQRSL
IVAPPRTGKT VLLQNIAHSI EQNHPECYLI VLLIDERPEE VTDMQRSVKG EVVSSTFDEP
AARHVAVSEM VIEKAKRLVE HKRDVVILLD SITRLGRAFN TVVPSSGKVL TGGVDANALQ
RPKRFFGAAR NIEEGGSLTI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKIADKRVF
PAIDILKSGT RKEDLLVDKG DLAKTFVLRR ILNPMGTTDA IEFLLSKLKQ TKTNSEFFDS
MNT
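
The gene and protein sequences above can be spot-checked against each other. The following is a minimal sketch in Python, not the method used to produce this record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in which start codons are permitted). The helper name `translate` and the variable names are illustrative; only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces ('*' = stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCGAGC AATCTCTGAA CCTTTCTGAT CTCAAGGCGC AAAGCCCGAA GGCCCTGTTG"
last_line = "ATGAACACCT AA"

n_term = translate(first_line)
c_term = translate(last_line)

print(n_term)  # MTEQSLNLSDLKAQSPKALL
print(c_term)  # MNT*
```

The first 20 residues and the final three residues agree with the listed protein sequence, and the gene ends in a TAA stop codon.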