Gene Daro_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2998 
Symbolrho 
ID3567311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3236431 
End bp3237690 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID637681469 
Producttranscription termination factor Rho 
Protein accessionYP_286198 
Protein GI71908611 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value0.0194587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTTT CCGAACTAAA GACACTACAC GTCAGCAAAC TGCTGGACAT GGCCACCGAA 
CTGGTCATCG AAAACGCCAA CCGCATGCGC AAGATGGAGC TGATTTACGC CATCCTGAAG
GCCAAGGCAA AAAATGGCGA CACCATCTAC GGCGATGGCA CCCTGGAAGT TCTGCCTGAC
GGTTTCGGTT TCCTCCGTTC CTCGGACACT TCCTACCTGG CCAACCCGGA CGACATCTAT
GTCTCCCCGT CGCAGATCCG CCGTTTCAAT CTGCGCACTG GCGACACGGT CGAAGGTGAA
ATCCGTACCC CGAAGGATGG CGAACGCTAC GTCGCACTGA CCAAGCTCGA CCGTATCAAC
GGCTTCTCGC CGGAAGCCAA CAAGAACAAG ATCATGTTCG AGAACCTGAC GCCGCTACAC
CCGACGCGTC ACCTCAAACT CGAACGTGAA ATCAAGTCCG AAGAAAACAT CACCAGCCGC
GTCATCGACA TGATTGCCCC GATCGGCTGC GGCCAGCGCG GCCTGCTCGT TGCGCCGCCG
AAGACCGGCA AGACGGTGAT GCTGCAGAAC ATCGCCCACG CCATCACGGC CAATCATCCG
GAAGTCGTGC TGATCGTGCT GCTGATCGAC GAGCGTCCGG AAGAAGTCAC CGAAATGACC
CGTACCGTCA AGGGCGAGGT CGTCGCCTCG ACCTTCGACG AACCGGCATC CCGTCACGTC
GCCGTCGCCG AAATGGTCAT CGAGAAAGCC AAGCGCCTGG TTGAGCACAA GAAGGATGTC
GTCATCCTGC TCGACTCGAT CACCCGCCTC GCCCGCGCCT ACAACACCGT CCAGCCAGCC
TCCGGCAAGG TGCTGACCGG TGGCGTCGAC GCCAATGCCC TGCAGAAGCC GAAGCGCTTC
TTCGGTGCGG CGCGCAACAT CGAGGAAGGT GGCTCGCTGA CCATCCTGGC CACTGCGTTG
ATCGACACCG GTTCGCGCAT GGATGAAGTC ATCTACGAAG AATTCAAGGG TACCGGCAAC
TCCGAAATCC ATCTCGACCG TCGCATGGCC GAAAAGCGGA TGTACCCGGC GGTCAACGTC
AATCGTTCCG GCACCCGTCG CGAAGAACTG CTGCTCAAGC CGGACGTCCT GCAAAAAATG
TGGGTGCTGC GCAAGCTCTG CTACCCGATG GACGACCTCG AAGCTATGGA ATTCCTGCTC
GACAAGGTCA AATCGACCAA GGGTAACCAG GAGTTCTTTG ACGCCATGCG TCGGGGTTAA
 
Protein sequence
MQLSELKTLH VSKLLDMATE LVIENANRMR KMELIYAILK AKAKNGDTIY GDGTLEVLPD 
GFGFLRSSDT SYLANPDDIY VSPSQIRRFN LRTGDTVEGE IRTPKDGERY VALTKLDRIN
GFSPEANKNK IMFENLTPLH PTRHLKLERE IKSEENITSR VIDMIAPIGC GQRGLLVAPP
KTGKTVMLQN IAHAITANHP EVVLIVLLID ERPEEVTEMT RTVKGEVVAS TFDEPASRHV
AVAEMVIEKA KRLVEHKKDV VILLDSITRL ARAYNTVQPA SGKVLTGGVD ANALQKPKRF
FGAARNIEEG GSLTILATAL IDTGSRMDEV IYEEFKGTGN SEIHLDRRMA EKRMYPAVNV
NRSGTRREEL LLKPDVLQKM WVLRKLCYPM DDLEAMEFLL DKVKSTKGNQ EFFDAMRRG