Gene Dret_0843 details


Gene Information

Locus tag: Dret_0843
Symbol: rho
ID: 8418662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 999635
End bp: 1000882
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 54%
IMG OID: 645037412
Product: transcription termination factor Rho
Protein accession: YP_003197712
Protein GI: 258404970
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
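
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function name is hypothetical.

```python
# Values copied from the record; the inclusive-coordinate convention is an assumption.
start_bp = 999635
end_bp = 1000882

# With 1-based inclusive coordinates, the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Return the protein length for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


print(gene_length_bp)                     # 1248
print(protein_length_aa(gene_length_bp))  # 415
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.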


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 8.58809e-11
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0802102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTAT CCGAGATGAA AAGGAAATCC ATGGCCGAAC TCATGCAGTT GGCCAAGGAG 
TATAAAGTCG AAAACCCAAG CGGTCTTCGC AAGCAGGAAC TGATTTTTGC CATCCTCAGT
TCCTGTGCTT CGCAAAACGG CTCGATCTAT GGGGAAGGCG TGCTGGAGAT TTTGCCCGAC
GGGTTCGGCT TTCTTCGCTC CCCCATGTAC AGCTATATTC CGGGACCGGA CGATATTTAC
GTCTCCCCGT CGCAGATCCG CCGCTTCGGC CTGCGCAAGG GTGACGTGGT CTCCGGGCAG
ATCCGCCCGC CCAAGGAAGG GGAGCGGTAT TTCGCTTTGC TCCGGGTACA GCAAGTTGGG
TTTGCCACTC CAGAAGAATC CAAGAACCTA GTCCTGTTTG ATAATTTAAC GCCGATTTAT
CCGGACCAGC GCTTTGTCAT GGAAACTGGG GCGGAAAGCT ACTCCTCGCG GGTTGTGGAT
CTGCTGGCCC CCATAGGTAA GGGACAGCGC GGGCTCATAG TAGCCCCGCC ACGCACCGGA
AAAACGATGC TTTTGCAAAG TATCGCCAAC TCCATTACCG CCAACCACCC TGATTCCTAT
CTTATTGTGC TCCTGATCGA CGAACGACCT GAAGAAGTGA CGGATATGGA GCGTACCGTC
GACGGCGAAG TTGTCAGTTC GACGTTCGAC GAGCCTCCCC AACGTCATGT CCAGGTGGCG
GAAATGGTTT TGGAAAAAGC CAAACGCCTT GTGGAACGCA AAAAAGATGT CGTTATCCTC
CTGGACAGTA TCACCCGTTT CGGTCGAGCC CACAACGCGA TCATTCCTTC GTCAGGACGG
GTCCTCTCTG GCGGTCTGGA CTCCAACGCC CTGCAACGAC CGAAGCGTTT TTTTGGGGCT
GCGCGGAATA TCGAGGAAGG CGGGAGTCTG ACCATTATCG CTACGGCGCT TATCGATACC
GGATCGCGCA TGGATGAGGT CATCTTTGAG GAATTCAAGG GCACCGGAAA TATGGAAATT
TACCTGGATC GCCATCTGGC CGACAAGCGC GTCTTTCCGG CCATTGATAT CAACCGTTCC
GGCACCCGCA AGGAGGACCT GCTTTTGGAC GAGAACGTCT TGAACCGGGT TTGGATATTG
CGTAAGCTTC TGGCTCCCAT GAATTCAGTG GAGAGCATGG AGTTTCTCCT GGACAAAATG
CGCGGCACAA AGAGCAACCG CGAGTTTCTC GATATGATGA ACAGTTAG
 
Protein sequence
MNLSEMKRKS MAELMQLAKE YKVENPSGLR KQELIFAILS SCASQNGSIY GEGVLEILPD 
GFGFLRSPMY SYIPGPDDIY VSPSQIRRFG LRKGDVVSGQ IRPPKEGERY FALLRVQQVG
FATPEESKNL VLFDNLTPIY PDQRFVMETG AESYSSRVVD LLAPIGKGQR GLIVAPPRTG
KTMLLQSIAN SITANHPDSY LIVLLIDERP EEVTDMERTV DGEVVSSTFD EPPQRHVQVA
EMVLEKAKRL VERKKDVVIL LDSITRFGRA HNAIIPSSGR VLSGGLDSNA LQRPKRFFGA
ARNIEEGGSL TIIATALIDT GSRMDEVIFE EFKGTGNMEI YLDRHLADKR VFPAIDINRS
GTRKEDLLLD ENVLNRVWIL RKLLAPMNSV ESMEFLLDKM RGTKSNREFL DMMNS
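
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the standard NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so that distinction does not matter here. The function name `translate` and the handling of the space-separated 10-base blocks are illustrative assumptions, not part of the source database.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATCTAT CCGAGATGAA AAGGAAATCC"))  # MNLSEMKRKS
```

The output matches the first ten residues of the protein sequence. Applying the same function to the full gene sequence should reproduce the 415-residue protein, with the final TAG codon acting as the stop.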