Gene Dole_0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0470 
Symbolrho 
ID5693291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp535141 
End bp536385 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content58% 
IMG OID641263053 
Producttranscription termination factor Rho 
Protein accessionYP_001528357 
Protein GI158520487 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000467448 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAT CCGAGCTGAA AAAAAAGAAG ATGGATGAGT TAAAGGAGAT TGCCGCCGGT 
TACGAGGTCG ACAGCGCCGG CATGAAAAAG CAGGAACTGA TTTTTTCGAT TCTGCAGGCC
GAGGCGGAAA ACAACGGGTA TATCTTCGGT GAAAGCACGC TGGAGGTCCT TTCCGACGGG
TTCGGTTTTT TACGGTCCCC GGACAGCAGC TATCTGCCGG GGCTGGATGA TATTTACGTG
TCGCCTTCCC AGATTCGGCG GTTCAACCTC CGTACCGGCG ACATCGTGTC GGGTCAGATC
CGTCAGCCCA AGGAGAATGA GCGCTATTTC GCGCTGCTGA AGGTGGAGGC CATCAACCAC
GAAGACCCGG AGATCGCGCG GCACAAAACC CCTTTTGACA ACCTCACCCC TCTGTTCCCC
AATGAAAAGA TCAAGCTGGA GCGCGAGTCG GACAACTACT CCATGCGGAT CATGGACCTG
CTGACCCCCA TCGGTTTCGG CCAGCGGGGG CTGATCGTGT CGCCGCCCCG GGCCGGCAAG
ACCATGCTGC TGCAGAACAT TGCCAACAGC ATCATTGCCA GCCACAAGAA GGTGGTGCCC
TTTGTGCTGC TCATCGATGA ACGGCCTGAG GAAGTGACCG ACATGAAGCG TTCCGTTAAC
GCTGAAGTGA TCAGCTCCAC GTTTGACGAG CCGGCCGACC GCCATGTGCA GGTGGCGGAA
ATGGTCATTG AAAAGGCACG GCGCCTGGTG GAGCATAAAA AGGATGTGGT GATCCTGCTT
GACAGCATCA CCCGCCTGGC CCGGGCTTAC AACTCGGTGG TGCCCTCCAG CGGCAAGGTG
CTGTCCGGCG GCGTGGATTC CAACGCCCTG CACCGGCCCA AGCGGTTTTT CGGCGCGGCC
CGCAATATTG AGGAGGGCGG CAGCCTCACC ATCATGGCCA CGGCCCTGAT CGACACCGGC
AGCCGCATGG ATGATGTGAT TTTTGAGGAG TTCAAGGGCA CCGGCAACAT GGAGCTTCAT
CTGGACCGGA AGCTGGCCGA TCGGCGCGTC TACCCGGCCA TCGATATCAA CCGGTCCGGC
ACCCGTAAAG AGGAACTGCT GGTGGAAAAG GATGTGCTCA ACCGGGTATG GGTGCTGCGC
AAGCTGCTGG CGACCCTGAA CTCCGTGGAC GGCATGGAAT TTCTGCTCGA CAAGATGAGC
AGCACCAAAA GCAATAAGGA TTTTATGGAT GCCATGAATT CATAG
 
Protein sequence
MDISELKKKK MDELKEIAAG YEVDSAGMKK QELIFSILQA EAENNGYIFG ESTLEVLSDG 
FGFLRSPDSS YLPGLDDIYV SPSQIRRFNL RTGDIVSGQI RQPKENERYF ALLKVEAINH
EDPEIARHKT PFDNLTPLFP NEKIKLERES DNYSMRIMDL LTPIGFGQRG LIVSPPRAGK
TMLLQNIANS IIASHKKVVP FVLLIDERPE EVTDMKRSVN AEVISSTFDE PADRHVQVAE
MVIEKARRLV EHKKDVVILL DSITRLARAY NSVVPSSGKV LSGGVDSNAL HRPKRFFGAA
RNIEEGGSLT IMATALIDTG SRMDDVIFEE FKGTGNMELH LDRKLADRRV YPAIDINRSG
TRKEELLVEK DVLNRVWVLR KLLATLNSVD GMEFLLDKMS STKSNKDFMD AMNS