Gene DvMF_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0491 
Symbolrho 
ID7172378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp575162 
End bp576412 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID643538991 
Producttranscription termination factor Rho 
Protein accessionYP_002434916 
Protein GI218885595 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.000212536 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTTT CGGAACTCAA GATCAAGAGC ATGAGCGAGC TCATGGAGCT TGCCGAGCAA 
TACAACGTCG AAGGCGCCAG CGGCATGCGC AAGCAGGAGC TGATCTTCGC CCTGCTCCAG
GCCTGTGCCT CGCAGAACGG CGCCATCTAC GGCGACGGCG TGCTGGAGAT ACTGCCCGAC
GGTTTCGGCT TTCTGCGGTC GCCGCTGTGC AGCTACATGC CCGGCCCCGA CGACATCTAT
GTGTCGCCGT CGCAGATTCG CCGCTTCAAC CTGCGCAAGG GTGACGTTGT TTCCGGCCAG
ATACGCCCGC CCAAGGAAGG CGAACGCTAC TTCGCCCTGC TGAAGGTGAC CGAGATCGGC
TTCGAGCCGC CGGAAAACGC CAAGAATCTC GTCCTGTTCG ACAACCTGAC GCCCATCTAC
CCCGACCGCC AGTTCATCAT GGAGAACGGG GACAAGAACT ACTCCAGCCG CGTCATAGAC
ATGATGGCCC CCGTGGGCCG CGGCCAGCGC GGCCTGATCG TGGCGCCCCC CCGCACCGGC
AAGACCATCC TGCTCCAGAC CATCGCCAAC TCCATCAACG CCAACCATCC GGATGCGTAC
CTCATCGTGC TGCTCATCGA CGAGCGGCCC GAGGAAGTGA CCGACATGGA GCGCACGGTG
AAGAACGCCG AAGTGGTCAG CTCCACCTTC GACGAGCCGC CGCAGCGCCA CGTGCAGGTC
TGCGAAATGG TGCTGGAAAA GGCCAAGCGC CTGGTGGAAC GCAAGCGCGA CGTGGTCATC
CTGCTCGACT CCATCACCCG CCTGGGCCGT GCGTACAACG CCGTCACCCC GTCCTCGGGC
CGCGTGCTGT CCGGCGGTCT CGACGCCAAC GCCCTGCAAC GCCCCAAGCG CTTCTTCGGC
GCGGCGCGCA ACATCGAGGA AGGCGGCAGC CTGACCATCA TCGCCACCGC CCTCATCGAC
ACCGGCTCGC GCATGGACGA AGTGATCTTC GAAGAGTTCA AGGGCACCGG CAACATGGAA
ATCTACCTGG AACGCCACCT TGCCGAAAAG CGCGTGTTCC CGGCTATCGA CATCAACCGC
ACCGGCACCC GCAAGGAAGA CCTGCTACTG TCGGACGAGG TGCTCAACCG CGTGTGGATC
CTGCGCAAGA TTCTGGCGCC CATGTCGCCC ATCGACAGCA TGGAATTCCT GCTGGACAAG
ATGCGCGCCA CCAAGAGCAA CCGCGAATTC CTGAACGTGA TGAACAAGTA A
 
Protein sequence
MNLSELKIKS MSELMELAEQ YNVEGASGMR KQELIFALLQ ACASQNGAIY GDGVLEILPD 
GFGFLRSPLC SYMPGPDDIY VSPSQIRRFN LRKGDVVSGQ IRPPKEGERY FALLKVTEIG
FEPPENAKNL VLFDNLTPIY PDRQFIMENG DKNYSSRVID MMAPVGRGQR GLIVAPPRTG
KTILLQTIAN SINANHPDAY LIVLLIDERP EEVTDMERTV KNAEVVSSTF DEPPQRHVQV
CEMVLEKAKR LVERKRDVVI LLDSITRLGR AYNAVTPSSG RVLSGGLDAN ALQRPKRFFG
AARNIEEGGS LTIIATALID TGSRMDEVIF EEFKGTGNME IYLERHLAEK RVFPAIDINR
TGTRKEDLLL SDEVLNRVWI LRKILAPMSP IDSMEFLLDK MRATKSNREF LNVMNK