Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0870 |
Symbol | |
ID | 7172759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 1047322 |
End bp | 1049592 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643539371 |
Product | RNA binding S1 domain protein |
Protein accession | YP_002435294 |
Protein GI | 218885973 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 0.611113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCCAG AACTTCAGGC CACCGCTTCC GCCCCGGCAG CCCCAACCGG TCCGACCGCT CCCGCGGTTG CGCCAGCCGC ACCGATCTCC GCAGCTCCGG CGGCCTTGGC CGCCGCGCTG TCCCTGCCGG TATCCGGCGT CTCCGCCGTC ATGGCCCTGC TGGACGAAGG GGCCACCATT CCCTTCATCG CCCGCTACCG CAAGGAAGCC ACTGGCAGCC TGGACGAGGT GCAGGTGGTC GCCGTGCGTG ATGCCCTGGA AAAGGCGCGC GAACTGGACA AGCGACGGGC GGCGGTGCTG GCATCGCTGG AGGAACGCGG CCTGCTCACC CCGCAACTGG CGGGGGCAGT GGCCGGTGCA TCCACCCTCA CCGCGCTGGA GGACGTGTAC CTCCCCTACC GGCCAAAGCG CGTCACCCGC GCGGAAAAGG CCCGGCGGCG CGGGCTGGCC CCGCTGGCGG AAGCACTGCG CACCGCGCGG CCCACGGCGG ACGCCCTTGC GCTTGCCGCG CCCCATGTAA CGTCCCATGC CACCCCGGAA GGCGCCGACC CGGAACTGGC CGTGCCCGAT GTGCAGGCCG CCCTGGCCGG TGCCCGGGAC ATCCTGGCCG AAAACTGCGC GGAATCCGCC CCCCTGCGCG GAGCCCTGCG CGACCTGTTC GTGCGGCGCG CCGTGCTGCG CGCCAGGGCC GTACCCGGCA AGGAGGCGGA AGGCGCCACC TACGCCGACT GGTTCGGCCA CGAGGAACGC GCCGCCGCCA TGCCCTCGCA CCGCCTGCTG GCCATGCTGC GCGGCGAACG CGAAGGCTTT CTTTCCGTGT CCGTGCGCCC CGACGACGCC GAGGCCCTGG ACACCCTCCA CCGCCGGGCG GGCATCGCCG CCCCCGGCAC CGCCCCACGG CCCGAAACCG CCGCAGGGCA GGTGGAGGCG GCCCTTGCCG ATGCCTGGAA GCGGCTGCTG GCCCCATCGC TGGAAAACGA ACTGCGCACC GCCCTGCGCG AGCGGGCAGA GGCGCAGGCC ATCGGGGTGT TCGCCGCCAA CCTGCGCGAA CTGATCATGG CCCCGCCCCT GGGCGGGCGC CGCGTGCTGG CGCTGGACCC CGGCTGGCGC ACCGGGTCCA AGCTGGTCTG CCTGGACGCT CAGGGCACCC TGCTGCACCA CGAGGTCATT CATCCGCTTA CGGGCGATGC CGGTGCGGAA CGCGCCGCGC GCACCCTGCG CGACTGCTGC GCCAGGTACG CCATGGAGGT GGTGGCCGTG GGCAACGGCA CCGCCGGGCG CGAGACGGAA GCCTTCGTGC GTAACGCGGG CCTGCCCGCC GGAGTGGACG TAGTACTGGT GAACGAAACC GGGGCCTCGG TGTATTCCGC GTCCGAGGTG GCCCGCCGCG AATTTCCGGA CCACGACCTG ACCGTGCGCG GGGCCGTTTC CATCGGGCGG CGGCTGATGG ACCCGCTGGC GGAATTGGTG AAGATCGACC CGCGCTCGCT GGGCGTTGGC CAGTACCAGC ACGACGTGGA CCAGGCCGCC CTGCGCCGCT CTCTGGACGA GGTGGTGGCC TCGTGCGTCA ACGCAGTCGG CGTGGACGCC AACACCGCCA GCCCGGAACT GCTGGCCCAT GTTTCCGGCA TCGGCCCGGT GCTGGCCCGC AACATTGTGG CCCACCGGGC GGAAAACGGC CCCTTCCGCA ACCGCAGGGA TCTGCTGAAG GTGCCGCGCC TCGGCCCCAA GGCATTCGAA CAGGCGGCAG GCTTTCTGCG CGTGCGCGGC GACAACCCGC TGGACGCCAG CGCGGTGCAC CCGGAACGTT ATGCCCTTGT GGCCCGCATG GCGGCGGACC TCGGCTGCGC CCTGGCCGAC CTGCTGCGCC GCGACGACCT GCGCAGGCGC ATCCGGCCCG AACAGTACGT TGCCGACGGC GTGGGCCTGC CCACGCTCAC CGACATCCTG GCCGAACTGG CACGGCCGGG GCGCGACCCG CGCCCGTCGT TCGCCCCGTT CCGCTTCGCC GAAGGGGTGC ATTCTCCCGA CGACCTGGAG CCGGGCATGG TGCTGCCGGG CATCGTCACC AACGTCACCG CCTTCGGGGC ATTCGTGGAC ATAGGGGTGC ACCGCGACGG GCTGGTGCAT GTCAGCCAGC TGTCCGACCG CTTCGTGCGC GACCCGGCGG AAGTGGTGGC CCCGGGGCGC ACCGTGCGGG TGCGGGTGCT GGAGGTGGAC CGGGCGCGTG GCAGGGTGAG CCTGACCATG AAGGGCGTGG ACCAGCAGTA G
|
Protein sequence | MTPELQATAS APAAPTGPTA PAVAPAAPIS AAPAALAAAL SLPVSGVSAV MALLDEGATI PFIARYRKEA TGSLDEVQVV AVRDALEKAR ELDKRRAAVL ASLEERGLLT PQLAGAVAGA STLTALEDVY LPYRPKRVTR AEKARRRGLA PLAEALRTAR PTADALALAA PHVTSHATPE GADPELAVPD VQAALAGARD ILAENCAESA PLRGALRDLF VRRAVLRARA VPGKEAEGAT YADWFGHEER AAAMPSHRLL AMLRGEREGF LSVSVRPDDA EALDTLHRRA GIAAPGTAPR PETAAGQVEA ALADAWKRLL APSLENELRT ALRERAEAQA IGVFAANLRE LIMAPPLGGR RVLALDPGWR TGSKLVCLDA QGTLLHHEVI HPLTGDAGAE RAARTLRDCC ARYAMEVVAV GNGTAGRETE AFVRNAGLPA GVDVVLVNET GASVYSASEV ARREFPDHDL TVRGAVSIGR RLMDPLAELV KIDPRSLGVG QYQHDVDQAA LRRSLDEVVA SCVNAVGVDA NTASPELLAH VSGIGPVLAR NIVAHRAENG PFRNRRDLLK VPRLGPKAFE QAAGFLRVRG DNPLDASAVH PERYALVARM AADLGCALAD LLRRDDLRRR IRPEQYVADG VGLPTLTDIL AELARPGRDP RPSFAPFRFA EGVHSPDDLE PGMVLPGIVT NVTAFGAFVD IGVHRDGLVH VSQLSDRFVR DPAEVVAPGR TVRVRVLEVD RARGRVSLTM KGVDQQ
|
| |