Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0640 |
Symbol | |
ID | 7172527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 763610 |
End bp | 766318 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643539140 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002435065 |
Protein GI | 218885744 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.112639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA CCTCGCTGCG CGACTGGGAG TCGCGCTGCA TACAGGAAGA ACCGCCCCGC TGTCAGGCGG CGTGTCCGCT GCACGTGGAC GCGCGCGCCT TTCTGGGGCA TGCGGCGCAA GGCCGCTGGC GCGAGGCGCG CGGCGTGCTG GAACGCACCA TGCCCCTGCC CGGCGTACTG GCCCGGCTGT GCGAGGCCCC GTGCCAGGCG GCCTGCCTGC GCCAGGAGGC GGGCGGAACC ATTGCCGTGG GCGCGCTGGA GCGGGCCTGC GCCGCACTGT CCGTCCCGGC GGCGGACCCG CGCCCCTTGC CGGGGCGTGG ACTGCGCGCC GCCGTGCTGG GCGGCGGTCT GGCCGCGCTG ACCGTGGCCT GGGATCTGGC GAAGAAGGGC CACGCGGTGA CCTTGGCCGG GTGGCTGGAA GACGGGGATA AGCGTGGGGG CGAGAGTGTC GCCCCCGCCG GTGAGGCCGG TGAGGCTGAC AGGGCCGCCC GGATGGCCGC CACTGGCCGC CTTGCCGCCA TCCCCCCGGA CGTGCTGCCA CCGGATGCCC TGCGCAAGGA ACTGGACCGG CTGGCCGGGC TCAAGGTGCG CTTTGCCGCG CCCGTGGCCC CCACGGCGGC GGTGCTGGAA GAGATGCGCG CCGCGCACGG GGCGGTGTTC GTGGCGTGGA GCCCGGCGGC TGCACGCGCG CTGGGCCTGC CGGACCGCAG CCGGTGCGAT GCGCTGACCC TGGCCCACCC TGATCTGCCC GGCGTGTTCT GCGGCGGCTG GCCCGTGGAC GGCGCGGCAG GTGCAGCGCG CAGCATCGAC GAGGCCGCCG ATGGCCGCCA TGCCGCCACC TCAATGGACC GCCACCTTGC CGGCGTGTCA CTGGCGGCGG GCCGCGAACG GCAGGGGCCG TTCGAAACGC GATTGTTCAC CAGCCTGGAA GGCGTGGCGG CGGTGGCCCC CGTGGCGCTG CCCGCGTTGT CGCCCGCAGG GCAGGGGGCA GGCGGCCCCG CGCTCGAAGC TGCCGAGGGG CCCGACGCCT CCGACTCCCC CGACTTCCCG TATCGGGAAG AGGCGTCCCG CTGCCTGCAA TGCCAGTGTC TGGAATGCGT CAAGGTGTGC CCGTATCTGG AAAAGTACGG CGAATACCCC AAAAAGCACG CCCGGCGCAT CTACAACAAC CTGGCCATCG TCAAGGGCGT GCATCAGGCC AACCGGTTCA TCAATTCGTG CAGCCTGTGC GGGCTGTGCG GCACGGTGTG CCCCACGGGG TTCGACATGG CCCCGCTGTG CCACGAGGCG CGGCGCACAC TGGTGCATGA CGGCAAGATG CCGCCCTCCA CCCACGAATT CGCGCTGGAC GACATGGCCT TCAGCAACGG GCCGCATGCC GCGTTGCTGC GCGCCCCGGC AGGGGCACAG ACCTGCGCGT GGCTGTTCCT GCCCGGTTGC CAGCTTGCGG CGTCCGCGCC GGACAGGGTC GCGCAGGCGT GGGGCATGGT GGCCGACCGC CTGCCCGGCG GCACAGGCAT TGCCCTGCGC TGCTGCGGGG CACCCGCGCT GTGGGCCGGG CGCGACGACC TGGCCGCTGC CGCCGCCGAG GAACTGCGTG ACGGGTGGAA CGCGATGGGA CGCCCCACGC TGGTGGTGGG GTGCCCGTCC TGCGCCACCA CGTTGCGCAC TCTGCTGCCC GACCTGCCCC AGACCATGCT GTGGTCCGTG CTGGCGGAGC ACGGCGTTGG CGGGCAGACG ACGCCCGCAC CGGAGGCGCT TACCCTGCAC GACCCCTGCG CCGCGCGCGG GGACGAACCG CTGCGGACCA CGGTGCGCGG CCTGCTCGCC GCGCGCGGGG TGTCGGTGCA CGAGCCGGAC CTGACCGGAC CACACACCGA ATGCTGCGGC TACGGCGGCC TGATGGCCGA GGCCGATCCC GACCTTGCCC GCACCGTAAT CCAGCGCAGG GCCGATGCCT CGGACTTGCC CTTCGTCACC TACTGCGCCA TGTGCCGCGA CCGGCTGGCC GAGGCGGGCA GCCCCGCCAG CCACCTGCTG GACCTGCTGC TGCCCCCGTT GCCCGGCGGC AATCCGGACC CGGCGGCGCC CGGCCCGCAC ATCACCGCCC GGCAGGAGAA CCGCGCCCGG CTGCGCGACC GCCTGCTGCG CGAGGTGTAC GGCGAAACCC CGCCCGATGC CGCGCCGGAA GCGGTCCTGC GCATCGGGCC GGACATGCGC GTGGTCATGG AGCGGCGACG CATCCTTGAC GACGACCTGC GCGGCGTGAT CGCCGAGGCC GAACGCGCCG GGCGCTATTT CATCGACACC GATACCGGCA TACGTCTGGC CTGTTTGCGG CGGGTGCGCG TCACCCATTG GGTCGGGTAC GAGCCCGTGT CCGGTGGCGC TGGGAACGGC GCAGGGGGCG ATGCCGTGCC CGTCCACACC ATCCGGCAGG CCTATGCCCA CCGCATGGTG TTGCCGCAGG GACCGCCCGC CGGAGGCTGG AAGGAGTCGC CCCATGAACC GGCCTATCTG CCCGCGTCGG GCAACTGGAC CTGCGCCTGC GGCGGCGCGC CGCGCCCGCT GTCGGTGGAG CTGACATACC TTGGCAGCAC CTTCAACGTA CGGTTGCTGA CCTGCCCGGA CTGCGGGCAG GTGCTGGTGG ACGAGGCCCT GGCGCTGGGC AAGATGCTGG AAGTGGAACA GTTGCTGGAG GACAAGTGA
|
Protein sequence | MDQTSLRDWE SRCIQEEPPR CQAACPLHVD ARAFLGHAAQ GRWREARGVL ERTMPLPGVL ARLCEAPCQA ACLRQEAGGT IAVGALERAC AALSVPAADP RPLPGRGLRA AVLGGGLAAL TVAWDLAKKG HAVTLAGWLE DGDKRGGESV APAGEAGEAD RAARMAATGR LAAIPPDVLP PDALRKELDR LAGLKVRFAA PVAPTAAVLE EMRAAHGAVF VAWSPAAARA LGLPDRSRCD ALTLAHPDLP GVFCGGWPVD GAAGAARSID EAADGRHAAT SMDRHLAGVS LAAGRERQGP FETRLFTSLE GVAAVAPVAL PALSPAGQGA GGPALEAAEG PDASDSPDFP YREEASRCLQ CQCLECVKVC PYLEKYGEYP KKHARRIYNN LAIVKGVHQA NRFINSCSLC GLCGTVCPTG FDMAPLCHEA RRTLVHDGKM PPSTHEFALD DMAFSNGPHA ALLRAPAGAQ TCAWLFLPGC QLAASAPDRV AQAWGMVADR LPGGTGIALR CCGAPALWAG RDDLAAAAAE ELRDGWNAMG RPTLVVGCPS CATTLRTLLP DLPQTMLWSV LAEHGVGGQT TPAPEALTLH DPCAARGDEP LRTTVRGLLA ARGVSVHEPD LTGPHTECCG YGGLMAEADP DLARTVIQRR ADASDLPFVT YCAMCRDRLA EAGSPASHLL DLLLPPLPGG NPDPAAPGPH ITARQENRAR LRDRLLREVY GETPPDAAPE AVLRIGPDMR VVMERRRILD DDLRGVIAEA ERAGRYFIDT DTGIRLACLR RVRVTHWVGY EPVSGGAGNG AGGDAVPVHT IRQAYAHRMV LPQGPPAGGW KESPHEPAYL PASGNWTCAC GGAPRPLSVE LTYLGSTFNV RLLTCPDCGQ VLVDEALALG KMLEVEQLLE DK
|
| |