Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2026 |
Symbol | |
ID | 4662513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 2357913 |
End bp | 2361977 |
Gene Length | 4065 bp |
Protein Length | 1354 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639820269 |
Product | hypothetical protein |
Protein accession | YP_967469 |
Protein GI | 120603069 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.368957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGA AACAGAATGA CGCCACGAAA CATCCACACA TACGGGAACT GCTCAGGCGC ATCACCCCTG CCACACGGCG TGGACGCGTC ATTCTCTGGT CGTTGCTGGG GTTGTACATC TTCTGGCTTG TCATCGGCGG TCTCGTACTT CCCCCCGTTG TCCGCTCTGA ACTCGAACGC ACCATGGCAC AACACCTTCG GGCCACCTGC ACCGTAGAGA AGGTGACCAT CAACCCCTTC ACCCTGCGTA TCCGTGTCCT CGGCGTGAAG GTCCCCGATG CCAGCGGCGA AGGGGTGCTC TTCGGCTTTC GTGAACTGAG CATCGCCCCA AGTCCGGCAG CCCTGTTCCG GCTGGCACCT TCGCTCGCTT CGGCGCGCCT CGTAGAACCC GTTGTGGACA TCACCTATTT CGGCGAAGGC CGCTTCTCGT TCTCGGACAT CGTGCCCCCG TCCGAGGCGA CGACAGACGA CAAGGCTACT CCGGTATTCC CCTTCGTCAT CAGCGATTTC GAACTCGTGG ACGGCAGCTT CATCTTCCGT GACGAACCGC GCGGCGTGAC GCACACCATC GCGGACATCG ACTTCATCGT GCCTTTCACC TCCTCGCTGG ACATGTTCCG CGACACCCCC ATCACCCCAT CCCTCAATGC CACAGTCGAC GGCAGCCGCA TGACCGTGGC TGGCAGACTG CTTCCCTTCG CCGAGACACA ACGAACGGAA TTCGACATCG CCACCGAGGA TGTGGCCCTC GAACAATTCA AGGCCTATCT TGCGCCCTTC ACTCCACTGC GGCTTGAGCA GGGCAAGGCT CGTCTCGAGC TCGACCTGCT GGTTGAAAGG CTTCCCTCCG GGCAGGTCGA ACTGGGGCTT GGAGGAGCAC TGCGCCTTTC CGACATCCTT CTCAATACGC CCGACGGGAA AAAGGCGGCA GCACTACGTG AAGCCGAACT GCGTCTGCAC AAGTTCACAC TGGCGGAACG CCGCGTGGAA CTCGAGTCCG CAACTGTGGA CGGCCTTTAT GTCAAGGCAG TGCGCGACAC CGACGGCACC GTCGACTGGC AACGCTGGAT AAGCCCGGCC TCTGGCAAGG CAGCCCCTGT CACGCCTGCG ACATCGCGCA CGGCCATGCA GAACGCGACC GGGGCAGCCA TGACGCAGAA TGCGACCGGG GCAGCCTCCA TGCCTGCCTC CGCCACCGCC ACGGACAAGG CTTCGGCAAA GAACCCTGCG GCGGGAACGC CCCCCGTAGC GGCCACCTCC GGCAGTAGTC CCGAGGGACA TTCCACCGGC AAATCCGCAG CCCCCACAGC CGACAGCAAG GCCTTCATCG TCGAAGGGGC TGCCTTGCAT CTGAGCGATG CGACGCTTGT CTGGCACGAC GCATCACTCT CCGGCACACG CGAGATAGCG GTCACCGGGC TCGACGTGCA GATTCCCCGA TTCTCCACGG GTGACAACAA GACCATGCCG TTTGCCCTCA CTTTCGGATT GAATGGTCAG GGGCGTTTCC ATGTAGATGG CGAGGCGACG CTCTCTCCGT TGAAGGTGAG CGCCGCCATC GACAGCACAG GACTGCCGCT TGCCGCCGCG CGGCCCTTCG CCGGGGGCAC ACCTGCCTCC GACATCGCAG GGAGCTTCGG GGGTAGAGCC AAGGTGGTCT TTCAATCCTC TCCCGCGCTG CAACTGACCG TATCCGAAGG GGCGCTCATG GTGGACGACC TCGCCCTCGC CGCACAGGGA AAACAGGGCC ATGCCCTTGG CGTCAAACAC ATCGGGCTCA AGGGACTCGC CGTCGATTAC GGCAAACAGT CCATCCGGGC AGCGGTGCTC GCGCTGACCT CCCCCTCGGT CAATCTCATC CTCGGTGACG ACGGGCTGCC GCTGCTTCCG GCATCCACCG GAGACTCACA GCCCGACACG GGCAAGAAGG TCAAAGGCGA CAGGCAACGA CGCGCACAAG GCAAGGCGGG CTCTTCCACA AAGGTGGAGT CCAAAGCCCG GGGCACGCAA GAGAAGGCCA GACGCGGCGA CACCAAGACC GCAGACAGGG ACTGGAACCT TGTGCTGGAC AGCCTCGAAC TGGATGGAGG TACCGTCAAC ATCACCGAAC GCGGCGCTAA GGCACCGACC CTGCAGGTGT CCGACCTTCG CGTGCGCACG GGAGCGCTCT CGCCAGACCT GACCCAGCGG CTGCCCTTCG ACGCATCCAT GCGCTGGCAG AAGGACGGGC AACTCGCCCT CAAGGGGAAT GTGCGCATCC GGCCTCTCGA CCTCGACCTG AACGTCAAGG CCACAAAGGT CGACCTTGCG CCGCTCGACA TCCCCCTTGC CATGTCCACG GCCATGCAGG CCGGGGGGCG CCTTTCCGGC GATGTCCGGC TGGGCGCACG GGAACGCGGC GATGACATCC AGATGACGGC ATCGGGCAGG ACGCAACTGG ATGACGCCCG TCTGCGCAGG CGCGGAGACC GGCGTGACCT CATCTCGCTT CGGAGGCTGG CGGTCAGGGA CTTCCGGTAC GGTTCGTCCC CGCTGCGCGT CGAGATTGGC GACATACTCC TCGACCGGCC GCAGGTGTTC CTCGTGCTGC ACAAGGACGG CACCACCAAC GTACTGCGCG CCCTCGACCC GGAAGGCGCG GAACGCAGGG CTGCGGCCAT CCGTACGGCA GAAAAGGCCA AGGCCGCAGA AGGAGCGAAG AAGCAGGGAA CGCAGACCGG GGCCGAGGCC TCCTCGGGTC TTGCGTCGAA GCCGGTGTCC CCCGCCCCTG TGGCTGCCGG AGAGACGACT GCCGAGGCAG ACGCGTCGGG GGCAGACGCT TCGGGGGCAG GAGCGGACGC ATCCTCCGCT GCGGGAGCCC GGGCAGAAGC ACCAGCATCG CTGTTCGACA GGTTCACGCT GGGCGAGGTG ACCGTCCGCG GTGGCAAGAT AGCCTTCCGC GACGAGCGTT TCTCTCCGGC CTTCGACACC TCGCTGGACA AGGTCGATGC AGCCGTGACC GGATTCACCA TGGCCCCGGA AAGCCGCGCC GAGGTCTCGG CTGGCGGAAC GCTCGAGGGT GTGCCCGTCA AGCTCACGGG GACGCTCAAC CCCGTATCGA CGCCCCCCTT CGCCGACATC GTCTTCTCCA TGGAAGGGCT GGACCTCGTT CCGGTATCAC CATACGCGCT TCAGTATATC GCCTACCCGG TCGACAAGGG ACGCCTCACA GCCCGTTTGC AATTGCAGAC ATCCGAATGG GTACTGAGCG CCGATTCGAA GTTCCTGCTG GAAGACATCG AACTTGGCGA CAAGGACTCG CGTCCCGATG CCCCGGACTA TCCGGTCAAA CTCGGACTGG CCCTCTTGCG GGGGCTTGAT GGCAACGTGT CCATCGACCT GCCGGTACGG GGGCGACTCG ACGACCCCAA CTTCAGGCTT GGAGGCGTGG TGGTCCAGGC CGTCATGAAC CTCATGGTCA AGGTGGTCAC ATCGCCCTTC GCCCTCGTCG GCAGCGTGGT GCGCCTTGCC GGAGGCGGCG GGCAGGACAT GCGCAACGTC CCCTTCGAAC CCGGGCGGGA AACGCTCTCC GAAAGGGCCG AGGCACAACT GGCAAGCGTG GCGGAGGTAC TGCGGCAAAG GCCCGGACTT TCACTGGAGG TGCGCGGCAT GGTCGACCCT GCGACCGATG GGCAGGGGCT TCGCGAAGTC GCCCTTCTGC GCCGGATGCA GGAGGCGAAG TATGCTTCGC TGTGGCGCGG TGAGCGCGCC AAGACGACGG TCGAGGCCAT CACCATCGAA GACGACGAGT ACGACGACCT TCTCGAGTCC GTCTACAAGG ATGCCCCCTT CGACAAACCC CGCAATGTTC TGGGACTCGT AAAAGACCAG CCTCGCGAGG TCATGGAAAA GGCCTTCTAT GAGCACGAGG ACGTCACGGA CGACGACCTG ACAGCCCTCG CACAGCAGCG GGCACGTGCC GTGCGCGACA GATTGCTTGA AATCGACCCG GCACTCGGCG CACGGCTTTC ACTCGCCGCT GCCACAGGCA AGGGGAAGAG CGCGGCAGAG ATGCTCTTGC GCTAG
|
Protein sequence | MTAKQNDATK HPHIRELLRR ITPATRRGRV ILWSLLGLYI FWLVIGGLVL PPVVRSELER TMAQHLRATC TVEKVTINPF TLRIRVLGVK VPDASGEGVL FGFRELSIAP SPAALFRLAP SLASARLVEP VVDITYFGEG RFSFSDIVPP SEATTDDKAT PVFPFVISDF ELVDGSFIFR DEPRGVTHTI ADIDFIVPFT SSLDMFRDTP ITPSLNATVD GSRMTVAGRL LPFAETQRTE FDIATEDVAL EQFKAYLAPF TPLRLEQGKA RLELDLLVER LPSGQVELGL GGALRLSDIL LNTPDGKKAA ALREAELRLH KFTLAERRVE LESATVDGLY VKAVRDTDGT VDWQRWISPA SGKAAPVTPA TSRTAMQNAT GAAMTQNATG AASMPASATA TDKASAKNPA AGTPPVAATS GSSPEGHSTG KSAAPTADSK AFIVEGAALH LSDATLVWHD ASLSGTREIA VTGLDVQIPR FSTGDNKTMP FALTFGLNGQ GRFHVDGEAT LSPLKVSAAI DSTGLPLAAA RPFAGGTPAS DIAGSFGGRA KVVFQSSPAL QLTVSEGALM VDDLALAAQG KQGHALGVKH IGLKGLAVDY GKQSIRAAVL ALTSPSVNLI LGDDGLPLLP ASTGDSQPDT GKKVKGDRQR RAQGKAGSST KVESKARGTQ EKARRGDTKT ADRDWNLVLD SLELDGGTVN ITERGAKAPT LQVSDLRVRT GALSPDLTQR LPFDASMRWQ KDGQLALKGN VRIRPLDLDL NVKATKVDLA PLDIPLAMST AMQAGGRLSG DVRLGARERG DDIQMTASGR TQLDDARLRR RGDRRDLISL RRLAVRDFRY GSSPLRVEIG DILLDRPQVF LVLHKDGTTN VLRALDPEGA ERRAAAIRTA EKAKAAEGAK KQGTQTGAEA SSGLASKPVS PAPVAAGETT AEADASGADA SGAGADASSA AGARAEAPAS LFDRFTLGEV TVRGGKIAFR DERFSPAFDT SLDKVDAAVT GFTMAPESRA EVSAGGTLEG VPVKLTGTLN PVSTPPFADI VFSMEGLDLV PVSPYALQYI AYPVDKGRLT ARLQLQTSEW VLSADSKFLL EDIELGDKDS RPDAPDYPVK LGLALLRGLD GNVSIDLPVR GRLDDPNFRL GGVVVQAVMN LMVKVVTSPF ALVGSVVRLA GGGGQDMRNV PFEPGRETLS ERAEAQLASV AEVLRQRPGL SLEVRGMVDP ATDGQGLREV ALLRRMQEAK YASLWRGERA KTTVEAITIE DDEYDDLLES VYKDAPFDKP RNVLGLVKDQ PREVMEKAFY EHEDVTDDDL TALAQQRARA VRDRLLEIDP ALGARLSLAA ATGKGKSAAE MLLR
|
| |