Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1145 |
Symbol | |
ID | 4662483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1394304 |
End bp | 1396001 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639819374 |
Product | extracellular solute-binding protein |
Protein accession | YP_966592 |
Protein GI | 120602192 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00838015 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.546746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTTT TGCGTTGTCT GACCATCGTA TCCCTTGTGG CTGTGTGCCT TTGTGCGGGA TGTGGCGATG ATACTGCCCC CACGGTATCC AAAGAGGCAC CGCAGGTGCC TTCCGGCAGT ACCGCCGGCG AGGGCTTCCC GCCAGCAGGG CGACCCACCG CACAGGAAAT GGCGGAAGAT GGCGGACGCA TCATCATGGG GTCCATCGGG GAACCGTCGA ACCTCATCTC GTTCCTTTCC AGCGATGCGT CCTCGCACGA GGTCGCAAGC CAGCTTTACG TGGGGCTCTT GCGGTATAAC CGTGACCTCG AGATCGAGCC TTGGGCGGCA GAGTCCTACG AGGTGCTCGA CGGGGGCAGG CTGCTGCGGT TCAAGCTGCG CAAGGGTATC CGGTGGCAGG ACGGCGTGGA ACTCACGGCG GATGACGTCG AGTTCACCTA CAAGCTTATC ATCGCGCCCA CGACACCGAC ACCGTACGCG GGCGACTTCC TTGCCGTGCA GGAGTTCCGC AAGACGGGCC GCTACAGCTT CGAGGTGCGC TACGACAAGC CCTTTGCCCG GTCGCTCATA AGCTGGATGC AGGACATCAT GCCGAAGCAC CTGCTGGAAG GACAGGACGT GCGGACGACC CCCTTCGCGC GCAAGCCGGT GGGGGCGGGG CCGTACATGC TCGAATCGTG GGAACCCGGA ACCCGTCTGG TGCTGCGTGC CAACCCAGAC TACTTCGAGG GCAGACCGCA TATCGACGAG GTCGTGTACC GCATCATTCC CGACAACGCG ACCATGTTCC TCGAACTCAA GGCGGGCAAG CTCGACATGA TGGGGCTCTC GCCGCAGCAG TACCTGCGGC AGACCGATGG CTACACATGG GAGCGTGACT GGCGCAAATA CCGCTATCTT TCGTTCGGGT ACACCTATCT CGGCTACAAC CTGAAGCATC CGTTCTTCGC CGATGCGCGT GTCCGCCGTG CCATCGCCCA TGCCATCGAC CGCGAGGGCA TCATCAAGGG CGTGCTTCTG GGACAGGGGG TGCCCACGGT CGGTCCCTAC AAGCCCGGCA CATGGGTATA CAACGACAGG TTAACGGCGT ACTCCTATGA TCCTGCCCTC GCAGCCGAGA TGCTGCGCGA AGCGGGGTGG CAGGACACGG ACGGGGATGG CATCCTCGAC CGTGAGGGAA GGCCCTTCGC CTTCACCATC CTGACCAACC AGGGCAACGA CCAGCGTATC AAGACGGCAA CCATCATCCA GAGCCAGCTC AAGGATGTGG GAATACGTGT ACAGATACGG ACGGTGGAGT GGGCCGCGTT CATCAAGGAG TTCGTGAACA CGGGCAGGTT CGATGCCGTC ATTCTCGGCT GGAACATCAC GCAGGACCCG GATGCCTATG ACGTGTGGCA TTCCTCCAAG GCTGAACCCG GCGGGCTCAA TTTCGTGGGC TATCGCAATG CAGAGGTGGA CGCCGTACTA GAAAAGGCCC GTCGCACTTT CGACCAGGAC GAGCGCAAGC AGTACTACGA CCGTTTCCAG GAGATCGTCC ACCGCGACCA GCCCTACTGC TTTCTGTACG TGCCGTATGC CCTGCCGGTG GTGGCAGCGC GTTTTCGAGG CATCGACCCC GCCCCGGCGG GCCTCATGCA CAACTTCAAC CGCTGGTGGG TTCCCGTCGA CCAGCAGCGC TCAACGGTGC AGCAGTAA
|
Protein sequence | MSVLRCLTIV SLVAVCLCAG CGDDTAPTVS KEAPQVPSGS TAGEGFPPAG RPTAQEMAED GGRIIMGSIG EPSNLISFLS SDASSHEVAS QLYVGLLRYN RDLEIEPWAA ESYEVLDGGR LLRFKLRKGI RWQDGVELTA DDVEFTYKLI IAPTTPTPYA GDFLAVQEFR KTGRYSFEVR YDKPFARSLI SWMQDIMPKH LLEGQDVRTT PFARKPVGAG PYMLESWEPG TRLVLRANPD YFEGRPHIDE VVYRIIPDNA TMFLELKAGK LDMMGLSPQQ YLRQTDGYTW ERDWRKYRYL SFGYTYLGYN LKHPFFADAR VRRAIAHAID REGIIKGVLL GQGVPTVGPY KPGTWVYNDR LTAYSYDPAL AAEMLREAGW QDTDGDGILD REGRPFAFTI LTNQGNDQRI KTATIIQSQL KDVGIRVQIR TVEWAAFIKE FVNTGRFDAV ILGWNITQDP DAYDVWHSSK AEPGGLNFVG YRNAEVDAVL EKARRTFDQD ERKQYYDRFQ EIVHRDQPYC FLYVPYALPV VAARFRGIDP APAGLMHNFN RWWVPVDQQR STVQQ
|
| |