Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0845 |
Symbol | |
ID | 4663985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 1036562 |
End bp | 1038478 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639819067 |
Product | extracellular solute-binding protein |
Protein accession | YP_966293 |
Protein GI | 120601893 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0910462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCC TGCCGCGCCT GCACCCTCGC GCCTACGTCT TCAGCGGCCT CACCGCCCTC ATGCTCTGTC TCGCCGCCAT CGCCCCCGCC ATGGCGGGTG TCCGCACACA CGCCCTCACC CTTGCCGGGG CCGTACCCGC ACTCTCAACG GGCTTCGCGC ACATGCCGCA TGTGAACCCC GAGGCACCCA AAGGGGGCAC ATTGCGTCTT GGCGTCATCG GCGGCTTCGA CAGCCTGCAC CCCTTCATCA TGCGTGGCAT CCCCGCCGCA CAACTGGGAC TGACCTACGA GACGCTGGGC GAGACCGTAC CCGGCGAAGA TGGCGTCATC TACGGTCTGC TGGCCGAGAG TTTCGAAGAG TCCGCAGACG GCGGGCATCT GGTTTGCCAT CTCGCACCCG CGGCACGCTT CGCCGACGGG CACCCCGTCA CCGCCGCCGA CGTGGTCTTC TCGTTCGATA TGCTCACCCG CCACGGGTCG CCCTTCTATC GCGACTACTA CGCCGGGGTC AGCACCGTGA CCGCGGTGGA TGCCCGCACC GTGCGCTTCG ACTTCACATC GCGTCACAAC GCCGAACTTC CCTACATCGT GGCCCAGTTG CCTGTGCTGC CCCGTCACTG GTGGCAAGGG CGCGACTTCA CCGCACCGCA AGCCGAAGCC GCACCGGGCA GCGGCCCGTA CCGCGTACGC GAGGCACGCC CCGGTTCAGG CATCACCTAC GAACGCGTTC CGGGATGGTG GGGGGCGAAG CTTCCCATCA ACAGGGGCCG CTACAATTTC GACGTCATCC GCTGCGACTA CTACCGCGAC ACCACCGTGG CACGGCAGGC GTTCCTCGCC GGAGAGTTCG ACCTGTGGAA CGAGAACACC ATCAAGGACT GGTTCGCCTC GTACGACGTG CCCTCCGTCC GCGAAGGGCG CATCACCCGC GAGGAGATTC CCCACGGCAG GCCCGAGGGC ATGGGCGGCT TCGTCTTCAA CACGCGCCGT GCCATCTTCG CTGACGTACG CGTCCGCCGT GCCCTTGCCA TGTGCTTCGA CTTCGAATGG ACGAACCGCG CCCTCTTCCA CGACGCCTAC AGACGTTACG ACAGCTTCTT CTCCAATTCT CCGTTCGCAG CCACCGGCAT CGCCACCGAA GGCGAACGCG CCCTGCTGCG CGAAGTGGCC CCCGAAAGGG CCGCCATGCT CTCCGGCCCC CCCCCTCTGC CCGCCACGCA CGACGGCAGC GGCGACATCC GCCCCGTCCT GCGCGACGCC CTGCGCCTGA TGCACGACGC AGGCTGGAAC CTGCGCGACG GACGCCTTGT CGATGGCACG GGCAGGCCCT TCAGGTTCAC GCTGCTGCTC TCGTCCAAAG GTCTGGAACG CGTGGTGCTG CCCTTCAGAC GCAACCTCTC GCGCCTCGGC GTGGAGATGG ATGTGCAGGT GGTCGACCAG ACGCAGTACG TCTCACGGGT GCGTGCCTTC GACTACGACA TGGTGCATGC CACCATGCGC CAGTCGTCGA ATCCCGGTAA CGAACAGCGC GCCTTCTGGA CGACGGCGGC AGCGGACGCG CCGGGGTCGC GCAACTACGC GGGCGTGCGC GACACCGCCA TCGACGCACT GGTCGAACGC ATCATCGCCG CACGCGACGC CGCCACCCTG CGCGACGCCG TCCACGCCCT CGACCGGGTG CTGCTGCACG GGTACTACGT CGTCCCCGGC TGGTACAGCG ACAGGCAACG CATGGCCTAC TGGAAGACCC GCGTGGCGCA CCCCGCCTTC ACCCCGCGTG GGGGCATCGA CCTGCACTCG TGGTGGGCCG TATCCGGCGA TGACACGCAG GCCAAGGGTG TTCAGGCCAA GCCAGCACAG GGCGACGGTA CTCAGGCTGA TGCGGCCCGC AGCACACCCG CAACGGAGAC ACGCTGA
|
Protein sequence | MSALPRLHPR AYVFSGLTAL MLCLAAIAPA MAGVRTHALT LAGAVPALST GFAHMPHVNP EAPKGGTLRL GVIGGFDSLH PFIMRGIPAA QLGLTYETLG ETVPGEDGVI YGLLAESFEE SADGGHLVCH LAPAARFADG HPVTAADVVF SFDMLTRHGS PFYRDYYAGV STVTAVDART VRFDFTSRHN AELPYIVAQL PVLPRHWWQG RDFTAPQAEA APGSGPYRVR EARPGSGITY ERVPGWWGAK LPINRGRYNF DVIRCDYYRD TTVARQAFLA GEFDLWNENT IKDWFASYDV PSVREGRITR EEIPHGRPEG MGGFVFNTRR AIFADVRVRR ALAMCFDFEW TNRALFHDAY RRYDSFFSNS PFAATGIATE GERALLREVA PERAAMLSGP PPLPATHDGS GDIRPVLRDA LRLMHDAGWN LRDGRLVDGT GRPFRFTLLL SSKGLERVVL PFRRNLSRLG VEMDVQVVDQ TQYVSRVRAF DYDMVHATMR QSSNPGNEQR AFWTTAAADA PGSRNYAGVR DTAIDALVER IIAARDAATL RDAVHALDRV LLHGYYVVPG WYSDRQRMAY WKTRVAHPAF TPRGGIDLHS WWAVSGDDTQ AKGVQAKPAQ GDGTQADAAR STPATETR
|
| |