Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0516 |
Symbol | |
ID | 4663180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 657988 |
End bp | 658998 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639818726 |
Product | ApbE family lipoprotein |
Protein accession | YP_965966 |
Protein GI | 120601566 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0154508 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCT TCAGCAGAAG AACCTTTCTC GGCATGGCCG GACTCACCGG CGCCGGTCTC TGTCTCGGGC TGCCCGTGGG TGCCCGGGCG GCGCAGGCAC TGCCCGTGAC CGAAACCCGT GTGCAGATGG GAACCTACGT CGGCATCACG GTCGCCGGAG TCTCCGCCAT GCAGGCGGAA GAGGCCCTCG GGCGGGCCTT CGCGGAGGTG GCACGCCTCG AGGCGGTGTT CAGCCGCTTC GACGGGGCGA GCCCCGTCAG TGAACTGAAC CGCGCCGGGC GTCTGCCTGA CGCACCCGCC GAACTCGTCA CCCTCGTCGA CCGCGCCCGT CGCTACGGCA GCCTCACCGA CGGTGCCTTC GACATCACCG TCCAGCCCGT GGTCGACCTC TTCCGGCGCA ACGGCAACCC GCGTGGAACC ATGCATGTCG ACGAGGCCGA CCTCAAGGCC GCCCGTGAAC TGGTGGGGCT TGCCCACCTG CAGTCCGGCA GCGGACGACT CGGCTTCGAC CGGTCGGGCA TGGGCATCAC CCTCGACGGC ATCGCCAAGG GGCACATCGC GGACATGGCG TCCGCAGTCC TCACCGCGCA CGGCGTCACC GACCACATCG TCAATGCGGG CGGCGACATC ATGGTGCGGG GCATGAAGGC TCCCGACACG GCATGGCGCG TGGCAGTGGC CTCGCCCAAC GGAGGGGCCT CCTATCCCGA GACCGTGCGG CTGACCGAAT GCGCCATCGC CACTTCGGGC ACATCCGAGG TGTACTTCGA CGCACGGCAC CAGCATCACC ATCTCATCAC CCCTGTGGCA GGGCGCAGCC CCGCCAGTAC GGGCAGTGTG TCCGTCATCG CCCCCACGGT GATGGAGGCG GACGCCCTCG CCACCGCGCT CTCTGTCATT CCCCCGCAGG ACGCGCTGCG CCTTGTGGCA TCGCTGCCCG GTCGCGCCTG CTGCATCTTC ACACGCGACG GCCGGCGCTT CACCTCGTCC AACTGGGCGA CCTTCGCCTG A
|
Protein sequence | MKTFSRRTFL GMAGLTGAGL CLGLPVGARA AQALPVTETR VQMGTYVGIT VAGVSAMQAE EALGRAFAEV ARLEAVFSRF DGASPVSELN RAGRLPDAPA ELVTLVDRAR RYGSLTDGAF DITVQPVVDL FRRNGNPRGT MHVDEADLKA ARELVGLAHL QSGSGRLGFD RSGMGITLDG IAKGHIADMA SAVLTAHGVT DHIVNAGGDI MVRGMKAPDT AWRVAVASPN GGASYPETVR LTECAIATSG TSEVYFDARH QHHHLITPVA GRSPASTGSV SVIAPTVMEA DALATALSVI PPQDALRLVA SLPGRACCIF TRDGRRFTSS NWATFA
|
| |