Gene Information |
Locus tag | DvMF_0357 |
Symbol | |
ID | 7172241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 417126 |
End bp | 418022 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643538855 |
Product | histidine triad (HIT) protein |
Protein accession | YP_002434782 |
Protein GI | 218885461 |
COG category | [F] Nucleotide transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases |
TIGRFAM ID | |
|
|
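The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(417126, 418022)
print(length)                  # 897
print(protein_length(length))  # 298
```

Both values match the Gene Length (897 bp) and Protein Length (298 aa) rows of the record.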
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.0469904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCC TCTGGGCGCC GTGGCGCCTC GACTACATCC TGGGCCCCAA GCCCGACAGC TGCGTGTTCT GCCTGCCCGA ACACACCGAA GAGGACGAGG CACGCCTGGT CCTGTACCGG GGGCGGCACA ATTTCGTGAT CATGAACAAG TTCCCCTACA ACAACGGGCA CCTCATGGTC ACGCCGTTCC GGCACGTCAT GGACCTTGCC GCGCTGGAAC CGGACGAAGC CCACGAAATG ATGGACCTGC TCCAGACCTG CACCACCATG CTGCGCCAGC GCTTCAACCC GCAGGGCATC AACGTGGGGC TGAACCTGGG CGAGGCCGCC GGGGCGGGCA TCCGCGAACA TCTGCATTTC CATCTGGTGC CGCGCTGGAA CGGCGATTCG TCGTTCATGG CGGTGATGAG CGAAACCCGG GTCATTCCCG AGCATCTGCA TTCCACGTAT CATGCCCTGA AACCCTGTTC GACCGCCTGC CCCGCGCGGG ACGGTAAGGA GGCCGACATG CGCTTCGTCA AGGTTCTCGT TCTGGTTCTG GTGTTCTTCA TCTCCATGAT GTTCTTCGTG CAGAACAACG CGGTGCTCTC GCAGACGGTC ACCCTGAAGC TGGACCTGTT CTTCGACACC GCGTGGAGCT CCATCGCCCT GCCGTTCTAT TTCATGGTTC TGTGTGCCTT CCTGATGGGC GCCCTGCTGA CCATGCTGCT GCTCATGATC AGCCGCATGC GCGCCGGTGC CGCCCTGCGC CGCGCCAACA AGCGCATCCG CGTGCTGGAA AAGGAACTGA ACTCGCTGCG CAACCTGCCG CTGGAAACCG CGCGCAAGAC CCCGGAACCT GTGGCCGCCC CCGCGCCCGT TGCCGCCACG GACCCCACCC CGGCCAAGGC CGGCTAG
|
Protein sequence | METLWAPWRL DYILGPKPDS CVFCLPEHTE EDEARLVLYR GRHNFVIMNK FPYNNGHLMV TPFRHVMDLA ALEPDEAHEM MDLLQTCTTM LRQRFNPQGI NVGLNLGEAA GAGIREHLHF HLVPRWNGDS SFMAVMSETR VIPEHLHSTY HALKPCSTAC PARDGKEADM RFVKVLVLVL VFFISMMFFV QNNAVLSQTV TLKLDLFFDT AWSSIALPFY FMVLCAFLMG ALLTMLLLMI SRMRAGAALR RANKRIRVLE KELNSLRNLP LETARKTPEP VAAPAPVAAT DPTPAKAG
|
| |
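The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TAG, although the record lists the strand as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the usual compact TCAG-ordered layout.
# Table 11 (bacterial) shares these codon-to-amino-acid assignments with
# the standard table; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGAAACCC TCTGGGCGCC GTGGCGCCTC"
print(translate(first_blocks))  # METLWAPWRL
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.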