Gene DvMF_1952 details

Gene Information

Locus tag: DvMF_1952
Symbol:
ID: 7173870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2412960
End bp: 2414591
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 68%
IMG OID: 643540468
Product: glycosyl transferase family 2
Protein accession: YP_002436363
Protein GI: 218887042
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
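
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2412960
end_bp = 2414591

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1632 0 543
```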


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 105
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTACC GCTACCTCGC CCCCGGCCTT CAGACGGAAC TGGAAGCCCT CACCGTCGAT 
GACGCCATCG ACCACCTGCG CAACCATCTT GGCAACTTCC TGCTCGATGC ATCGGTGTGC
GTCAGCTACC TGAACCGGCT GGCCGCAGAA TCCGGCGGAC CTGATTCGCC TCGCCATGTG
ACCTGGCTGC GCTGGCTGGT CCGCGCCCTT GCCCGGCTGC GGCCATTCGA CGAGCAGGCC
GTAACCCTGG CCGCCCGGGT GAACGGCACA CCGGAAGACA CCCCCCTGAT CCGGGCCATG
GCCAAGGTGC GCACCCCGGA GGCCCTGCAC CACCGGGTGG AACAGACCGC CAACCAGGCC
CCGGCTGAAG CACGCGACGT GTTGTTGCGG CTGTTCCGCG AAATGCCGTT TTGCGTGGAC
ATGGCCGAGC GGCTGCTGTT CCTGGACCTG CAACTGGGAC TTGTGCCCGG AGGCGGCTGG
TACGAAGGGC TGCGCTGCCC GCCCCTGCTG CGCGACATGC TGGACCGGGA GCGTTTTCGG
GCGTGCATGC TGTGCGGCAA CGACGCCATG GCCCTGGAAT TGCTTGACCA CACGCGCACA
GCCGGAACAC ACGACCCCGG CTGGCTGAAC TGCGCGGCGG AACTGGCCGT GCGCACCGGC
GACCGCGCGA CGGGCATGGA CTTCTACCGG GCGTCACTGG GTCTGGACCC CATGCAGGTA
CCCGTGGCCC TGCGCCTGCA CGAACTGGAG CAGCCCTTCG CCACCCCGCC GGACGCCCTT
GCCCCTGCGC ACGGCCCCGT GGCCGTCTTT CTGTACTCCT GGAACAAACG CGACCTGCTG
GAGCAGACCC TGCGCTCGCT GGCGGCGTCG GACACGGGCG GTGCCTCCGT CACCCTGCTG
CTGAACGGCT GCACCGACGG CTCTCCCGAA ATGGCGGCGG GGCTGAACGC CAGTCTGTTC
GGCGGGCGCA TGGATATCAT CGAACTGCCC GTCAACGTGG GCGCCCCGGC GGCGCGCAAC
TGGCTGCTGC ACACCCCGCG CGGTCGCGAA GCGGCCTTCG TGGCCTTTCT GGACGACGAC
GTGGAGGTGC CCGCCGACTG GCTGTCCACG CTGATTTCCG TACTGCGGGC CAATCCGCGC
GCCGGGGTGG TGGGTGCCAA GACGGTCTTT CCCGGTTCCC CCCGCCGGTT GCAGTACCTG
TACCGCAACG TTTCCGTGGC CCAGCCGGGG CTGCTGCGCG TCAGCCTGGG CACGCCCAGC
TTCAACTACG ATGGCGGAAC CTACGACGTC ATCCGGCCCA CGGCCAGCGT CATGGGCTGC
TGCCACGTGT TCACCCGCAC CGCGCTGGAC GCGGTGCGCG ACTTCGACAT CCGCTTCTCG
CCCTCGCAGA TGGACGACAT CGCCCACGAC CTGGATCTGT GCCTGCACGG CTTCGAGGTG
GTGTACTGCG GCCTTGTCTC CTGCGTACAC CATCAGATGT CTGGAGTGGG CATCGGCAAC
GTGCATGCCG CCCGCATGGG TAACGTGCTG GGCAACGACG TGAAGTTCTA CTACCGCTTC
GCCGAACATC TGGACGCGTT ACGCAGGCTT ACCGCACGCC ACGCGGTGCC CCAGATGCCG
CCGGATGCAT AG
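
The gene sequence is listed in blocks of 10 bases, so spaces and line breaks have to be stripped before any analysis. As a minimal sketch, the hypothetical helpers `clean_sequence` and `gc_fraction` below clean such a listing and compute its GC fraction. Run on the full gene sequence, the result can be compared with the GC content field, assuming that figure was rounded to a whole percent.

```python
def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed above (six blocks of 10 bases).
first_line = "ATGCACTACC GCTACCTCGC CCCCGGCCTT CAGACGGAAC TGGAAGCCCT CACCGTCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```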
 
Protein sequence
MHYRYLAPGL QTELEALTVD DAIDHLRNHL GNFLLDASVC VSYLNRLAAE SGGPDSPRHV 
TWLRWLVRAL ARLRPFDEQA VTLAARVNGT PEDTPLIRAM AKVRTPEALH HRVEQTANQA
PAEARDVLLR LFREMPFCVD MAERLLFLDL QLGLVPGGGW YEGLRCPPLL RDMLDRERFR
ACMLCGNDAM ALELLDHTRT AGTHDPGWLN CAAELAVRTG DRATGMDFYR ASLGLDPMQV
PVALRLHELE QPFATPPDAL APAHGPVAVF LYSWNKRDLL EQTLRSLAAS DTGGASVTLL
LNGCTDGSPE MAAGLNASLF GGRMDIIELP VNVGAPAARN WLLHTPRGRE AAFVAFLDDD
VEVPADWLST LISVLRANPR AGVVGAKTVF PGSPRRLQYL YRNVSVAQPG LLRVSLGTPS
FNYDGGTYDV IRPTASVMGC CHVFTRTALD AVRDFDIRFS PSQMDDIAHD LDLCLHGFEV
VYCGLVSCVH HQMSGVGIGN VHAARMGNVL GNDVKFYYRF AEHLDALRRL TARHAVPQMP
PDA
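
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It builds the codon table from the NCBI amino-acid string for table 11, whose codon-to-amino-acid assignments match the standard code, and uses a hypothetical `translate` helper. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and the final line of the gene sequence listed above.
print(translate("ATGCACTACC GCTACCTCGC"))  # prints: MHYRYL
print(translate("CCGGATGCAT AG"))          # prints: PDA
```

Both fragments agree with the start (MHYRYL) and end (PDA) of the listed protein sequence.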