Gene Dvul_2033 details

Gene Information

Locus tag: Dvul_2033
Symbol:
ID: 4662496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2368817
End bp: 2370013
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 62%
IMG OID: 639820276
Product: tyrosyl-tRNA synthetase
Protein accession: YP_967476
Protein GI: 120603076
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
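
The length fields in this record follow from the coordinates. The sketch below is a minimal cross-check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Cross-check of the record's length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS length counts the terminating stop codon.
start_bp = 2368817
end_bp = 2370013

# Inclusive coordinate span in base pairs.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields.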


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0657081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA TCGACAGGCA ACTGGAGCAC ATCAAGCGGG GCTGCGCCGA ACTCATCGAC 
GAGGGTGAAC TCCGCAAGAA GCTTGAGCGG GGCACGCCGT TGCGCATCAA GGCGGGGTTC
GACCCCACTG CGCCCGACCT GCACCTCGGG CACACGGTGC TCATCCACAA GCTGCGCCAT
TTTCAGGAAC TCGGCCACAC CGTAATCTTC CTCATCGGCG ACTTCACCGG GCTCATCGGT
GACCCCTCGG GTCGTTCCGA TACCCGTCCG CCGCTGACGC GCGAGCAGGT GCTCGCCAAT
GCCGAGACCT ACAAGCAGCA GGTCTTCAAG ATTCTCGACC CGGAAAAGAC CGTGGTCGAC
TTCAATTCGC GCTGGATGGG TGAATTCGGC GCGGCGGACT TCATCAGGCT CGCATCTCGC
TATACCGTGG CGCGGATGAT GGAGCGTGAC GATTTCGAGA AACGCTACAA GGAAGGACGC
CCCATCGCCG TCCACGAATT CCTGTACCCG TTGGTGCAGG GCTACGATTC CGTGGCCCTC
AAGGCCGATG TGGAACTGGG CGGTACGGAC CAGAAGTTCA ACCTGCTCGT GGGGCGGCAT
CTGCAGTCTC AATACGGGCA GGAGCCTCAG TGCATCCTCA CCATGCCGCT CCTCGAAGGG
CTGGATGGCG TCAAGAAGAT GTCAAAATCC CTGGGCAACT ATGTGGGTAT CGATGAATCG
CCCGCCGACA TGTTCGGCAA GCTCATGTCC GTCTCAGACG AACTGATGTG GCGCTACTTC
GAACTCATCT CCTCGCGTTC CCTCGATGAA ATCGCCGACC TTCGCCGCAA GGTGGAGACG
GGTGAGGCGC ATCCCAAGCT GGTGAAGGAG TCGCTGGCCT ACGAATTGAC CACCCGCTAC
CATGGCGAAG ACAAGGCCGC AGAGGCACAG CAGGGCTTCA ATGCCGTATT CGCCGGTGGC
GGCGTGCCGG ACGACGCGCC GGTGCATGCC TGCGACCATG GCGACGACAG CACCCCGCCC
GCCTTCCTTG AAGCCGCAGG ACTCGTGAAG TCCCGTGGCG AGGCCAAGCG CCTCATCAAG
GAAGGGGCAC TGTCTGTGGA TGGGGTACGC TGCGATGACG CCAATAGCCC CCTTGCCTCT
GGCGAGTACG TCATCAAACT CGGCAAGAAG CGCTTCCTGC GCCTCACCGT GCGCTAG
 
Protein sequence
MIDIDRQLEH IKRGCAELID EGELRKKLER GTPLRIKAGF DPTAPDLHLG HTVLIHKLRH 
FQELGHTVIF LIGDFTGLIG DPSGRSDTRP PLTREQVLAN AETYKQQVFK ILDPEKTVVD
FNSRWMGEFG AADFIRLASR YTVARMMERD DFEKRYKEGR PIAVHEFLYP LVQGYDSVAL
KADVELGGTD QKFNLLVGRH LQSQYGQEPQ CILTMPLLEG LDGVKKMSKS LGNYVGIDES
PADMFGKLMS VSDELMWRYF ELISSRSLDE IADLRRKVET GEAHPKLVKE SLAYELTTRY
HGEDKAAEAQ QGFNAVFAGG GVPDDAPVHA CDHGDDSTPP AFLEAAGLVK SRGEAKRLIK
EGALSVDGVR CDDANSPLAS GEYVIKLGKK RFLRLTVR
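
The blocked sequences can be checked against each other and against the Gene Information fields. The following is a minimal sketch, not the database's own pipeline: it strips the block spacing, computes a GC fraction, and translates codons using the standard sense-codon assignments, which translation table 11 shares. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first line of each sequence is used as input here.

```python
# Hypothetical helpers for checking the record's sequences; names are illustrative.
# Standard codon assignments in TCAG order. Translation table 11 uses the same
# amino acid assignments and differs only in which start codons it permits.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(blocked):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of each sequence block from this record.
first_gene_line = "ATGATCGACA TCGACAGGCA ACTGGAGCAC ATCAAGCGGG GCTGCGCCGA ACTCATCGAC"
first_protein_line = "MIDIDRQLEH IKRGCAELID"

dna = clean(first_gene_line)
print(translate(dna) == clean(first_protein_line))
print(round(gc_fraction(dna), 2))
```

Substituting the full gene sequence for `first_gene_line` should allow the same comparison against the complete protein sequence and the GC content field.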