Gene Dvul_3047 details


Gene Information

Locus tag: Dvul_3047
Symbol:
ID: 4661976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008741
Strand:
Start bp: 125404
End bp: 126723
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 70%
IMG OID: 639813967
Product: hypothetical protein
Protein accession: YP_961246
Protein GI: 120586901
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID:
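
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source database. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 125404
end_bp = 126723

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1320 0 439
```

Under those assumptions the computed values agree with the listed Gene Length (1320 bp) and Protein Length (439 aa).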


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.836021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTCG TCCTCGTCCT CGGTGCCGAG ACCATGCTGG CGCGCGTGGT CAGCCGCAGC 
CTGCATCGCG CGGGTTTCAC CGTGCTGGCG GCCTCCTCCA CGCCGTGGCC CATCTGCGCC
TATTCGCGCT ATGTGCGGCG CACCTTCACC CACGCCGACC CGAAGCACGA CGAGTCGCGG
TTCATCGACG ACATCCGCCG CATCTGCGAG ACGCAGGGCG TGGACGTGCT GTTGCCCATC
CTGCGCGAGT GCGCCGTCAT CGCACGGCAT CGCCACCTCT TCGGCCCCGG CGTGCGGATG
CTGCTTGGCG ACGCCGCGAC GCTGGCCGAC TTCGGCGACA AGTACCGTAC CTACGAGGTG
GCGCGCGACG CGGGGCTTGC CGTGCCCGAG TACCGCAGGG CTGCCGACCT CGCTGCCGAC
CCGGCGGCCC TTGCGGCCTT TCCGTGCCCG TGCCTCGCCA AGCCCGTGTG GGGGTGGGGC
GGCTACGGGA TGTACGAATG CGCCAGCCCG CAGGAGGTCG CCGCCCGCAT CACGGCCATG
ACCGACCGAC AACGCGAAGA CTACTTCATG CAACAGCGGA TGCCGGGTGA CGTGGTGTGC
GTGGCCATGC TGTGCGAGGC GGGGCAGATG CACGCGTGCG ACACCTTCCG CATCGTGGCC
TCGTACCCGA GGCGGCACGG GCAGTCGACA CTGCGCGAGT CGGTGCGGGC CGACGCCGCC
GTGGACGCGC TGCGGACGCT GCTTGCCCAT GTGGGCTGGA CGGGCCCGTG TCAGGCCGAC
TTCATCATCG ACCCGGTCAC GGGCACGCCG TACCTCATCG ACATCAACGC CCGGTACTGG
AATTCGCTGA TTCAGAGCAC CGCCCGCGGG GTGGACTTTC CCGTCATGCA CTGCCGGATG
GCGCTGGGCA TGGGTGATGC GGGCGGCGCA GGCGGCGCGG GAGATGTTCC GGGCACCGGC
GCGGCTGGCG TGTCGCCAAG TGTTCCGCCG GGTGTGCCCC CGAGTGTGCC GCCGGGTGTG
CCGGAAGGCG CGCCGGGCAT GAGTGCAGGC ATGTGCGCGG ACAAGGACAC GGGGGTGAGC
ACGGCATGGT TCAGTCGCGC CCTGCGCGGC GACCCGGCCC TGCTGCTGCG GCGTCTCTTC
TCGCGCCCGC AGGGTCAGGC CGCACGGGGC ATCGCCGCCT TCGACGACTG GGACGTCCGC
GACCCGCTGC CCTTCTTCGC ATGGCCCCTG CGGCATCTGC TTGGGCGCAT CGCCTCGCGG
GTGGCCCCGC ATCACTACGC AACCGATGGA ACAGGACGAG GTGAAGCATG TCGCTCATGA
 
Protein sequence
MSVVLVLGAE TMLARVVSRS LHRAGFTVLA ASSTPWPICA YSRYVRRTFT HADPKHDESR 
FIDDIRRICE TQGVDVLLPI LRECAVIARH RHLFGPGVRM LLGDAATLAD FGDKYRTYEV
ARDAGLAVPE YRRAADLAAD PAALAAFPCP CLAKPVWGWG GYGMYECASP QEVAARITAM
TDRQREDYFM QQRMPGDVVC VAMLCEAGQM HACDTFRIVA SYPRRHGQST LRESVRADAA
VDALRTLLAH VGWTGPCQAD FIIDPVTGTP YLIDINARYW NSLIQSTARG VDFPVMHCRM
ALGMGDAGGA GGAGDVPGTG AAGVSPSVPP GVPPSVPPGV PEGAPGMSAG MCADKDTGVS
TAWFSRALRG DPALLLRRLF SRPQGQAARG IAAFDDWDVR DPLPFFAWPL RHLLGRIASR
VAPHHYATDG TGRGEACRS