Gene Dvul_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2621 
Symbol 
ID4663669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3054578 
End bp3056149 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content68% 
IMG OID639820867 
Productanthranilate synthase component I/chorismate-binding protein 
Protein accessionYP_968060 
Protein GI120603660 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.260936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGCA TACTCTCCGA CTGCGTTGAC AGTGCCACCT TCGCCCGTCT CGCCCTCGCG 
CTGGCACGCG AAGGGGCGGA CATGCTGCTG TGTCACCCTG CCCCCGACGG CGGCGGGATG
CCCCCCGGCT GGACGGCATC CGATGCGACG TCGCCCGCCC GTCCATGCTG CCTTGTGGGA
TTCGACGTGG CTGGCGAGAT ATCGCCTCCG CCCGACGGAG ACCTCACAGA CCTTCATGCC
TTCTGCTTCG GCGACGGCAC GCAACAGGTG CGCATCGCCT CCGACGTGCC ACGCCTCGCC
TTTGGCTGGC TTTCCTACGG GTGGGGCATG GCGCTGCACG GCATCGCCTC CGCAAAGCCT
GCCGAGTCGG GATACGGCGT GCTACGCCGC TACCACTGCC TACTGCGCTG GTATCCGCAT
GGCGGCGACC TTGAAGCCGA GGTCTCGGCA ACGGGACAAC GCAGGCTGCC CATGCTGCGT
CGCCTGCTTG ATGCGCTACG CACGGAAGAA CGCGCCCATA CGCTTACGGA GACACTGACG
GAAGAACACC GCGAGGCTGG TACGGCGATG CGGCAGGAGA AGCAGGGAGC ACCCCGGGCA
GCAGCGACCA GCAAGGGAAG CGGGGGAGAT GGGGGCGACA ATCCCTACGG GGACGTCAGG
AACCACGGGG AAGGCAGGGG CTTGTCCGGG GCGCCGCGAG ACGGCACCGC GCCGGACGAC
GACCTCGCCC TCGCACTGGA CGAACTTGTC CCGTCGCTGG ATGCCACGAC CTACCCGGAT
GGCGTACGGC AGGTGCTGGC TGCCATCCGC AGGGGCGACA CCTACCAGCT GAACCTCACC
TCACGCTTCA CCGCCCGCCG CCCCGGCATG GACGCCGCCG CAGTCCTGTT GCGCCTGTGG
CAGCATCGCC CCGCGCCCTT CGCCGCCTAT CTGCACGCAG GACGTCACCG CATCCTCTCA
CTCTCGCCCG AACGATTCCT GCGCGTACGG GGGGGTGAGG TACTGGCCCA GCCCATCAAG
GGAACGCGCA GCTTCGACCC GGCGACCACC TCGCCCGGAG AACGGGCACG CCTCGAAGCC
GCCCTGCGCG CCGACCCCAA GGAACACGCC GAACTCTCGA TGGTGGTCGA CCTGCTGCGC
AACGACATCT CCGCCACATG CGCCTACGAC AGTGTGCGCG TGCCGCGGCA CTGCGCCACC
TTCGCCGTCG GGCCCCTCAT ACAGATGTGC AGCGACGTGA CGGGAACCCT GCGCGACGGG
ACGACCTGCC TCGACCTTCT GCGTCACGCC TTCCCCGGCG GTTCGGTGAC GGGCTGCCCC
AAACCGCGTA CCATGAGCCT CATCGAACGC ATCGAACCCC ACCCGCGCGA CGTCTACTGC
GGCAGTCTCG TCGCCGTGGC GGGCCCCCGT GACATGGACA GTTCCATAGC CATTCGCACA
GCCCTGTACG ACACGACGAC AGGCCTCCTG CATCTGTACG CAGGCAGCGG GCTCACCGTC
GATTCCGACC CCGAGGGCGA ATACCGCGAG ACCGTCGACA AGACGTCGGC ATTCAGGAAG
GAGACGGCAT GA
 
Protein sequence
MRCILSDCVD SATFARLALA LAREGADMLL CHPAPDGGGM PPGWTASDAT SPARPCCLVG 
FDVAGEISPP PDGDLTDLHA FCFGDGTQQV RIASDVPRLA FGWLSYGWGM ALHGIASAKP
AESGYGVLRR YHCLLRWYPH GGDLEAEVSA TGQRRLPMLR RLLDALRTEE RAHTLTETLT
EEHREAGTAM RQEKQGAPRA AATSKGSGGD GGDNPYGDVR NHGEGRGLSG APRDGTAPDD
DLALALDELV PSLDATTYPD GVRQVLAAIR RGDTYQLNLT SRFTARRPGM DAAAVLLRLW
QHRPAPFAAY LHAGRHRILS LSPERFLRVR GGEVLAQPIK GTRSFDPATT SPGERARLEA
ALRADPKEHA ELSMVVDLLR NDISATCAYD SVRVPRHCAT FAVGPLIQMC SDVTGTLRDG
TTCLDLLRHA FPGGSVTGCP KPRTMSLIER IEPHPRDVYC GSLVAVAGPR DMDSSIAIRT
ALYDTTTGLL HLYAGSGLTV DSDPEGEYRE TVDKTSAFRK ETA