Gene Information |
Locus tag | Dvul_2621 |
Symbol | |
ID | 4663669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 3054578 |
End bp | 3056149 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639820867 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_968060 |
Protein GI | 120603660 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
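
The coordinate and length fields in the table above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though one consistent with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table for Dvul_2621.
START_BP = 3054578
END_BP = 3056149


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1572 523
```

Both results agree with the Gene Length (1572 bp) and Protein Length (523 aa) rows.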
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.260936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGCA TACTCTCCGA CTGCGTTGAC AGTGCCACCT TCGCCCGTCT CGCCCTCGCG CTGGCACGCG AAGGGGCGGA CATGCTGCTG TGTCACCCTG CCCCCGACGG CGGCGGGATG CCCCCCGGCT GGACGGCATC CGATGCGACG TCGCCCGCCC GTCCATGCTG CCTTGTGGGA TTCGACGTGG CTGGCGAGAT ATCGCCTCCG CCCGACGGAG ACCTCACAGA CCTTCATGCC TTCTGCTTCG GCGACGGCAC GCAACAGGTG CGCATCGCCT CCGACGTGCC ACGCCTCGCC TTTGGCTGGC TTTCCTACGG GTGGGGCATG GCGCTGCACG GCATCGCCTC CGCAAAGCCT GCCGAGTCGG GATACGGCGT GCTACGCCGC TACCACTGCC TACTGCGCTG GTATCCGCAT GGCGGCGACC TTGAAGCCGA GGTCTCGGCA ACGGGACAAC GCAGGCTGCC CATGCTGCGT CGCCTGCTTG ATGCGCTACG CACGGAAGAA CGCGCCCATA CGCTTACGGA GACACTGACG GAAGAACACC GCGAGGCTGG TACGGCGATG CGGCAGGAGA AGCAGGGAGC ACCCCGGGCA GCAGCGACCA GCAAGGGAAG CGGGGGAGAT GGGGGCGACA ATCCCTACGG GGACGTCAGG AACCACGGGG AAGGCAGGGG CTTGTCCGGG GCGCCGCGAG ACGGCACCGC GCCGGACGAC GACCTCGCCC TCGCACTGGA CGAACTTGTC CCGTCGCTGG ATGCCACGAC CTACCCGGAT GGCGTACGGC AGGTGCTGGC TGCCATCCGC AGGGGCGACA CCTACCAGCT GAACCTCACC TCACGCTTCA CCGCCCGCCG CCCCGGCATG GACGCCGCCG CAGTCCTGTT GCGCCTGTGG CAGCATCGCC CCGCGCCCTT CGCCGCCTAT CTGCACGCAG GACGTCACCG CATCCTCTCA CTCTCGCCCG AACGATTCCT GCGCGTACGG GGGGGTGAGG TACTGGCCCA GCCCATCAAG GGAACGCGCA GCTTCGACCC GGCGACCACC TCGCCCGGAG AACGGGCACG CCTCGAAGCC GCCCTGCGCG CCGACCCCAA GGAACACGCC GAACTCTCGA TGGTGGTCGA CCTGCTGCGC AACGACATCT CCGCCACATG CGCCTACGAC AGTGTGCGCG TGCCGCGGCA CTGCGCCACC TTCGCCGTCG GGCCCCTCAT ACAGATGTGC AGCGACGTGA CGGGAACCCT GCGCGACGGG ACGACCTGCC TCGACCTTCT GCGTCACGCC TTCCCCGGCG GTTCGGTGAC GGGCTGCCCC AAACCGCGTA CCATGAGCCT CATCGAACGC ATCGAACCCC ACCCGCGCGA CGTCTACTGC GGCAGTCTCG TCGCCGTGGC GGGCCCCCGT GACATGGACA GTTCCATAGC CATTCGCACA GCCCTGTACG ACACGACGAC AGGCCTCCTG CATCTGTACG CAGGCAGCGG GCTCACCGTC GATTCCGACC CCGAGGGCGA ATACCGCGAG ACCGTCGACA AGACGTCGGC ATTCAGGAAG GAGACGGCAT GA
|
Protein sequence | MRCILSDCVD SATFARLALA LAREGADMLL CHPAPDGGGM PPGWTASDAT SPARPCCLVG FDVAGEISPP PDGDLTDLHA FCFGDGTQQV RIASDVPRLA FGWLSYGWGM ALHGIASAKP AESGYGVLRR YHCLLRWYPH GGDLEAEVSA TGQRRLPMLR RLLDALRTEE RAHTLTETLT EEHREAGTAM RQEKQGAPRA AATSKGSGGD GGDNPYGDVR NHGEGRGLSG APRDGTAPDD DLALALDELV PSLDATTYPD GVRQVLAAIR RGDTYQLNLT SRFTARRPGM DAAAVLLRLW QHRPAPFAAY LHAGRHRILS LSPERFLRVR GGEVLAQPIK GTRSFDPATT SPGERARLEA ALRADPKEHA ELSMVVDLLR NDISATCAYD SVRVPRHCAT FAVGPLIQMC SDVTGTLRDG TTCLDLLRHA FPGGSVTGCP KPRTMSLIER IEPHPRDVYC GSLVAVAGPR DMDSSIAIRT ALYDTTTGLL HLYAGSGLTV DSDPEGEYRE TVDKTSAFRK ETA
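
The protein sequence above is the conceptual translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, using only the Python standard library. The codon-to-residue mapping is the standard one that table 11 shares with table 1; table 11 differs in which codons may act as starts, which this sketch does not model (an alternative start codon such as GTG would be read as V here, not M). The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces that split the sequence into 10-base blocks."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCTGCA TACTCTCCGA CTGCGTTGAC"))  # MRCILSDCVD
# Last four codons of the gene sequence, ending in the TGA stop.
print(translate("GAGACGGCAT GA"))  # ETA
```

The two fragments reproduce the first ten and last three residues of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` should allow the Protein sequence and GC content rows to be checked in the same way.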