Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_00190 |
Symbol | dprA |
ID | 7758987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 21561 |
End bp | 22661 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643802946 |
Product | DNA processing protein, DprA (SMF family) |
Protein accession | YP_002797262 |
Protein GI | 226942189 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0224115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTTT CTCCCGCCGA ACTGGAGGCG CGCCTGCGCC TGCACGCCCT GCCCGGCTTC GGACCGCGCC GCCATCGCCG TCTGCTCGAG GCCTTCGCCA GCGCCTCGGC GGCGCTCAGT GCGCCGGCCG CCGCCTGGCG CGCGTTGGGG CTGCCGGCCG CTTGCGCGGA GGCGCGGCGC AGCGCCGCGG TCCGCCAGCA GGCCGCGCAG GCGCTGGCCT GGCTGGAGAA GCCACGGCGG CACCTGCTGC CATTCGGCGA GCCCGACTAT CCGGCCCTGC TCGCCGAACT GGACGACGCT CCGCCGCTGC TCTTCGTCGA AGGCGATGCG GCCATTCTGG AGCGGCCGCA ACTGGCCATG GTCGGCAGCC GGCGCGCCTC GAAGCCCGGG CTCGACACCG CCCGCGCCTT CGCCCGGCAG CTCGCCGGCG CCGGTTTCGT CGTCACCAGC GGACTGGCCC TGGGCATCGA TGGCGCCGCC CACCGGGGCG CGCTGGATGC CGGCGGACGG ACGGTGGCCG TGCTCGGCAC GGGCCTGCGG CGCCTCTATC CGGCGCGCCA CGAAACCCTG GCGGCGGAGA TCCTCGAAGG CGGCGGCGCG CTGCTATCCG AACTGCCGCT GGACTGTCCG CCCCAGGCGG GCAACTTCCC GCGCCGCAAC CGTATCGTCA GCGGCCTGTC CCTCGGCGTC CTGGTGGTCG AGGCCGGCCC TTCCAGCGGC TCGCTGATCA CCGCCCGGCT GGCCGCCGAG CAGGGTCGCG AGGTCTATGC GATTCCCGGC TCCATCCACC ATCCCGGCGC CCGTGGCTGC CATCGGCTGA TTCGCGACGG TGCCATTCTG GTGGAAAGCA TCGAGCACAT CCTCGAAGCG CTGCGCGGCT GGCAGAACCT GGCGCCGCCG CCCCGCTCGC AGGCCGGTCC GGCTCCGCTC GACGCGCGGC ATCCACTGCT GGCGCTGCTG CATGCCGCGC CGCAGACCAG CGAGGAGCTG GCCCTGGCCA GCGGCTGGCC GCTCGCCCGG GTACTGGCGG CACTCACCGA ACTGGAGCTG GACGGTAAGG TGGCCCGCGA AGGCGGACGC TGGCAGGGAT GCGCCGGCTG A
|
Protein sequence | MPLSPAELEA RLRLHALPGF GPRRHRRLLE AFASASAALS APAAAWRALG LPAACAEARR SAAVRQQAAQ ALAWLEKPRR HLLPFGEPDY PALLAELDDA PPLLFVEGDA AILERPQLAM VGSRRASKPG LDTARAFARQ LAGAGFVVTS GLALGIDGAA HRGALDAGGR TVAVLGTGLR RLYPARHETL AAEILEGGGA LLSELPLDCP PQAGNFPRRN RIVSGLSLGV LVVEAGPSSG SLITARLAAE QGREVYAIPG SIHHPGARGC HRLIRDGAIL VESIEHILEA LRGWQNLAPP PRSQAGPAPL DARHPLLALL HAAPQTSEEL ALASGWPLAR VLAALTELEL DGKVAREGGR WQGCAG
|
| |