Gene Avin_00190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_00190 
SymboldprA 
ID7758987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp21561 
End bp22661 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content75% 
IMG OID643802946 
ProductDNA processing protein, DprA (SMF family) 
Protein accessionYP_002797262 
Protein GI226942189 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0224115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCTTT CTCCCGCCGA ACTGGAGGCG CGCCTGCGCC TGCACGCCCT GCCCGGCTTC 
GGACCGCGCC GCCATCGCCG TCTGCTCGAG GCCTTCGCCA GCGCCTCGGC GGCGCTCAGT
GCGCCGGCCG CCGCCTGGCG CGCGTTGGGG CTGCCGGCCG CTTGCGCGGA GGCGCGGCGC
AGCGCCGCGG TCCGCCAGCA GGCCGCGCAG GCGCTGGCCT GGCTGGAGAA GCCACGGCGG
CACCTGCTGC CATTCGGCGA GCCCGACTAT CCGGCCCTGC TCGCCGAACT GGACGACGCT
CCGCCGCTGC TCTTCGTCGA AGGCGATGCG GCCATTCTGG AGCGGCCGCA ACTGGCCATG
GTCGGCAGCC GGCGCGCCTC GAAGCCCGGG CTCGACACCG CCCGCGCCTT CGCCCGGCAG
CTCGCCGGCG CCGGTTTCGT CGTCACCAGC GGACTGGCCC TGGGCATCGA TGGCGCCGCC
CACCGGGGCG CGCTGGATGC CGGCGGACGG ACGGTGGCCG TGCTCGGCAC GGGCCTGCGG
CGCCTCTATC CGGCGCGCCA CGAAACCCTG GCGGCGGAGA TCCTCGAAGG CGGCGGCGCG
CTGCTATCCG AACTGCCGCT GGACTGTCCG CCCCAGGCGG GCAACTTCCC GCGCCGCAAC
CGTATCGTCA GCGGCCTGTC CCTCGGCGTC CTGGTGGTCG AGGCCGGCCC TTCCAGCGGC
TCGCTGATCA CCGCCCGGCT GGCCGCCGAG CAGGGTCGCG AGGTCTATGC GATTCCCGGC
TCCATCCACC ATCCCGGCGC CCGTGGCTGC CATCGGCTGA TTCGCGACGG TGCCATTCTG
GTGGAAAGCA TCGAGCACAT CCTCGAAGCG CTGCGCGGCT GGCAGAACCT GGCGCCGCCG
CCCCGCTCGC AGGCCGGTCC GGCTCCGCTC GACGCGCGGC ATCCACTGCT GGCGCTGCTG
CATGCCGCGC CGCAGACCAG CGAGGAGCTG GCCCTGGCCA GCGGCTGGCC GCTCGCCCGG
GTACTGGCGG CACTCACCGA ACTGGAGCTG GACGGTAAGG TGGCCCGCGA AGGCGGACGC
TGGCAGGGAT GCGCCGGCTG A
 
Protein sequence
MPLSPAELEA RLRLHALPGF GPRRHRRLLE AFASASAALS APAAAWRALG LPAACAEARR 
SAAVRQQAAQ ALAWLEKPRR HLLPFGEPDY PALLAELDDA PPLLFVEGDA AILERPQLAM
VGSRRASKPG LDTARAFARQ LAGAGFVVTS GLALGIDGAA HRGALDAGGR TVAVLGTGLR
RLYPARHETL AAEILEGGGA LLSELPLDCP PQAGNFPRRN RIVSGLSLGV LVVEAGPSSG
SLITARLAAE QGREVYAIPG SIHHPGARGC HRLIRDGAIL VESIEHILEA LRGWQNLAPP
PRSQAGPAPL DARHPLLALL HAAPQTSEEL ALASGWPLAR VLAALTELEL DGKVAREGGR
WQGCAG