Gene Avin_32920 details


Gene Information

Locus tag: Avin_32920
Symbol:
ID: 7762190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3370991
End bp: 3372232
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 71%
IMG OID: 643806160
Product: hypothetical protein
Protein accession: YP_002800424
Protein GI: 226945351
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
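
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3370991, 3372232)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields.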


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCT GTCCGCGTCT GCCGCTCTGC GCCGTCCTGC TGATCGCGTC CGTTCCGTCC 
TGGGCGGCGA ACAAGGTCGA TCTGGACTAC CACGTGCGTT TCCTGCCGGA GCGCGACCAG
GCCGAGGTGC GCCTGACCCT GGAGCAGGGC AGCGCGGTCC GCAGCCTGCG CTTCGATCTG
GGCGACCAAG GCCGCTACAG CGATTTCCAG GCCGACGGGC AGTGGCAGCA GGAGGAGCCC
GGCAGCGGCG TCTGGCGGCC GGCGGAGGGC AAGAGCAGCC TGAGCTACCG GGTACGGGTC
AACCATGCGC GCGCCTCCTC CGGGCGTTTC GATGCGCGGA TGACCGGGAA CTGGGCGCTG
CTGCGCGGCG ACGATCTGGT GCCCAGCGCC CATCTGGACC AGCAGGACGG CGTGGAACTG
GTGGCGCGCC TGGAGTTCGA GCTGCCCGAG GGCTGGACGG GCGTCGAGAC CGGCTGGCCG
CGCATCGGCA GGAACCGTTT CCGCATCGAC AACCCGGCGC GCCGCTTCGA CCGGCCGACC
GGCTGGCTGC TCGCCGGCCA GCTCGGCACC CGGCGGGCGA TCCTGGGCGG CAGCGAGGTC
AGTGTGGCGG CGCCGCTCGG CGAGGGCGTG CGGCGGATGG ACATCCTGAC CCTGCTGACC
TTCGTCTGGG ACGAATACCG GACGGTGTTC CTGCGCGCGC CCGGCAAGCT GCTGGTGGTC
GGCGCCGGCA ACCCGATGTG GCGTGGCGGC CTGTCGGCCC CCAACTCCCT GTACCTGCAC
GCCGATCGTC CGCTGGTCAG CGAGAACGGT ACCAGTCCCT TGCTGCACGA ACTGGTGCAC
GTGTTCGCCC GGATTCGCGA CACCGATGCG AGCGACTGGA TCAGCGAGGG GCTGGCCGAG
TACTACGCCA TCGAACTGCT GCGCCGCGCC GGCGGTCTCG CCGAGGATCG CTACGAGCGG
ATCTATCGGC AACTGGAGCA CTGGAGCCGC GAGGTCGGCA GCCTGCGCGG CGAACGGATC
AGCGGTCCGG TCACCGCCCG CGCCGTGCTC CTGCTGCGGG CGCTCGACGC GGAGATCCGC
GCGCGCAGTG AGAACCGCCA TTCGCTGGAC GATGTGGTGC ACGGGCTGAT TCGCATGGAG
CGGGTCAATA CCGACGACTT CGTCGCGCTC AGCGAGAACC TCATGGGCGG CGAGTCGCGG
GTGCTGGATA CGCCTCTGCT GGCGCCCGGG GCCGGGCGGT GA
 
Protein sequence
MSACPRLPLC AVLLIASVPS WAANKVDLDY HVRFLPERDQ AEVRLTLEQG SAVRSLRFDL 
GDQGRYSDFQ ADGQWQQEEP GSGVWRPAEG KSSLSYRVRV NHARASSGRF DARMTGNWAL
LRGDDLVPSA HLDQQDGVEL VARLEFELPE GWTGVETGWP RIGRNRFRID NPARRFDRPT
GWLLAGQLGT RRAILGGSEV SVAAPLGEGV RRMDILTLLT FVWDEYRTVF LRAPGKLLVV
GAGNPMWRGG LSAPNSLYLH ADRPLVSENG TSPLLHELVH VFARIRDTDA SDWISEGLAE
YYAIELLRRA GGLAEDRYER IYRQLEHWSR EVGSLRGERI SGPVTARAVL LLRALDAEIR
ARSENRHSLD DVVHGLIRME RVNTDDFVAL SENLMGGESR VLDTPLLAPG AGR
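
The sequences above can be cross-checked against the GC content and Translation table fields. The sketch below is a minimal, dependency-free example and not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which is appropriate here on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon to its amino acid (codons enumerated in T, C, A, G order).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence listed above.
print(translate("ATGTCCGCCT GTCCGCGT"))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field of this record.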