Gene Avin_18310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_18310 
Symbol 
ID7760765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1812985 
End bp1814985 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content66% 
IMG OID643804729 
ProductPeptidase, U32 family 
Protein accessionYP_002799018 
Protein GI226943945 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTAC CCAAGCACCA TCTGGAACTG CTCAGCCCGG CACGGGATGT GTCCATCGCC 
AAGGAGGCGA TCCTGCATGG CGCCGATGCC GTGTACATCG GCGGCCCCAG CTTCGGTGCG
CGGCACAGCG CCGGCAACGA TGTGACGGAT ATCGCCGGCC TGGTGGAGTT CGCCCACCAG
TTCCACGCCC GGGTGTTCGT CACCCTCAAC ACCATCCTGC ACGACGACGA GCTGGAACCA
GCCCGCCAGT TGATCTGGCA GCTACACGAC GCGGGCATCG ACGCGCTGAT CGTGCAGGAC
ATGGGCATCC TGGAAATGGA CATTCCGCCC ATCGAGCTGC ACGCCAGCAC CCAGTGCGAC
ATCCGCACCC TGGAAAAGGC GAAATTCCTG GCCGACGCCG GTTTCTCCCA ACTGGTGCTA
GCGCGCGAAC TCGGCCTTGC GGAGGTCCGC GCCATCGGCG CGCGAGTGGA CTCGACCATC
GAGTTCTTCA TCCACGGCGC GCTGTGCGTG GCCTTCTCCG GGCAGTGCTA CATCTCCCAC
GCGCAGACCG GGCGCAGCGC CAACCGCGGC GACTGCTCGC AGGCCTGCCG GCTGCCCTAC
ACCCTGAAGG ACGACCAGGG CCGGGTGGTG GCCTACGACA AGCACCTGCT GTCGATGAAG
GACAACAACC AGAGCGCCAA CCTGCGCGCG CTGATTGATG CCGGGGTGCG CTCCTTCAAG
ATCGAGGGAC GCTACAAGGA CATGGGCTAC GTGAAGAACA TCACCGCCTA CTACCGGCAA
CGGCTCGACG AGGTCCTGAG CGAGCGCCCG GAGCTGGCAC GAGCCTCCAG CGGCCGGACC
GAGCACTTCT TCGTGCCGGA CCCGGACAAG ACCTTCCACC GCGGCAGCAC GGACTACTTC
GTCAACGAGC GCAGGATCGA CATCGGCGCC TTCGATTCGC CGACCTTCAC CGGCCTGCCG
GTGGGCGAGA TACTCAAGGT CCGCAAGAAG GATTTCATCG CCGGCACCAC GGAGCCGCTC
GGCAACGGCG ACGGCCTGAA CGTGCTGGTC AAGCGCGAAG TGGTGGGCTT CCGTGCCAGC
GTCGTCGAAC AGCTCGAACG CTCCGAGCGG GGCGGCAAGC CCCACTGGCA ATACCGCATC
GAGCCCAACG AGATGCCGGC CGCGCTGAAG CAGTTGCGTC CGCATCATGC GCTGAACCGC
AACCTGGACC ACGACTGGCA GCAGGCGCTG CAGAAGACTT CCGCCGAGCG CCGGGTCGGG
GTGCGCTGGC AGGTCCGGCT GACGGAGGAG GCACTCGACC TGAACGTGAC CAGCGAGGAA
GGCGTCGGCG CTTCGGCGAG TCTCGCCGGG CCCTTCGGAC TGGCGAAGAA GCCGGAACAG
GCGCTCGAGC AGTTGCGCGA CCTGCTCGGT CAATTGGGCA CCACCCTCTA CCATGCCGAA
GAAGTGGAGA TCGATGCACC GCAGGCATTC TTCGTCCCGA ACTCCCTGCT CAAGGCCCTG
CGCCGCGAGG CCATCGAGGC CCTGACTGCC GCCCGCCTGG CCACCCACCG GCGCGGCAGC
CGCAAGCCGG TCAGCGAGCC GCCGCCGGTG TACCCGGAAT CGCACCTGAC CTTCCTGGCC
AACGTGTACA ACGAGAAGGC CCGGGCGTTC TATCGCCGCT ACGGCGTGCA ACTGATCGAC
GCGGCCTACG AGGCTCACGC GGAGGCCGGC GAAGTGCCGG TGATGATCAC CAAGCACTGT
CTGCGTTTCT CCTTCAACCT GTGCCCGAAA CAGGCCAAGG GCGTCACGGG CGTGCGCACC
AGGGTCGCGC CGATGCAACT GGTGCATCAG GACGAGGTGC TGACCCTGAC GTTCGACTGC
AAGGCCTGCG AGATGCACGT CATCGGCAGG ATGAAAGACC ACATCCTCGC CCAGCCCCAA
CCCGGCAGCT CGACCGGTAT CGTCGGCCAG ATCAGTCCCG AGGAGTTGCT CAAGACCATC
AGGAACAGAG CGCAACGCTG A
 
Protein sequence
MSLPKHHLEL LSPARDVSIA KEAILHGADA VYIGGPSFGA RHSAGNDVTD IAGLVEFAHQ 
FHARVFVTLN TILHDDELEP ARQLIWQLHD AGIDALIVQD MGILEMDIPP IELHASTQCD
IRTLEKAKFL ADAGFSQLVL ARELGLAEVR AIGARVDSTI EFFIHGALCV AFSGQCYISH
AQTGRSANRG DCSQACRLPY TLKDDQGRVV AYDKHLLSMK DNNQSANLRA LIDAGVRSFK
IEGRYKDMGY VKNITAYYRQ RLDEVLSERP ELARASSGRT EHFFVPDPDK TFHRGSTDYF
VNERRIDIGA FDSPTFTGLP VGEILKVRKK DFIAGTTEPL GNGDGLNVLV KREVVGFRAS
VVEQLERSER GGKPHWQYRI EPNEMPAALK QLRPHHALNR NLDHDWQQAL QKTSAERRVG
VRWQVRLTEE ALDLNVTSEE GVGASASLAG PFGLAKKPEQ ALEQLRDLLG QLGTTLYHAE
EVEIDAPQAF FVPNSLLKAL RREAIEALTA ARLATHRRGS RKPVSEPPPV YPESHLTFLA
NVYNEKARAF YRRYGVQLID AAYEAHAEAG EVPVMITKHC LRFSFNLCPK QAKGVTGVRT
RVAPMQLVHQ DEVLTLTFDC KACEMHVIGR MKDHILAQPQ PGSSTGIVGQ ISPEELLKTI
RNRAQR