Gene Avin_00540 details

Gene Information

Locus tag: Avin_00540
Symbol:
ID: 7759021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 55825
End bp: 57186
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 67%
IMG OID: 643802980
Product: Peptidase, U32 family
Protein accession: YP_002797296
Protein GI: 226942223
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
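
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 55825
end_bp = 57186
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # coordinate span vs. listed gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop vs. protein length
```

If any of the three checks printed False for another record, the most likely explanations would be a different coordinate convention or a pseudogene or spliced gene, both of which this record reports as "No".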


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.210117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCTT CCGCCTTCCG TCCCGAACTG CTGTCTCCCG CCGGCACCCT CAAGTCCATG 
CGCTACGCCT TCGCCTACGG CGCCGATGCG GTGTACGCCG GGCAGCCGCG CTACAGCCTG
CGGGTGCGCA ACAACGAATT CGATCATGCC CACCTGGCGC TCGGCATCGC GGAAGCCCAC
GCCCAGGGCA GACGCTTCTA CGTGGTGGTC AACATCGCCC CGCACAACAC CAAGCTGAAG
ACCTTCCTCA AGGACCTGCA GCCGGTGATC GACATGCAAC CGGATGCGCT GATCATGTCC
GACCCGGGAC TGATCATGCT GGTGCGCGAG CACTTTCCGG AAATGGCCAT CCACCTCTCG
GTGCAGGCCA ACGCGGTGAA CTGGGCCAGC GTGGAGTTCT GGCGCCGCCA GGGGCTGACC
CGGACGATCC TCTCCCGCGA GCTGTCGTTG GAGGAGATCG GCGAGATGCG CGAGCGGGTG
CCGGGCATGG AGCTGGAGGT GTTCGTCCAC GGCGCGCTGT GCATGGCCTA TTCCGGGCGC
TGCCTGCTGT CCGGCTACAT CAACCACCGC GATCCCAATC AGGGCACCTG CACCAACGCC
TGCCGCTGGG AGTACCGGGC GCACGAGGGC AGGGAAGACG AACTGGGCAA CATCGTCCAC
GTCCAGGAGC CGGTCCGGGC GCAGCCGGCC GAGCCGACCC TGGGCAGCGG CGTGCCCACC
GAACGGCTGA TGCTGCTCGA GGAGAGCAAG CGGCCGGGCG AGTACATGGA GGCCTTCGAG
GACGAGCACG GCACCTACAT CATGAACTCC AAGGACCTGC GCGCCGTGCA GCACGTCGAG
CGGCTGGTGA AGATGGGCGT GCATTCGCTG AAGATCGAGG GCCGCACCAA GAGCCACTAC
TACGTGGCGC GCACCGCCCA GGTCTACCGC AAGGCGATCG ACGACGCGGT GGCCGGCCGG
CCGTTCGACA AGTCGTTGAT GGACACCCTG GAATCGCTGG CCCATCGCGG CTACACCGAG
GGCTTCCTGC GCCGCCACGT ACACGACGAA TACCAGAACT ATGCCCATGG CTATTCGCTG
TCCGAACGCC AGCAGTTCGT CGGCGAGCTG ACCGGCGAGC GCCGCAACGG TCTGGCCGAG
GTGCAGGTGA AGAACCGTTT CGCCCTCGGC GACCGCCTGG AGCTGATGAC CCCCCGGGGC
AACCTGAACT TCCGCCTGGA GGCGCTGGAG AACAAGCGCG GCGAGCGCGC CGAGGTGGCC
CCGGGCGACG GCCACACCCT CTACCTGCCG GTACCGGAAG GCGTGGACCT CGGCCATGCC
CTCCTGATGC GCGAGCTGGA CGGCGCCACC ACCCGCGGCT GA
 
Protein sequence
MTASAFRPEL LSPAGTLKSM RYAFAYGADA VYAGQPRYSL RVRNNEFDHA HLALGIAEAH 
AQGRRFYVVV NIAPHNTKLK TFLKDLQPVI DMQPDALIMS DPGLIMLVRE HFPEMAIHLS
VQANAVNWAS VEFWRRQGLT RTILSRELSL EEIGEMRERV PGMELEVFVH GALCMAYSGR
CLLSGYINHR DPNQGTCTNA CRWEYRAHEG REDELGNIVH VQEPVRAQPA EPTLGSGVPT
ERLMLLEESK RPGEYMEAFE DEHGTYIMNS KDLRAVQHVE RLVKMGVHSL KIEGRTKSHY
YVARTAQVYR KAIDDAVAGR PFDKSLMDTL ESLAHRGYTE GFLRRHVHDE YQNYAHGYSL
SERQQFVGEL TGERRNGLAE VQVKNRFALG DRLELMTPRG NLNFRLEALE NKRGERAEVA
PGDGHTLYLP VPEGVDLGHA LLMRELDGAT TRG
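
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal Python sketch. It is not the method used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. The function names (clean, translate, gc_fraction) are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
# Standard genetic code in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments; it differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGACCGCTT CCGCCTTCCG TCCCGAACTG CTGTCTCCCG CCGGCACCCT CAAGTCCATG"
last_row = "CTCCTGATGC GCGAGCTGGA CGGCGCCACC ACCCGCGGCT GA"

print(translate(first_row))  # compare with the start of the protein sequence
print(translate(last_row))   # compare with the end of the protein sequence
print(gc_fraction(first_row))
```

Because every full row of the gene sequence holds 60 bases, each row starts in frame, so the last row can be translated on its own. Passing the complete gene sequence to translate and gc_fraction would allow a comparison with the full protein sequence and the listed GC content.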