Gene Avin_12950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_12950 
SymbolalgW 
ID7760237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1258768 
End bp1259919 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content63% 
IMG OID643804197 
ProductHtr-like protease 
Protein accessionYP_002798496 
Protein GI226943423 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAGG CCCTGCGTTT TCTTGGCTGG CCCCTGGCGG TCGGAGTGCT GCTGGCCCTG 
CTGATCATCC AGCGTTATCC GGAGTGGGTC GGCTTGCCCC GGCAGCTTGC CGATGAGCAG
CAACTGTCCC GCTCCATCCT CGCTCCCCAA GGCCCCGTCT CCTATGCCAA TGCCGTGAGC
AGCGCCGCAC CGGCGGTAGC CAACCTGTAC ACCACCAAGG TCGTCAAGAA ATCGAACCAG
CCGCTGTTCG ACGATCCACT GCTGCAACAG TACTTCGGCA ACTCCCTGCC CAGTCAGCGG
CGCCTGGAAT CCAGCCTGGG GTCTGCGGTG ATCATGCGCC GGGATGGCTA CCTGCTGACC
AACAACCACG TCACCGCCGG TGCCGACCAG ATCGTCGTCG CCCTGCGGGA CGGACGGGAA
GTCCTCGCCC GGGTGATAGG CAACGATTCG GAAACCGATC TGGCCGTGCT CAAGATCGAT
CTGGACGAAT TGCCGGTCAT GCATCTCGGA CGCTCCGACA GCATCCGCAT CGGTGATGTC
GCCCTGGCCA TCGGCAACCC CTTCGGCGTC GGCCAGACCG TGACCATGGG CATCATCAGC
GCCACCGGGC GCAACCAACT GGGCTTGAAT ACCTACGAGG ACTTCATCCA GACCGATGCA
GCGATCAATC CGGGCAATTC GGGCGGCGCG CTGATCGATG CCAATGGCTA TCTGATCGGC
ATCAATACCG CCATTTTCTC CAAGTCGGGC GGATCCCAGG GTATCGGCTT CGCGATTCCG
GCCAAGCTTG CCCTGGAGGT GATGGAGGAA ATCATCAAGC ACGGTCAGGT AATTCGCGGC
TGGCTCGGAC TCGAGGTGCA ACCACTGACC AAGGAGTTGG CCGAATCCTT CGGCCTGGAA
GGCCGGCCAG GCATCGTCGT CGCCGGCATA TACCGTGACG GCCCCGCACA ACGGGCCGGT
CTGCAGCCGG GCGACCTGAT CGTCAGTATC GATGGCCAGC CGGCCACCGA TGGACGCCAT
GCCATGAATC AGGTCGCCCA GACTCGACCG GGAGAAACCA TCGAAATCGA GGTCCTGCGC
AACGGCCAAG CCCTCACCCT TAGCGCCGAG ATCGGCCTGC GCCCGCCACC CACCGCCGTG
CAGCAGCCAT GA
 
Protein sequence
MLKALRFLGW PLAVGVLLAL LIIQRYPEWV GLPRQLADEQ QLSRSILAPQ GPVSYANAVS 
SAAPAVANLY TTKVVKKSNQ PLFDDPLLQQ YFGNSLPSQR RLESSLGSAV IMRRDGYLLT
NNHVTAGADQ IVVALRDGRE VLARVIGNDS ETDLAVLKID LDELPVMHLG RSDSIRIGDV
ALAIGNPFGV GQTVTMGIIS ATGRNQLGLN TYEDFIQTDA AINPGNSGGA LIDANGYLIG
INTAIFSKSG GSQGIGFAIP AKLALEVMEE IIKHGQVIRG WLGLEVQPLT KELAESFGLE
GRPGIVVAGI YRDGPAQRAG LQPGDLIVSI DGQPATDGRH AMNQVAQTRP GETIEIEVLR
NGQALTLSAE IGLRPPPTAV QQP