Gene Avin_04640 details

Gene Information

Locus tag: Avin_04640
Symbol:
ID: 7759422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 438366
End bp: 439685
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 68%
IMG OID: 643803386
Product: carboxyl-terminal protease S41A
Protein accession: YP_002797694
Protein GI: 226942621
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
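
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values copied from the record for Avin_04640.
start_bp, end_bp = 438366, 439685
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Both results agree with the Gene Length and Protein Length fields of the record.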


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.900151
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTACC TGCCCCGCCC CCTGTCGCTG GCCCTGGTCG CCGCCCTGGC GCTCGGCGCT 
CCCCTCCTGC GCGCCGAGGA GGCGCCCGCG CCAGCCGCCG AGACGCGAGG CAAGCCCCTC
CTGCCCCTCG AAGAGCTGCG CACCTTCGCC GAGGTCATGG ACCGCATCAA GGCCGCCTAT
GTCGAGCCGG TGGACGACAA GACCCTGCTG GAAAACGCCA TCAAGGGCAT GATCAGCAAT
CTCGATCCGC ACTCGGCCTA CCTCGAGCCC GAGGAATTCC TCGACCTGCA GGAGAGCACC
AGCGGCGAGT TCGGCGGTCT CGGCGTCGAG GTCGGCATGG AGGACGACCA GCTCAAGGTG
GTCGCGCCGA TCGACGACAC GCCGGCGGCC AAGGCCGGCA TCGAGGCCGG CGACCTGATC
GTCCGGATCG ACGGCCAGCC GACCAAGGGC ATGTCCATGC TCGAAGCCGT GGACAAGATG
CGCGGCAAGC CCGGCAGCAA GATCGAGCTG ACCCTGGTGC GCGAGGGCGG CAGGCCGTTC
GATGTCAGCC TGACCAGGGC GGTGATCAAG GTCAAGAGCG TGAAGAGCCA GTCGCTCGAG
TCCGGCTACG GCTACCTGCG CATCACCCAG TTCCAGGTCA ACAGCGGCGA GGAAGTCGGC
AAGGCGCTGG CGCGCCTGAA GCAGGAGAAC GGCGGACAGA AGCTGAAGGG CCTGGTGCTG
GACCTGCGCA ACAACCCCGG CGGCGTGCTG CAGTCGGCCG TCGAGGTGAC CGACCATTTC
CTCACCAAGG GCCTGATCGT CTACACCAAG GGCCGCATCG CCAACTCCGA ACTGCGCTTC
TCCGCCGACC CGGCCGACGC CAGCGAAGGC GTGCCGATGG TGACGCTGAT CAACGGCGGC
AGCGCCTCGG CCGCGGAGAT CGTCGCCGGC GCCCTGCAGG ATCACAAGCG GGCGGTGCTG
ATGGGCACCG ACAGCTTCGG CAAGGGCTCG GTGCAGACCG TGCTGCCGCT GAACAACGAC
CGCGCCCTGA AGCTGACCAC CGCTCTCTAC TTCACCCCCA ACGGCCGTTC GATCCAGGCT
CAGGGCATAG TGCCGGACAT CGTGGTGGAG CGCGCCAAGC TGACCCAGGA CGCCCAGCAG
GAACACCTGC GCGAGGCCGA TCTGGCCGGC CACCTGGGCA ACGGCAACGG CGGGCCGGAC
AAGCCCAGCG GCAAGGCCGG GCAGGAAGGC AAGGCGCGTC CGCAGGACGA CGACTACCAG
TTGAGCCAGG CGCTCAACCT GCTCAAGGGC CTCAACCTGA CCCGCGGACT GCAGCGGTAA
 
Protein sequence
MPYLPRPLSL ALVAALALGA PLLRAEEAPA PAAETRGKPL LPLEELRTFA EVMDRIKAAY 
VEPVDDKTLL ENAIKGMISN LDPHSAYLEP EEFLDLQEST SGEFGGLGVE VGMEDDQLKV
VAPIDDTPAA KAGIEAGDLI VRIDGQPTKG MSMLEAVDKM RGKPGSKIEL TLVREGGRPF
DVSLTRAVIK VKSVKSQSLE SGYGYLRITQ FQVNSGEEVG KALARLKQEN GGQKLKGLVL
DLRNNPGGVL QSAVEVTDHF LTKGLIVYTK GRIANSELRF SADPADASEG VPMVTLINGG
SASAAEIVAG ALQDHKRAVL MGTDSFGKGS VQTVLPLNND RALKLTTALY FTPNGRSIQA
QGIVPDIVVE RAKLTQDAQQ EHLREADLAG HLGNGNGGPD KPSGKAGQEG KARPQDDDYQ
LSQALNLLKG LNLTRGLQR
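
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that cleanup together with a simple GC-fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGCCGTACC TGCCCCGCCC CCTGTCGCTG GCCCTGGTCG CCGCCCTGGC GCTCGGCGCT"
)
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` would give a value that can be compared with the GC content field of the record.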