Gene Avi_1523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1523 
Symbol 
ID7386481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1273881 
End bp1274936 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content57% 
IMG OID643650889 
Productextracellular metalloprotease precursor protein 
Protein accessionYP_002549094 
Protein GI222148137 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.103754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGTTG CCATGTGCCG CTATTGCCAA ATCATTCCCG AAAAAGTGCT GATTGCCCTA 
TCCCGTGACC AGGATTTTCC AGCCGCCGTG CGCGAGCGGT TACAGGAAAC CATGCATCAC
GACCATCAAC TGCGCCAGTT TCGCGACAGC GCCCGCCAGC TGACCATCGC CAAACGGCCT
TTCCGGGCGT TTTCCGTGAC CGTGGCCCAG GCACCTGATA TCCCGGTCTA TACCTGCAAT
AACGGCATGA CCCTTCCCGG CGTACAGATC GCCAATCCCG GCTCTTCAAC GGATGCCCAG
GTGAAGACCA CATTCGATAC AACGACCGGG GTGGAGCAAT TCTACAGCAG CGTCTTCAAG
CGTAACTCCA TCGACGGCAA TGGCATGACG ATCCTGTCGT CTGTGCATTA CGGCAGGGAT
TACAACAACG CCTTCTGGAA CGGCTCGCAA ATGGCCTATG GCGATGGCGA CGGGGAGATT
TTCACCCCGT TTTGTGAGAG CGCCGATGTG GTCGGCCATG AACTGACCCA TGGCATCACC
CAATATACCC TGGGCCTCGA CTATGAAAAC CAGCCGGGTG GGCTGAATGA AAGTCTCTCG
GATGTGTTCG GCAGCATGTT CAAGCAATGG ACGAAAGATC AGAATGCCGA TGAGGCCGAC
TGGCTGATCG GCAATGACAT TCTCGGCCCG ACGGCCCGGC AGAAATATAC CTGCCTGCGC
GACATGGCCA ATCCTGAAGC ATCCCATTGC ATGGCCGAGC AGATCAGCCA TTTCAGCGAT
TACCGCGATG GCATGGACCC ACATGAGAGC AGCGGCATCG CCAACCGCGC CTTTTATCTA
GCCGCCACCC GCATCGGCGG CAAAAGCTGG GACAAGGCCG GACAGATATG GTATGATGCA
CTCACAAAGA ATGGCAGCAA CCCGGACATG ACCATGGCGG AATTTGCCGA TGCGACCCGG
GCCGGAGCTG CCAGGCTTTA TCCAGGAGAT GGATCGCTCG CAGAGGTGCT CGACACCGCC
TGGAGCGAAG TGGGTCTGCA AAGTGCTTCG GTCTGA
 
Protein sequence
MEVAMCRYCQ IIPEKVLIAL SRDQDFPAAV RERLQETMHH DHQLRQFRDS ARQLTIAKRP 
FRAFSVTVAQ APDIPVYTCN NGMTLPGVQI ANPGSSTDAQ VKTTFDTTTG VEQFYSSVFK
RNSIDGNGMT ILSSVHYGRD YNNAFWNGSQ MAYGDGDGEI FTPFCESADV VGHELTHGIT
QYTLGLDYEN QPGGLNESLS DVFGSMFKQW TKDQNADEAD WLIGNDILGP TARQKYTCLR
DMANPEASHC MAEQISHFSD YRDGMDPHES SGIANRAFYL AATRIGGKSW DKAGQIWYDA
LTKNGSNPDM TMAEFADATR AGAARLYPGD GSLAEVLDTA WSEVGLQSAS V