Gene Avi_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1368 
Symboldop 
ID7389105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1143352 
End bp1144827 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content61% 
IMG OID643650776 
Productserine protease DO-like protease 
Protein accessionYP_002548982 
Protein GI222148025 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.200414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGCGG GTGCCGTGAC GGTAACGCCT GCGCTGGCCG CGCCCGTCGA GGTTCAGGCG 
CCGCAGGTGC CAAGCTTTGC CGATCTGGTC AGTGCCGTTT CGCCCGCCGT CGTCTCCATC
CGCGTCAAAT CGGATGTGCA GCAGGCCTCC GAGGATGGCA GCAATTTCTC CTTCAATGGT
CGTGACTTCG ACCAGCTTCC CGATCCTCTG AAGCGCTTCT TCAAGGAATG GGGCATGCCA
GGCCCCGGCG GTCCAGGTGG TCCAGGAGGC CCCAAGGGTG GACCGCATGC CGAGCGTCAT
GGCAAGCTGC GTCCGATTGC CCAAGGGTCG GGCTTCTTCA TCTCCGAGGA CGGCTATGTT
GTGACCAACA ATCACGTAGT TTCCGATGGC CAGGCCTATA CAGTGGTGAT GAATGACGGC
ACCGAATACG ATGCCAAGCT GGTCGGTAAG GACCCGCGCA CTGACCTCGC CGTTTTGAAG
GTCGATCAGC CGACCAAGAA ATTCACCTAT GTCGAATGGG CGCAGGACGA GAAGATCCGC
GTTGGTGACT GGGTCGTGGC CGTCGGCAAT CCTTTCGGTC TCGGCGGAAC CGTGACGTCG
GGTATCGTTT CGGCTTTTGG CCGTGATATC GGCTCCGGCC CTTATGACGA TTACATCCAG
ATCGATGCAC CGGTAAACCG GGGCAATTCG GGTGGGCCGG ACTTCAACCT CAGCGGCAAG
GTGGTCGGGA TCAACACGGC GATCTTCTCG CCATCGGGCG GTAGCGTCGG CATCGCCTTC
GCTATTCCGG CGGCGACTGC CAAGGATGTC GTTGCTGAAT TGATCAAGCA TGGCTCGGTG
CAGCGCGGCT GGCTTGGCGT GCAGATCCAG CCTGTCACCA AGGATATTGC CGAATCGCTC
GGTCTGGCCG ATGCCAAGGG CGCACTGGTG GCTGAGCCGC AAACCGGTTC TCCCGGTGAA
AAGGCCGGTA TCAAGCAGGG CGACGTGATT ACCGCCGTGA ATGGCGATCC GGTCAAGGAC
CCGCGTGACC TCGCCAAGCG CATTGCTGCC TTCCCGCCCA ATACCAAGGT CGATATTTCT
ATCTGGCGCA ATGGCAAGCC GACTGCCGTC AAGGTCGATC TCGGCACCTT GCCTGCTGAA
AAGGATACGG CCAGCAGTGA TGAGGATCAG GGCGCGCCCG AGCAGAACGC ACCGGCCACC
GAGCAGGCGC TTGCCAATCT CGGAGTCACT GTCCAGCGTG CCGATGACGG CAAAGGCCTG
ACGATCACCA ATGTCGATCC GGATTCCGAC GCTGCCGACA AGGGGCTGAA GACCGGCCAG
AAGATCACGT CCGTTAACAA CCAGCAGGTC TCCAGCGCCG CCGAGGTCAA GAAGATCCTT
GATCAGGCCA AGAAGGACGG TCGCACCAAG GCGCTCTTCC AGGTGGAAAC CGACAATGGC
AGCCGCTTCA TCGCCCTGCC GATCAACCAG GGCTGA
 
Protein sequence
MLAGAVTVTP ALAAPVEVQA PQVPSFADLV SAVSPAVVSI RVKSDVQQAS EDGSNFSFNG 
RDFDQLPDPL KRFFKEWGMP GPGGPGGPGG PKGGPHAERH GKLRPIAQGS GFFISEDGYV
VTNNHVVSDG QAYTVVMNDG TEYDAKLVGK DPRTDLAVLK VDQPTKKFTY VEWAQDEKIR
VGDWVVAVGN PFGLGGTVTS GIVSAFGRDI GSGPYDDYIQ IDAPVNRGNS GGPDFNLSGK
VVGINTAIFS PSGGSVGIAF AIPAATAKDV VAELIKHGSV QRGWLGVQIQ PVTKDIAESL
GLADAKGALV AEPQTGSPGE KAGIKQGDVI TAVNGDPVKD PRDLAKRIAA FPPNTKVDIS
IWRNGKPTAV KVDLGTLPAE KDTASSDEDQ GAPEQNAPAT EQALANLGVT VQRADDGKGL
TITNVDPDSD AADKGLKTGQ KITSVNNQQV SSAAEVKKIL DQAKKDGRTK ALFQVETDNG
SRFIALPINQ G