Gene Avi_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_2201 
Symbol 
ID7387907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1807666 
End bp1809591 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content60% 
IMG OID643651388 
Productserine protease 
Protein accessionYP_002549582 
Protein GI222148625 
COG category[I] Lipid transport and metabolism 
COG ID[COG0671] Membrane-associated phospholipid phosphatase 
TIGRFAM ID[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TCTCTTCAAT CGCCGCTTTC GCGGCTGTCC TTTCACTTGC AACAGCGCTG 
ACGCCAGCCT TCGCAGATGG CCTCACCCTG CCCCCAGTTC CAGCCGGTGT CGGCCATGCC
GATAGCGCGC CGCCACCGGC AGGCGTCATG GCCTATGTGG ATACCGGCGC CACCAACCAG
CGCGGCGATG CCTGCCATGC GACACCACAG ACCAATGCCG GCGTGCGTGT TCTGTCCGGC
TTCCTGGAGC TTTGGACCCC TCGTACGCCC TTTGTCGACG CCGATCAGGA AGCACCGGCC
AAGGATGGCT GCCCGGCGAT TGCCAAATCG GACTGGAGCG GCATTCCCGG CAGCCCGACC
GATGGGGTCA AGAAGCTCCC GAAATTGCAT GAGCAGAATC TCGCCTATTC CATCAAGGTC
ACCGGCGAAC ATACGCCGGA ACGCGATCTG GCCGCCTATC TGGATGACCG GCGGGGCAAG
AATGTCAGCA TTACCGATGG TCTTGGTCCC CTGGCCGATG CCTGGCGCCA GGGCGTGCGT
CAGACCACGA CGATCACCGG CATGCCCGCT GACGCGACCA CCGTCAAATA TGACGACAAG
GGCAATAATC GCGGCGTCGG CTCCAAGGAC AATACAGATC TCGGCAAGGC CGTCGATCTG
ATCGAGGCGG GCAGTGCCGA TGGTTCAACT GAGCCTGCCA AGCGCTATTA CAAATATGCC
CGTCCTTACC GCGCCAGCGA CAAGGTGCGG ATCGTACCGC AACTGGAACT GGCCAAGAGC
GACAAACCGG CCTCCGACGG CGGCTTCCCC TCCGGCCATA CCGCCGAAGC CTGGCGCGAT
GCGCTGGTGA TGGCCTATCT GGTGCCGCAG CGCTATCAGG AAATGCTGAC CCGCGCCGCC
ATGCTCGGCG AAAACCGCAT TCGCGCTGGC ATGCACCAGA CCTTCGACGT GCTGGGCGGG
CGGGTGTTGG CCACCGCCAT CGTCGCCTAT AATCTCAACC GGCCGGACTA TACGCCGCTG
CGCAGCGAAG CCTATCAGCA GAGCCAGACA TGGCTGATGA AGCAAACCGG TGCAAAAGAT
GGCCAGGCGC TGCTGGCGGC GGCCCATGCT CTTCCGAAAT CGACGGATGC CTATGCCGAT
TACGCCTGGA ACAAGCAATT TTTCGAACCA CGCCTGACCT ATGGCTACAA GCAGATCGGC
GATCCATCCT TGGCGCCCAG CGTGCCCAAG GGTGCCGAAG TGCTGCTGGA AACCCGCCTG
CCCTATCTTA GTGCCGATCA GCGCCGGGTC GTCTTGAAAA CCACGGAAAT TGCCTCTGGC
TATCCAATCA TCAATGATCC GGAAGGCTGG GGTCGTCTCG ACCTGTTCCG CGCCGCCGAT
GGTTATGGTG CGTTCGACGG TGATGTTACC CTTATTATGG ATGGTACCAA AGGCGGCTTT
AACGCCGACG ACACCTGGAA AAACCCGATT TCCGGCAAAG GCAAGCTGAC CAAGCAGGGC
AGCGGCACAC TGACGCTGTC AGCCAACAAC AGCTGGAGCG GCGGCACGGT GATCGAGGAC
GGTCGCCTTG TTGCGCAATC ACCGACTGCC TTTGGCAAGG GCGATGTGTA TCTTGCTGGT
GGCACGATGG ACATTGCCTC CGCGCCTCTG ACCGTGACCG GCACCCTGAC GCTGCGCAAG
GATGCGACCC TGGAAATTAC CTCCACCAAA GCCACGAAAG CGCCAAGCCT TGCTGTCAGC
AAGACGCTGT TCATCGACGG TGGAAAACTG GTGGTCAAGC CTAACGGTCA ATGGAAAGCC
GGACAGACCA TCAAGTTGAT CACTGCCACC AGGATTGCTG GAAAATTCGG CGCTATCGAA
GTTGATGGCC ATAAGGTCAA GGCGGTTTAC GGTAAGAAAA CCATCTCCCT GCGCATCGAA
GGATAA
 
Protein sequence
MKKLSSIAAF AAVLSLATAL TPAFADGLTL PPVPAGVGHA DSAPPPAGVM AYVDTGATNQ 
RGDACHATPQ TNAGVRVLSG FLELWTPRTP FVDADQEAPA KDGCPAIAKS DWSGIPGSPT
DGVKKLPKLH EQNLAYSIKV TGEHTPERDL AAYLDDRRGK NVSITDGLGP LADAWRQGVR
QTTTITGMPA DATTVKYDDK GNNRGVGSKD NTDLGKAVDL IEAGSADGST EPAKRYYKYA
RPYRASDKVR IVPQLELAKS DKPASDGGFP SGHTAEAWRD ALVMAYLVPQ RYQEMLTRAA
MLGENRIRAG MHQTFDVLGG RVLATAIVAY NLNRPDYTPL RSEAYQQSQT WLMKQTGAKD
GQALLAAAHA LPKSTDAYAD YAWNKQFFEP RLTYGYKQIG DPSLAPSVPK GAEVLLETRL
PYLSADQRRV VLKTTEIASG YPIINDPEGW GRLDLFRAAD GYGAFDGDVT LIMDGTKGGF
NADDTWKNPI SGKGKLTKQG SGTLTLSANN SWSGGTVIED GRLVAQSPTA FGKGDVYLAG
GTMDIASAPL TVTGTLTLRK DATLEITSTK ATKAPSLAVS KTLFIDGGKL VVKPNGQWKA
GQTIKLITAT RIAGKFGAIE VDGHKVKAVY GKKTISLRIE G