Gene Avi_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3752 
Symbol 
ID7388211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3113434 
End bp3116778 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content61% 
IMG OID643652527 
Producthypothetical protein 
Protein accessionYP_002550708 
Protein GI222149751 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATCA AGGCAAGCAT TTATCATCTC ACCCATTATA AATATGACAC GCCGATCCGC 
CTGGGGCCGC AGATCATTCG CCTGAAACCG GCGGCCCATT CCAGAACAAA GGTTCTGAGC
CACTCCCTCA AGGTCACGCC CGAAAATCAC TTCGTCAATC TCCAGCAGGA CCCCTACGGC
AACTATCTGG CCCGTTTCGT GTTTCCCGAT CCGGTCACCG AGTTCAAGAT CGAAGTCGAT
CTCACCGCCG ATATGAGCAT CTACAATCCC TTCGATTTCT TCACGGAGGA AGAGGCCGTT
ACCTGGCCGT TCGAATACCC GGAGGATATC CGCGAGGATC TGGCGATCTA CAAAAAACCG
GAGCCGGACA GCCCCGCCCT CGATTCCTAT CTCAAGACCC TGGACATGAC GCCGGGCCAA
GGCACGGTCG ATATGATCGT CGCGCTCAAT GCCCGCCTGC AAAGCGAAAT CGGCTATGTC
ATCCGCCTGG AGCCGGGCGT GCAGACACCG GATGAAACGC TGACCTCGGC GCGTGGCTCC
TGCCGAGATA CCAGCTGGCT GCTGGTGCAG ATTCTCCGGC ATCTGGGCAT TGCCGCCCGC
TTCGTCTCCG GCTATCTGAT CCAGCTGAAG CCGGATCTGG AAGCGCTGGA TGGCCCCTCC
GGCACCAAGG TGGATTTCAC CGACCTGCAT GCCTGGGCGG AAGCCTATAT TCCCGGCGCC
GGCTGGATCG GCCTGGACCC GACCTCCGGC CTGATGACCG GCGAAAGCCA TATTCCACTG
GCCGCCACAC CGCATTACAA AAACGCCGCC CCGATTTCCG GCGGCTATTT CGGGCAAGCC
AAGACCGATT TCGACTTTGA TATGAAGGTG ACACGGGTTG CCGAGCATCC GCGCATCACC
AAGCCGTTTT CCGATGAAAG CTGGGAAGCA CTGAACGCGC TTGGCCTGAA GGTCGATGGC
GATCTGAAGG CCCATGACGT GCGCCTGACC ATGGGCGGCG AACCGACCTT CGTGTCGATC
GACGATTTCC AGTCGGCGGA ATGGAATACC GATGCGGTTG GCCCGACCAA GCGGGCGCTG
GCCGATCAGT TGATCCGCAA GCTGCGTACC CGCTTTGCCC CCGGCGGCTT CCTGCATTAC
GGGCAGGGTA AATGGTATCC GGGTGAAAGC CTGCCACGCT GGACCTTCTC GCTCTACTGG
CGCAAGGACG GCAAGCCGAT CTGGCATAAT CCGGACCTGA TTGCCACGGA GACAGCCGAT
ACCAATGTGA GCCATGAGCA GGCCCAGGCG CTGATGGCCG GCATTGCCAC CGAGCTGGAG
ATCGAGCCTG ACATGATCCT CCCGGCCTAT GAAGATCCCG CAGCCTGGAT CATCAAGGAA
GGCAGCCTGC CGGAAAATGT CGATCCGTCC AATTCCAAGC TGGAAAGCCC CGAAGAGCGC
GCCCGCATCG CCAAGGTGTT CGAGCGCGGC TTGACGATCC CGACCGGCTA TATCCTGCCG
GTCCAGGCCT GGAACGCCAA GGCCAGCGGT CGGCGCTGGA TCAGCGAGAA ATGGCGCACC
CGGCGCGGCA AGATCTTCCT GATTCCAGGC GACAGTCCCG TTGGCTTTCG CATGCCGCTC
GGCACCCTGC CCTATGTGCC ACCCTCGCAA TATCCCTATA TTCACACGGC GGACCCATCC
ATCCCACGCA CACCGCTGCC GGATTTCGGC CCGGATGCCC GCGAAGGCCG GGCGCTGTCG
GAAGCCTCGC GCAAAACCAG CGACGCCCAG CAGGACCGCA ACGAACAGAA TATTGCCGGC
TCAACCGGTG ACATAACCGG CGCCGTGCGC ACCGCCATGA GCGTCGAGCC GCGTGATGGC
CGGCTCTGCG TGTTCATGCC GCCGGTGGAG CGGATCGAGG ACTATCTGGA ACTGGTAGCC
GCCGCCGAGA CCGCTGCGCA CAATCTCGGC CTGCCGATCC ACATCGAAGG CTATGCCCCA
CCGCAAGACG AGCGCATCAA TGTCATCCGC GTCGCCCCCG ATCCTGGGGT TATCGAGGTC
AACATCCACC CCGCCGATAG CTGGCAGGAT TGCGTGGCCA CCACCGATAT CATCTATGAA
GAGGCCCGCC AGACGCGGCT TGGCGCCGAT AAGTTCATGA TCGATGGCCG CCATACCGGC
ACCGGCGGCG GCAACCATGT GGTGGTTGGC GGCGCCAATC CCGGCGACAG CCCGTTCCTG
CGCCGCCCGG ATCTGCTGAA AAGCCTGGTC CTGCATTGGC AGCGCCATCC GGCTTTGTCC
TATATGTTCT CGGGCATGTT TATCGGCCCG ACCAGCCAGG CGCCACGCTT TGACGAGGCC
CGCCATGATA CGCTCTATGA GCTGGAAATT GCGCTGGCCC AGATCCCCAT GCCAGACAGT
GGCGCGGCCC CGCCTTTGCC CTGGCTGGTC GACCGGCTAT TTCGCAACCT GCTGACCGAT
GTCACCGGCA ATACGCATCG CTCGGAAATC TGCATCGACA AGCTGTTTTC GCCTGACGGT
CCAACGGGGC GGCTGGGTCT GGTTGAATTC CGTGGCTTTG AAATGCCACC GAATGCCCGC
ATGTCGCTGG CCCAGCAATT GCTGGTGCGG GCGCTGATCG CCAGGTTCTG GAAGAACCCG
ATCGGTGGAA ATTTCGTGCG CTGGGGCACG GCATTGCACG ACCGCTTCAT GCTGCCGCAT
TATCTCTGGC AGGATTTTCT CGAAGTGTTG TCAGACCTGC GCGAACACGG CTTCGACTTC
AAGCCGGAAT GGTTTGCCGC CCAGCTGGAA TTCCGCTTCC CCTTCGTCGG ACAGGTAGAA
TACGAAGACA GCAAACTGGA GCTGCGCCAG GCGCTGGAGC CCTGGCATGT GATGGGCGAG
GAAGGTGCCA TCGGCGGTAC GGTGCGCTAT GTCGATAGTT CCGTTGAGCG TTTACAGGTC
AAGCTGGAGA CCGCCAATCC CGAGCGCTAC ACCATTGCCT GCAATGGCCG CCGTCTGCCG
CTGAAGAAAA GCGGCACAAA TGGCGTGGCC GTCGCCGGTG TTCGCTACAA GGCCTGGCAA
CCGGCATCAG GCCTGCATCC GGTCCTGCCT GTAAACACAC CGCTAACATT CGACGTTTAT
GATATATGGA CAGGGCGGTC GATCGGTGGT TGTGTGTATC ATGTCGCGCA TCCCGGTGGT
CGCAGTTATG ATACTTTCCC TGTGAATGGC AATGAAGCGG AGGCTAGGCG GCTTGCGCGG
TTCGAACCCT GGGGCCATAC AGCCGGATCG TATCCGCTGT GGCCGGAAGC CGTCTCGCCG
GAATTTCCGC ACACATTGGA TTTGCGGCGA CCACATGGGA TCTAA
 
Protein sequence
MAIKASIYHL THYKYDTPIR LGPQIIRLKP AAHSRTKVLS HSLKVTPENH FVNLQQDPYG 
NYLARFVFPD PVTEFKIEVD LTADMSIYNP FDFFTEEEAV TWPFEYPEDI REDLAIYKKP
EPDSPALDSY LKTLDMTPGQ GTVDMIVALN ARLQSEIGYV IRLEPGVQTP DETLTSARGS
CRDTSWLLVQ ILRHLGIAAR FVSGYLIQLK PDLEALDGPS GTKVDFTDLH AWAEAYIPGA
GWIGLDPTSG LMTGESHIPL AATPHYKNAA PISGGYFGQA KTDFDFDMKV TRVAEHPRIT
KPFSDESWEA LNALGLKVDG DLKAHDVRLT MGGEPTFVSI DDFQSAEWNT DAVGPTKRAL
ADQLIRKLRT RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RKDGKPIWHN PDLIATETAD
TNVSHEQAQA LMAGIATELE IEPDMILPAY EDPAAWIIKE GSLPENVDPS NSKLESPEER
ARIAKVFERG LTIPTGYILP VQAWNAKASG RRWISEKWRT RRGKIFLIPG DSPVGFRMPL
GTLPYVPPSQ YPYIHTADPS IPRTPLPDFG PDAREGRALS EASRKTSDAQ QDRNEQNIAG
STGDITGAVR TAMSVEPRDG RLCVFMPPVE RIEDYLELVA AAETAAHNLG LPIHIEGYAP
PQDERINVIR VAPDPGVIEV NIHPADSWQD CVATTDIIYE EARQTRLGAD KFMIDGRHTG
TGGGNHVVVG GANPGDSPFL RRPDLLKSLV LHWQRHPALS YMFSGMFIGP TSQAPRFDEA
RHDTLYELEI ALAQIPMPDS GAAPPLPWLV DRLFRNLLTD VTGNTHRSEI CIDKLFSPDG
PTGRLGLVEF RGFEMPPNAR MSLAQQLLVR ALIARFWKNP IGGNFVRWGT ALHDRFMLPH
YLWQDFLEVL SDLREHGFDF KPEWFAAQLE FRFPFVGQVE YEDSKLELRQ ALEPWHVMGE
EGAIGGTVRY VDSSVERLQV KLETANPERY TIACNGRRLP LKKSGTNGVA VAGVRYKAWQ
PASGLHPVLP VNTPLTFDVY DIWTGRSIGG CVYHVAHPGG RSYDTFPVNG NEAEARRLAR
FEPWGHTAGS YPLWPEAVSP EFPHTLDLRR PHGI