Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3752 |
Symbol | |
ID | 7388211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3113434 |
End bp | 3116778 |
Gene Length | 3345 bp |
Protein Length | 1114 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643652527 |
Product | hypothetical protein |
Protein accession | YP_002550708 |
Protein GI | 222149751 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.139209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATCA AGGCAAGCAT TTATCATCTC ACCCATTATA AATATGACAC GCCGATCCGC CTGGGGCCGC AGATCATTCG CCTGAAACCG GCGGCCCATT CCAGAACAAA GGTTCTGAGC CACTCCCTCA AGGTCACGCC CGAAAATCAC TTCGTCAATC TCCAGCAGGA CCCCTACGGC AACTATCTGG CCCGTTTCGT GTTTCCCGAT CCGGTCACCG AGTTCAAGAT CGAAGTCGAT CTCACCGCCG ATATGAGCAT CTACAATCCC TTCGATTTCT TCACGGAGGA AGAGGCCGTT ACCTGGCCGT TCGAATACCC GGAGGATATC CGCGAGGATC TGGCGATCTA CAAAAAACCG GAGCCGGACA GCCCCGCCCT CGATTCCTAT CTCAAGACCC TGGACATGAC GCCGGGCCAA GGCACGGTCG ATATGATCGT CGCGCTCAAT GCCCGCCTGC AAAGCGAAAT CGGCTATGTC ATCCGCCTGG AGCCGGGCGT GCAGACACCG GATGAAACGC TGACCTCGGC GCGTGGCTCC TGCCGAGATA CCAGCTGGCT GCTGGTGCAG ATTCTCCGGC ATCTGGGCAT TGCCGCCCGC TTCGTCTCCG GCTATCTGAT CCAGCTGAAG CCGGATCTGG AAGCGCTGGA TGGCCCCTCC GGCACCAAGG TGGATTTCAC CGACCTGCAT GCCTGGGCGG AAGCCTATAT TCCCGGCGCC GGCTGGATCG GCCTGGACCC GACCTCCGGC CTGATGACCG GCGAAAGCCA TATTCCACTG GCCGCCACAC CGCATTACAA AAACGCCGCC CCGATTTCCG GCGGCTATTT CGGGCAAGCC AAGACCGATT TCGACTTTGA TATGAAGGTG ACACGGGTTG CCGAGCATCC GCGCATCACC AAGCCGTTTT CCGATGAAAG CTGGGAAGCA CTGAACGCGC TTGGCCTGAA GGTCGATGGC GATCTGAAGG CCCATGACGT GCGCCTGACC ATGGGCGGCG AACCGACCTT CGTGTCGATC GACGATTTCC AGTCGGCGGA ATGGAATACC GATGCGGTTG GCCCGACCAA GCGGGCGCTG GCCGATCAGT TGATCCGCAA GCTGCGTACC CGCTTTGCCC CCGGCGGCTT CCTGCATTAC GGGCAGGGTA AATGGTATCC GGGTGAAAGC CTGCCACGCT GGACCTTCTC GCTCTACTGG CGCAAGGACG GCAAGCCGAT CTGGCATAAT CCGGACCTGA TTGCCACGGA GACAGCCGAT ACCAATGTGA GCCATGAGCA GGCCCAGGCG CTGATGGCCG GCATTGCCAC CGAGCTGGAG ATCGAGCCTG ACATGATCCT CCCGGCCTAT GAAGATCCCG CAGCCTGGAT CATCAAGGAA GGCAGCCTGC CGGAAAATGT CGATCCGTCC AATTCCAAGC TGGAAAGCCC CGAAGAGCGC GCCCGCATCG CCAAGGTGTT CGAGCGCGGC TTGACGATCC CGACCGGCTA TATCCTGCCG GTCCAGGCCT GGAACGCCAA GGCCAGCGGT CGGCGCTGGA TCAGCGAGAA ATGGCGCACC CGGCGCGGCA AGATCTTCCT GATTCCAGGC GACAGTCCCG TTGGCTTTCG CATGCCGCTC GGCACCCTGC CCTATGTGCC ACCCTCGCAA TATCCCTATA TTCACACGGC GGACCCATCC ATCCCACGCA CACCGCTGCC GGATTTCGGC CCGGATGCCC GCGAAGGCCG GGCGCTGTCG GAAGCCTCGC GCAAAACCAG CGACGCCCAG CAGGACCGCA ACGAACAGAA TATTGCCGGC TCAACCGGTG ACATAACCGG CGCCGTGCGC ACCGCCATGA GCGTCGAGCC GCGTGATGGC CGGCTCTGCG TGTTCATGCC GCCGGTGGAG CGGATCGAGG ACTATCTGGA ACTGGTAGCC GCCGCCGAGA CCGCTGCGCA CAATCTCGGC CTGCCGATCC ACATCGAAGG CTATGCCCCA CCGCAAGACG AGCGCATCAA TGTCATCCGC GTCGCCCCCG ATCCTGGGGT TATCGAGGTC AACATCCACC CCGCCGATAG CTGGCAGGAT TGCGTGGCCA CCACCGATAT CATCTATGAA GAGGCCCGCC AGACGCGGCT TGGCGCCGAT AAGTTCATGA TCGATGGCCG CCATACCGGC ACCGGCGGCG GCAACCATGT GGTGGTTGGC GGCGCCAATC CCGGCGACAG CCCGTTCCTG CGCCGCCCGG ATCTGCTGAA AAGCCTGGTC CTGCATTGGC AGCGCCATCC GGCTTTGTCC TATATGTTCT CGGGCATGTT TATCGGCCCG ACCAGCCAGG CGCCACGCTT TGACGAGGCC CGCCATGATA CGCTCTATGA GCTGGAAATT GCGCTGGCCC AGATCCCCAT GCCAGACAGT GGCGCGGCCC CGCCTTTGCC CTGGCTGGTC GACCGGCTAT TTCGCAACCT GCTGACCGAT GTCACCGGCA ATACGCATCG CTCGGAAATC TGCATCGACA AGCTGTTTTC GCCTGACGGT CCAACGGGGC GGCTGGGTCT GGTTGAATTC CGTGGCTTTG AAATGCCACC GAATGCCCGC ATGTCGCTGG CCCAGCAATT GCTGGTGCGG GCGCTGATCG CCAGGTTCTG GAAGAACCCG ATCGGTGGAA ATTTCGTGCG CTGGGGCACG GCATTGCACG ACCGCTTCAT GCTGCCGCAT TATCTCTGGC AGGATTTTCT CGAAGTGTTG TCAGACCTGC GCGAACACGG CTTCGACTTC AAGCCGGAAT GGTTTGCCGC CCAGCTGGAA TTCCGCTTCC CCTTCGTCGG ACAGGTAGAA TACGAAGACA GCAAACTGGA GCTGCGCCAG GCGCTGGAGC CCTGGCATGT GATGGGCGAG GAAGGTGCCA TCGGCGGTAC GGTGCGCTAT GTCGATAGTT CCGTTGAGCG TTTACAGGTC AAGCTGGAGA CCGCCAATCC CGAGCGCTAC ACCATTGCCT GCAATGGCCG CCGTCTGCCG CTGAAGAAAA GCGGCACAAA TGGCGTGGCC GTCGCCGGTG TTCGCTACAA GGCCTGGCAA CCGGCATCAG GCCTGCATCC GGTCCTGCCT GTAAACACAC CGCTAACATT CGACGTTTAT GATATATGGA CAGGGCGGTC GATCGGTGGT TGTGTGTATC ATGTCGCGCA TCCCGGTGGT CGCAGTTATG ATACTTTCCC TGTGAATGGC AATGAAGCGG AGGCTAGGCG GCTTGCGCGG TTCGAACCCT GGGGCCATAC AGCCGGATCG TATCCGCTGT GGCCGGAAGC CGTCTCGCCG GAATTTCCGC ACACATTGGA TTTGCGGCGA CCACATGGGA TCTAA
|
Protein sequence | MAIKASIYHL THYKYDTPIR LGPQIIRLKP AAHSRTKVLS HSLKVTPENH FVNLQQDPYG NYLARFVFPD PVTEFKIEVD LTADMSIYNP FDFFTEEEAV TWPFEYPEDI REDLAIYKKP EPDSPALDSY LKTLDMTPGQ GTVDMIVALN ARLQSEIGYV IRLEPGVQTP DETLTSARGS CRDTSWLLVQ ILRHLGIAAR FVSGYLIQLK PDLEALDGPS GTKVDFTDLH AWAEAYIPGA GWIGLDPTSG LMTGESHIPL AATPHYKNAA PISGGYFGQA KTDFDFDMKV TRVAEHPRIT KPFSDESWEA LNALGLKVDG DLKAHDVRLT MGGEPTFVSI DDFQSAEWNT DAVGPTKRAL ADQLIRKLRT RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RKDGKPIWHN PDLIATETAD TNVSHEQAQA LMAGIATELE IEPDMILPAY EDPAAWIIKE GSLPENVDPS NSKLESPEER ARIAKVFERG LTIPTGYILP VQAWNAKASG RRWISEKWRT RRGKIFLIPG DSPVGFRMPL GTLPYVPPSQ YPYIHTADPS IPRTPLPDFG PDAREGRALS EASRKTSDAQ QDRNEQNIAG STGDITGAVR TAMSVEPRDG RLCVFMPPVE RIEDYLELVA AAETAAHNLG LPIHIEGYAP PQDERINVIR VAPDPGVIEV NIHPADSWQD CVATTDIIYE EARQTRLGAD KFMIDGRHTG TGGGNHVVVG GANPGDSPFL RRPDLLKSLV LHWQRHPALS YMFSGMFIGP TSQAPRFDEA RHDTLYELEI ALAQIPMPDS GAAPPLPWLV DRLFRNLLTD VTGNTHRSEI CIDKLFSPDG PTGRLGLVEF RGFEMPPNAR MSLAQQLLVR ALIARFWKNP IGGNFVRWGT ALHDRFMLPH YLWQDFLEVL SDLREHGFDF KPEWFAAQLE FRFPFVGQVE YEDSKLELRQ ALEPWHVMGE EGAIGGTVRY VDSSVERLQV KLETANPERY TIACNGRRLP LKKSGTNGVA VAGVRYKAWQ PASGLHPVLP VNTPLTFDVY DIWTGRSIGG CVYHVAHPGG RSYDTFPVNG NEAEARRLAR FEPWGHTAGS YPLWPEAVSP EFPHTLDLRR PHGI
|
| |