Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_7274 |
Symbol | |
ID | 7380500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011981 |
Strand | + |
Start bp | 230317 |
End bp | 233289 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643641364 |
Product | hypothetical protein |
Protein accession | YP_002539661 |
Protein GI | 222102622 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAT TGCTGCCAGC ATTCCGTCGC CGATTATGTG TGTTCACGGC ACTGGCCTCC ACCGTGTTGC CCGCGCATTG CTTCGCGCAA TCAGCATGGT CTGGAAATGC TGACAACGAA TTTAGCAATG CATCGAACTG GTCCCCTTCG GCGCCCGGGG CAGGCGATGA TGCGCATGTC GATACAGGCT CTCCACAAGT TACAAACGAC GTGACAGTCA ATCGTCTGGA GGTTGGTGGA GGAAACGTCA CCATCACCGA CACAGGTGCA TTGACCGCCA CGAACGGCAC CACGATCACA GCAGGCAGCG TTAGCGTCAA TGCGGGCGGA GTAATGAACT CCAACGTCAG CCTTGATGGC GGTAGCCTTT CCATCGATGG CGACCTCAAT GGCCAGTTGA CGCTCAACAA CGGCAACGTC ACCGTGAACG GTACGCTCAG TGGCGCCATG GTAGAGACAG GGACCGCGCT GTCCAACAAC GGCACTGTTG ATGAGGTCAA TATCTCCAAA GGAGGGACCT TCGTCAATAA TAGCGGCGTA ACCGCGGGCG CAGTTACCAA TGCCGGGACG ACATCCAACG CCGGAACGAT CGGGAGCCTG ACCAATACCG CCGGAAATTT TACCAATAAT GCTGGCGGGA CGATTTCGGG AAAGACGACG GTTTCGGGTG GAACTGTCAC CAACAATTTC GTCGTTACCG ACGCCGATGT CGCGGCAGCG GCGGCCTTCA TCAACAACAC CGGTGCAACA GCCGGCGCAA TCAGAAATTC AGGCACCGTC GTCAATGCGG GAACCATTGT CTCCCTACAG AACGATGCCG GTACATTCAC CAATGATGCC GGCGGGGTGG TTACGGGGGA TAGCACTATT ACCGGTGGCA GCGTGACGAA CAACGCCACC CTTCACGACG TCGATGTCGG AGCAGATGCA ACATTCACCA ATGCCAACGG CGCCACCGCC GGTGTCGTCG TCAATGCAGG CAATTCGTCG AATGCCGGGA CTATTGATAG CCTGACCAAT ACGGCTGGAA ATTTTACCAA CAATGCAGGC GGTACGATTA CCGGCAAATC AACAATCACA GGCGGAACCG TCACCAATAA CTTCGTCATC ACCGATGCGG ACGTTGCCGC CGCCGCAATA TTCGTCAACA ACTCGGGAGC CACCGCTGGC ACGATAAGAA ACTCCGGCAC GGTTACCAAT GCGGGGACCG TCGCGGCGCT TCAAAATGAT GCCGGCACTT TCACGAACAA TTCTGGCGGT ACCGTTACGG GTACAACAAC GATCAACGGA GGTCGTGTTG TCAACAATGC CACATTGGCA GACGTCGATA ATGAGGCGGC GGGCGAATTC ATCAACAACA ACGGCGCTGT GGCAGGCACG GTTACCAATT CTGGCAGTGC CTCGAACGAC GGAACAATCG CCGCGCTCGT CAACACTGAC GGCATCTTTT CCAATACCGG TACAATTGAT GGAACAGCCA CGATTTCGGG CGGTTCACTG ATCAACGATG GAACCGTCGC CGGAGCAGTC GTCATCGATG AGGGCGGGCT TCTTTCAGGC AACGGGACCA TTGGCGAGCT TTTCGTCAAC GCAGGCGGTG TACTTTCACC CGGGCGAGAC ATAGGAACAC TCACCGTCGA CGGGGATGTG ACCTTTAGCA CTGGCTCAAT CTACCAGGTC GACATCGATG CCAACGGCGC CTCCGACCGG GTGGATGCGA CAGGCACGGT CTCCATACAG GGGGGAACGC TTGAAATTAG AGCGGTGGGT GGCAATTACG GGCTGACAAC CACTTATACC ATCCTGAGCG CAAGCCGCAT CACGGGCACC TTCGACAGCG TTTCAAGCGA TTTTGCTTTC CTGACGCCAA CGTTGACCTA TGGTGCTGCG ACGATAGACA TGCGTTTCGA CCGCAATAAC GTGCAATTCT CAGATGTCGC GGATACGGCA AACGGTCGCG CAACCGCGGT TGCCGTTGAA GCATTGGGAA CAGGAAACTC GATTTACGAT GCGGTCTTGT CACTGAATTC TTCCACCGCA AACAGTGCCT TCTCGCAGTT GAGCGGTGAG GTTCATGCCT CGCTCAAAAG CGCTCTGCTG TGGGAGAGCC GTTTTGCACG CGACGCGGTG CTTGACCAAA TGGCCATGGA TCTTGATCAG CGGAAGGATG ACGTAACGGT CTGGACCAAC AGTTTTCTGT CGTCCAACCG TTGGTCAGGG AATGGAAATG CAGCCGGAAT CGATACCCGC ACCGCAGGCG TAGTCATGGG AGTGGATGCG TCAGTCTCCG ACCCGTGGCG TCTAGGCGGC CTGTTGGGTT ACAGCCATGA CTCGTTGGAG CAGGTGAAGA CAGAGTCTTA TCACGCCGGT CTCTATGCTA CGGGTGATAT TGGTCCCTTG AACGTCATTG GCGGCGCCAT CTTTTCCCAT AACGATGCGT CAACGCTGCG CGATATCTCC TTTGGAACGA TTACCAGCCA GCTCACCGCT GACTATGCCA GTGCCACCAG CCAGGTTTTC GCAGATCTCT CTTCGACGCA TGAACTGGAT GCGATCAAGC TCCAGCCCTT CGCCAACCTC GCTTACGTCA ATCTCAAGAC TGACGCCTTC CGCGAAAATG GCGGCGATGC GAGCATGTTG GCAGCAAAAA GCAGTGACGA GATCGCAACC TCCACGATCG GTCTGCGCTG GTCTTGGAGG TGGCCGGAAG ACAATTTACC CGTTGCCGTC TCCGGCATGC TGGGCTGGCG GCATATCGAG GGTGACGTGT CGCCGTATTC TAGCGTCGCG TTTTCTGGCG GCACGCCATT TGTCGTCGAA GGCGTGGAAA TGCCAAAGGA TGCTCTGCTC GCAAAATTCG GGGTTTCCGC CAGGCTTTCG AAGTCGGCTC GCCTGACGTT CAGCTATTCC GGCGAGTTTG GCAAAGGCCT GCGGTCCAGC GCAGCGCAGG TCAATCTGGC GGGGAGTTTT TAA
|
Protein sequence | MKPLLPAFRR RLCVFTALAS TVLPAHCFAQ SAWSGNADNE FSNASNWSPS APGAGDDAHV DTGSPQVTND VTVNRLEVGG GNVTITDTGA LTATNGTTIT AGSVSVNAGG VMNSNVSLDG GSLSIDGDLN GQLTLNNGNV TVNGTLSGAM VETGTALSNN GTVDEVNISK GGTFVNNSGV TAGAVTNAGT TSNAGTIGSL TNTAGNFTNN AGGTISGKTT VSGGTVTNNF VVTDADVAAA AAFINNTGAT AGAIRNSGTV VNAGTIVSLQ NDAGTFTNDA GGVVTGDSTI TGGSVTNNAT LHDVDVGADA TFTNANGATA GVVVNAGNSS NAGTIDSLTN TAGNFTNNAG GTITGKSTIT GGTVTNNFVI TDADVAAAAI FVNNSGATAG TIRNSGTVTN AGTVAALQND AGTFTNNSGG TVTGTTTING GRVVNNATLA DVDNEAAGEF INNNGAVAGT VTNSGSASND GTIAALVNTD GIFSNTGTID GTATISGGSL INDGTVAGAV VIDEGGLLSG NGTIGELFVN AGGVLSPGRD IGTLTVDGDV TFSTGSIYQV DIDANGASDR VDATGTVSIQ GGTLEIRAVG GNYGLTTTYT ILSASRITGT FDSVSSDFAF LTPTLTYGAA TIDMRFDRNN VQFSDVADTA NGRATAVAVE ALGTGNSIYD AVLSLNSSTA NSAFSQLSGE VHASLKSALL WESRFARDAV LDQMAMDLDQ RKDDVTVWTN SFLSSNRWSG NGNAAGIDTR TAGVVMGVDA SVSDPWRLGG LLGYSHDSLE QVKTESYHAG LYATGDIGPL NVIGGAIFSH NDASTLRDIS FGTITSQLTA DYASATSQVF ADLSSTHELD AIKLQPFANL AYVNLKTDAF RENGGDASML AAKSSDEIAT STIGLRWSWR WPEDNLPVAV SGMLGWRHIE GDVSPYSSVA FSGGTPFVVE GVEMPKDALL AKFGVSARLS KSARLTFSYS GEFGKGLRSS AAQVNLAGSF
|
| |