Gene Information |
Locus tag | Avi_1667 |
Symbol | |
ID | 7386679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 1394574 |
End bp | 1395788 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643650996 |
Product | hypothetical protein |
Protein accession | YP_002549200 |
Protein GI | 222148243 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
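The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 1394574, End bp 1395788
length = gene_length(1394574, 1395788)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1215 bp) and Protein Length (404 aa).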
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATC CGAAACGGCT GACTGAAACG GCCAGCGGTC AAACTTTTGT GGCAGATATC GTCGTGGACG TTCTGCCCGT GATGGAGCCG CTGGAGGCCG ACTGGCGCTG TCTGGAGCGC AACAACCATC TGTCGCTGCA TCAAGGTTAC GACTGGTGCC GCGCCTGGGT GAAAACCCAT GGCAATCCGC TGGCCATTCT GCATGGCAAC AGCAACGGAC GCAGCCTGTT CATCCTACCG CTGGAAATCA CCCGTCACGC CATGGTCCGC AAGGCCAGTT TCATCGCCAC CCGCTTCACC AATATCAATA CCGGCCTGTT CGACCCGGCT TTTCTCCAGC AAATCGAGTC GGACACAGCC AAACAGTTGG GAAGGCAGAT TGTCCAGGCA ATGACCGGGC ATGCCGATCT CGTTCATCTC GGCAATAGTC CGCTCTCCTG GCGTGGATTT ACTCATCCGC TGGTCGGCCT ACCGGCTGTC GAACATCAGA ACCATGCCTT CCAACTGCCG CTTCTAGGCG ACTTCGAACA GACGCTCTCC CAGATCAACG CCAAGCGGCG GCGCAAGAAA TACCGCAATC AGGTCCGCAA GCTGGAAGCA AGCGGCGGCT TTGAGCATAT TATTGCGTGC GGTGAGGAGC AGAAAGCCTG GCTGCTGGAT CTGTTCTTCC GGCAAAAAGC CGTCCGCTTC GAAACCCTCG GTCTGCCGGA CGTGTTTCAG GAGCCGGAGA CACAGGCATT CTTCCAGTTG CTGCTGCAAA GCGAGGCAGG TGGATTGAAC GTGCCACTGG AACTGCATGC ACTGCGGTTG TCGGGCAGCC ATCATAACGG CAAGATTGCC GCCATCGCCG GTCTTTCACG CAAGGGCGAT CACGTTATCT GCCAGTTCGG CTCGATAGAT GAAAGCATCG CCCCGGAGAC CAGCCCCGGC GAATTGCTGT TCTGGCTGAT GATTGAACAA TGCTGCGCCG AGGGGGCCGC CCTGTTCGAC TTCGGCCTCG GCGACCAGAT CTACAAGCGA AGCTGGTGCC CTATGGAAAC CGTGCAACAC GATATTTTGC TGCCCGTGAC CCCTCTGGGG CACCTCGCAG CCACAGCCGA GCGTAGCCTG ACCCGCTCCA AGGCGTTCAT CAAGGGGCAT CCCCAACTCT ATAGCGCGCT GCAAAAACTC CGCGCTCGTA GCGATGCGCA AGCACATGAT CAGGGTAAGG AATGA
|
Protein sequence | MIDPKRLTET ASGQTFVADI VVDVLPVMEP LEADWRCLER NNHLSLHQGY DWCRAWVKTH GNPLAILHGN SNGRSLFILP LEITRHAMVR KASFIATRFT NINTGLFDPA FLQQIESDTA KQLGRQIVQA MTGHADLVHL GNSPLSWRGF THPLVGLPAV EHQNHAFQLP LLGDFEQTLS QINAKRRRKK YRNQVRKLEA SGGFEHIIAC GEEQKAWLLD LFFRQKAVRF ETLGLPDVFQ EPETQAFFQL LLQSEAGGLN VPLELHALRL SGSHHNGKIA AIAGLSRKGD HVICQFGSID ESIAPETSPG ELLFWLMIEQ CCAEGAALFD FGLGDQIYKR SWCPMETVQH DILLPVTPLG HLAATAERSL TRSKAFIKGH PQLYSALQKL RARSDAQAHD QGKE
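The protein sequence can be re-derived from the gene sequence, which is listed in coding orientation (it starts with ATG and ends with TGA although the gene lies on the minus strand). The sketch below is an illustration under stated assumptions, not the database's own pipeline: it relies on translation table 11 assigning the same amino acids to codons as the standard code, and the helper names `clean`, `translate`, and `gc_content` are hypothetical. How the reported GC content was rounded is not stated, so `gc_content` returns a raw fraction.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)
```

For example, `translate("ATGATCGATC CGAAACGG")`, using the first 18 bases of the gene sequence, returns `"MIDPKR"`, which matches the start of the listed protein sequence.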