Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3984 |
Symbol | |
ID | 7387324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3355580 |
End bp | 3356863 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643652714 |
Product | hypothetical protein |
Protein accession | YP_002550889 |
Protein GI | 222149932 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAC TACCGGTGAC CGACGAGACC AGCCGGGAAA CGAGCACCGA GCGCCCGGTT CTGCATACCG TGTCTGCACC GGTCAATCGT CCGCCCGCCG ATGGCCAGAT TGCCGTGGGA CGCCCCGGCC GCGATCTTTG GCTCTATCCA CGCCAGACCA GCTACGACCT TGAACAGGAA ATGGACTACC TGACCAACCG GGCGCTGGAG CAGAATGTCT TCTTCTCGGC CCGCTTTCTG GCCCCCGCCA TCCCCCGCCT GGACGAGCGT GAAGTGCGCA TGGCACTGAT CCGCGACGAG CGGCAGGGTC GCAGCCGGAT CCGGTTGCTG ATGCCTTTCT CCGTGGAAAA ACCGGGATTT GCAGTCGGTC CATCCATTGT TCGCGTCTGG TCCAACCCCT TCGGCCCGCT TGGCACGCCT CTGGTCGATG CAGAGGATGC GGTGGAAACG CTCGACAACC TTTTCGAAGG ACTGAGCGAT CCCAAAGCCA AATTGCCCTC CGTTCTCGTC CTGCCGGATC TGCGAATCGA TGGGCCGGTC ACAAAACTGC TGCGGGCCGT GGCGATCAGC CGTGACCTAC CACTGACCGT GACAAATCCC TACCAGCGCC CAATGCTCGA GAGCCTGGAG GATGGAGAAA CCTATCTAAG CCAGGCCATT GGCAAGTCCC ATTGGCGCGA TATGCGCCGG CAGATGCGCC TGCTGGGTCA GCAGGGCGAA TTGACCTATT CCGTCGCCCG TCAGCCGCAG GATCTGCATG TCCGCATGGA GGAATTCCTG GCGCTTGAAG CCAGCGGCTG GAAAGGCCGC AAGCGTAGCG CCCTTGTCAT GGACCGGCTA CGAGCCGCCT TTGCCCGCGA GGCGATGACC AATCTGGCCG AGCGCGATTC GGTCCGCATC CATACGCTCG ATCTCGATGG AAAAGCCATC GCCTCCATGG TTGTCTTCAT TATGGGTGCC GAAGCCTATA CGTGGAAGAC TGCCTATGAT GAGCGCTATG CGCGATATTC GCCCGGCAAG CTTCTGGTGG CTCGATTGAC AGAATGGCAT CTGGACGATG CCAATATTCT GCGCACCGAT TCCTGCGCCG TACCCGATCA CCCGGTCATG AGCAGGCTCT GGCGGGAGCG GGAAGACATG GGAACCATGG TTATCGGCCT GAAGCGCAAT GCCGACCGCG ATGTCCGCCA GGTGGCGGCG CAACTGCATC TCTATCGCAA CACGCGCAAT ATCGCCCGCA TCCTGCGGGA CAAAGTGCTG GGCAGGCGCC AAAAAGACAG CTGA
|
Protein sequence | MNTLPVTDET SRETSTERPV LHTVSAPVNR PPADGQIAVG RPGRDLWLYP RQTSYDLEQE MDYLTNRALE QNVFFSARFL APAIPRLDER EVRMALIRDE RQGRSRIRLL MPFSVEKPGF AVGPSIVRVW SNPFGPLGTP LVDAEDAVET LDNLFEGLSD PKAKLPSVLV LPDLRIDGPV TKLLRAVAIS RDLPLTVTNP YQRPMLESLE DGETYLSQAI GKSHWRDMRR QMRLLGQQGE LTYSVARQPQ DLHVRMEEFL ALEASGWKGR KRSALVMDRL RAAFAREAMT NLAERDSVRI HTLDLDGKAI ASMVVFIMGA EAYTWKTAYD ERYARYSPGK LLVARLTEWH LDDANILRTD SCAVPDHPVM SRLWREREDM GTMVIGLKRN ADRDVRQVAA QLHLYRNTRN IARILRDKVL GRRQKDS
|
| |