Gene Avi_3984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3984 
Symbol 
ID7387324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3355580 
End bp3356863 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content61% 
IMG OID643652714 
Producthypothetical protein 
Protein accessionYP_002550889 
Protein GI222149932 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAC TACCGGTGAC CGACGAGACC AGCCGGGAAA CGAGCACCGA GCGCCCGGTT 
CTGCATACCG TGTCTGCACC GGTCAATCGT CCGCCCGCCG ATGGCCAGAT TGCCGTGGGA
CGCCCCGGCC GCGATCTTTG GCTCTATCCA CGCCAGACCA GCTACGACCT TGAACAGGAA
ATGGACTACC TGACCAACCG GGCGCTGGAG CAGAATGTCT TCTTCTCGGC CCGCTTTCTG
GCCCCCGCCA TCCCCCGCCT GGACGAGCGT GAAGTGCGCA TGGCACTGAT CCGCGACGAG
CGGCAGGGTC GCAGCCGGAT CCGGTTGCTG ATGCCTTTCT CCGTGGAAAA ACCGGGATTT
GCAGTCGGTC CATCCATTGT TCGCGTCTGG TCCAACCCCT TCGGCCCGCT TGGCACGCCT
CTGGTCGATG CAGAGGATGC GGTGGAAACG CTCGACAACC TTTTCGAAGG ACTGAGCGAT
CCCAAAGCCA AATTGCCCTC CGTTCTCGTC CTGCCGGATC TGCGAATCGA TGGGCCGGTC
ACAAAACTGC TGCGGGCCGT GGCGATCAGC CGTGACCTAC CACTGACCGT GACAAATCCC
TACCAGCGCC CAATGCTCGA GAGCCTGGAG GATGGAGAAA CCTATCTAAG CCAGGCCATT
GGCAAGTCCC ATTGGCGCGA TATGCGCCGG CAGATGCGCC TGCTGGGTCA GCAGGGCGAA
TTGACCTATT CCGTCGCCCG TCAGCCGCAG GATCTGCATG TCCGCATGGA GGAATTCCTG
GCGCTTGAAG CCAGCGGCTG GAAAGGCCGC AAGCGTAGCG CCCTTGTCAT GGACCGGCTA
CGAGCCGCCT TTGCCCGCGA GGCGATGACC AATCTGGCCG AGCGCGATTC GGTCCGCATC
CATACGCTCG ATCTCGATGG AAAAGCCATC GCCTCCATGG TTGTCTTCAT TATGGGTGCC
GAAGCCTATA CGTGGAAGAC TGCCTATGAT GAGCGCTATG CGCGATATTC GCCCGGCAAG
CTTCTGGTGG CTCGATTGAC AGAATGGCAT CTGGACGATG CCAATATTCT GCGCACCGAT
TCCTGCGCCG TACCCGATCA CCCGGTCATG AGCAGGCTCT GGCGGGAGCG GGAAGACATG
GGAACCATGG TTATCGGCCT GAAGCGCAAT GCCGACCGCG ATGTCCGCCA GGTGGCGGCG
CAACTGCATC TCTATCGCAA CACGCGCAAT ATCGCCCGCA TCCTGCGGGA CAAAGTGCTG
GGCAGGCGCC AAAAAGACAG CTGA
 
Protein sequence
MNTLPVTDET SRETSTERPV LHTVSAPVNR PPADGQIAVG RPGRDLWLYP RQTSYDLEQE 
MDYLTNRALE QNVFFSARFL APAIPRLDER EVRMALIRDE RQGRSRIRLL MPFSVEKPGF
AVGPSIVRVW SNPFGPLGTP LVDAEDAVET LDNLFEGLSD PKAKLPSVLV LPDLRIDGPV
TKLLRAVAIS RDLPLTVTNP YQRPMLESLE DGETYLSQAI GKSHWRDMRR QMRLLGQQGE
LTYSVARQPQ DLHVRMEEFL ALEASGWKGR KRSALVMDRL RAAFAREAMT NLAERDSVRI
HTLDLDGKAI ASMVVFIMGA EAYTWKTAYD ERYARYSPGK LLVARLTEWH LDDANILRTD
SCAVPDHPVM SRLWREREDM GTMVIGLKRN ADRDVRQVAA QLHLYRNTRN IARILRDKVL
GRRQKDS