Gene Avi_5534 details


Gene Information

Locus tag: Avi_5534
Symbol: celB
ID: 7381447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 527966
End bp: 530242
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 59%
IMG OID: 643649119
Product: celB protein
Protein accession: YP_002547356
Protein GI: 222106565
COG category:
COG ID:
TIGRFAM ID:
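
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 527966
END_BP = 530242
GENE_LENGTH_BP = 2277
PROTEIN_LENGTH_AA = 758


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon.

    Assumes the CDS length includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons hold for the values listed here: 530242 - 527966 + 1 = 2277, and 2277 / 3 - 1 = 758.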


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCCCG AAAGCAATAC AATGCCGGGC GCGCCGATAC CCGGCCTGCC CGCGACCAAG 
GCACCTGTCC CCGTCCCGAC TATGCCGATC CCCGGCATTC CCCTGTCCAA CCGCCCGGCA
CAACAGCAAA GCGAGAGCCT GTCACGCCGT TACCTGCTGA CAACACCGGT TCTCAAGCTG
CAAGGCGAAC TGGCCCGGCA GGCGATGTCG ATTTATCTGA CACCGGAACA AGCCTCAGCC
GAGGCAAAGC TGGTGCTGAG CTATAGCAAC GCGCTGGTGG TTGCGCCGGA AGCCTCCAAC
ATTCTTATCA CCATCAATGA CACGCCGGTT CTAAACCTGC CGATCAGAGG CGGCCAGTCT
GCGCAGCAAA ATACGATTTC CGTGCCGCGA GGCGTTCTTG TTCCGGGCTT TAACCGGATC
AGTTTTGCCG CGGAGCAAAG GCACCGCACC GACTGCACCA TCCAATCCAC CTATGAATTG
TGGACGGAAG TGGATGCATC AAAAACCTAT CTGACATTTG CGGACCCTAA CGCCAACGGC
ATGAAACGTC TCGACGATGT GCGCGCACTT GGTGTCGATG AAAAAGGCCG CAGCCATTTC
ACCATCGTTT CTCCGGATTT CGAACAGCCA TCCGCCACTC CCGCTTTGCT CCAGTTGGCC
CAAGGCCTGT CACTGCTGAG CGGCGTTGCC AATGGCTCCT TCAGCTTCAC GAGAACCATG
CCCGCACGAC CGGCGCCCGG GGAGCTTGTG GTCGTGGCGG CCACCACATC GCAAATGGCG
GGCCTGATCG GCATCAACAA TGCCGTAAAC GCGGATACCA GAAAGCTGAA CGGCACGATG
GCTGGCTTCA TGGCGCTTCC GGGCCGCCCA GGAATGTCGG TCCTCGCCTT CAGCGGCCCG
GATTGGCGCG ACATCAGGGC GATTGCCGAT AACTTTGTCC AGTTTGCCCA AGACGGCGAG
CGGCGGATGT TGAAAACCTC GGCCTGGCGC GGTGTCGATA CGCCTATTCT GGACAAGGCC
GGGGCGCTGT CCTTTGCCGC GCTCGGTCTT CCCGACCAGG AATTTTCAGG TCGCCGCTTT
TTCACGGATT TCACGGTGGG CATGCCAGCC GATTTCTATG CCAACCATTA TGGGACCGCG
ACCATATTGC TAGACGCCGC CTATTCGAGC AAGGTCTTGC CGGGTAGCCA GATCGTGGTA
TCGGTCAATG GTCATCTGGC GACCACCGTT CCGATCACGT CCAAAGGAGG CGCGGTTCTC
CGACACTATC CGATCCGCGT TACCCTGCGC CATTTTCATC CGGGCGTGAA TGTCATCGGG
CTGGAAGCCG TGATGATGAC CCAGGAGGAC AGTGTCTGCG CACCTGGTGA GACAGCGGAT
AAAACGGCGC GGTTTGCCTT GTTCGGCTCT TCGCAATTCG TCATGCCACC GATCGCGCAT
CTGACCCAGG CTCCCGATCT CGCTGCGACA GCGGGTCTCG GCTTTCCCTT TGCTGCGGCA
AAAACCCCGA CAATGGTTGT TTTAGGCCGC AATGACGATG TGACACTGGC AGCCGCCGCA
ACCGTGCTTG GCAAATTTGC CGCATCGGCC CGGCAAACCC TTGCCCTCGA AATGGCGTCA
TCCCCCCCAC GCTTTTCCGG TCGCAATGCG ATTTTCATCG GAGCCGCACC GGACCTTCCA
CCCCAGGCAT TTACCAATGT GCAGCTGCAA ACCAAGGGTG CCGGAGATGG CGATCTCAGC
ACCGACGTAA AGCTCAAATC CTGGCGCGAC AAAATCGATT CCGGCACGGT CATGGAATTT
TTTTCCAGCG TCGACACCTG GATGAAAGAA ACCTTCGATC TCAATCTGGC GACCGTCAGC
CTGTGGCCAA GTGCCGAGGC CGCCTATACA TTGCCCCAGG CATCCAGTGC ATTTCTTTCC
CAATCCGTCG ATGCCGACGG TGCCATCCTG ACGACATTCA GCGCACCTGA TCGCGCATTG
TTACAAACCG GCACGGTCGA TATCGCCGAT CTTCGCAATT GGACCGCGAT CTCGGGCCGC
GTCAGCGTTT ATGATGCGAG AAAACAGGAG ATCGTCGTCA ATGAGCCGAC CAGTGTCAGC
CTGTTGTCGA TGCAGCCCTT GAGCATCAGC AACATGCGAT TGGTCGCGGC AAACTGGCTC
TCCGGCAATT TCGTGATCTA TGCGGGCGGG CTCGTCCTGT GCGCCATCCT GCTGGGGCTT
GCCACGAATG CGTTGATGGC GCTGCTGGGG CGCGGCCACG GCTCAGACAA GAGGTAA
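
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be stripped before any computation. As a minimal sketch, the helper below (a hypothetical name, not taken from the source database) joins the blocks and computes the percentage of G and C bases, which is one common way a GC content figure like the one reported in this record is derived. How the database itself rounds or computes that value is not stated here.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(block_text: str) -> float:
    """Percentage of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(block_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input.
    demo = "ATGAGCCCCG AAAGCAATAC"
    print(len(clean_sequence(demo)), gc_percent(demo))
```

To apply this to the full gene, paste all lines of the gene sequence into a single string and pass it to `gc_percent`.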
 
Protein sequence
MSPESNTMPG APIPGLPATK APVPVPTMPI PGIPLSNRPA QQQSESLSRR YLLTTPVLKL 
QGELARQAMS IYLTPEQASA EAKLVLSYSN ALVVAPEASN ILITINDTPV LNLPIRGGQS
AQQNTISVPR GVLVPGFNRI SFAAEQRHRT DCTIQSTYEL WTEVDASKTY LTFADPNANG
MKRLDDVRAL GVDEKGRSHF TIVSPDFEQP SATPALLQLA QGLSLLSGVA NGSFSFTRTM
PARPAPGELV VVAATTSQMA GLIGINNAVN ADTRKLNGTM AGFMALPGRP GMSVLAFSGP
DWRDIRAIAD NFVQFAQDGE RRMLKTSAWR GVDTPILDKA GALSFAALGL PDQEFSGRRF
FTDFTVGMPA DFYANHYGTA TILLDAAYSS KVLPGSQIVV SVNGHLATTV PITSKGGAVL
RHYPIRVTLR HFHPGVNVIG LEAVMMTQED SVCAPGETAD KTARFALFGS SQFVMPPIAH
LTQAPDLAAT AGLGFPFAAA KTPTMVVLGR NDDVTLAAAA TVLGKFAASA RQTLALEMAS
SPPRFSGRNA IFIGAAPDLP PQAFTNVQLQ TKGAGDGDLS TDVKLKSWRD KIDSGTVMEF
FSSVDTWMKE TFDLNLATVS LWPSAEAAYT LPQASSAFLS QSVDADGAIL TTFSAPDRAL
LQTGTVDIAD LRNWTAISGR VSVYDARKQE IVVNEPTSVS LLSMQPLSIS NMRLVAANWL
SGNFVIYAGG LVLCAILLGL ATNALMALLG RGHGSDKR