Gene Avi_6014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_6014 
Symbol 
ID7380841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp1017460 
End bp1019673 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content62% 
IMG OID643649516 
Productexopolysaccharide polymerization/transport protein 
Protein accessionYP_002547747 
Protein GI222106956 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.807866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGG ACATCGACAT TTTTCAAATC CCCGGGATCT TGCGACGGCG CTGGTATTAC 
CTGGCGTTTT TTGCCGCGCT GTTTGCCGGT CTGGCGCTGC TCTATGCGCT CAGCCTGAAG
CCTGTCTATG TATCGTCCAC CCAGATCCTG CTCGATCCGC GCGGCCTGTC TGCCACCAGT
AGCGACAGCC GACAGCCGAC AGTCGCCGTA CAAAGCGACC CGGCCAGCCT CGACAGCCAG
ATCTATGTCG TCCTGTCCAG CGCCGTGCTG GGGGAAGTGG TCAACCGGCT CGACCTGACG
AAAGACTCCT ATCTCTATGC GGGAAAGCCA AGCAGCGCCG TATCCCCCGC TGAGGTCATG
GCTGCCACCA TCGGCGGGCT GGTCCGGCAT GTGAAAGTCG AGCGCGAAGG CCAGTCCTTC
ATCATGTCGA TCACGGTCGA ACACCGCATC GCCAAAACGG CAGCCGACAT TGCCAATATG
ATTGCCACCG TCTACCTGAA ACAGGTGGAC GAAGCCCGCT CCGACGCAGC GCGCCGGGCA
AGCGCCGCCT TCCAGGCGCA GGCCAGTGAA TTGCGCGATC GGGTGCTGAA GGCCGAAAGG
GCGGTCGAGG AATTCCGATC CGCCAACGGT CTGGCCAGCA CCGGCGTCAC CGGACTGGTG
ATCGACCAGC AACTAGCAGG CCTGAACCAG CAGTTGATCG CAGCACGCGG CGCGGAAGAA
CAGCAGCAGG CCATTTACCA GCAGACCCGC AATCTCACGG TCGCTGCCGT TGAAAACGGC
AATATCCCAG AAGCGGTACA ATCCACTACT GTCGGGCTGC TGCGCGACCG CTATGTCCAG
CTACAGGACC GCCAGGCCGA AGCGTCCGCC AATCTCGGCG GCAACCATCC GCAACTGAAG
GCGATCAATT CGCAGGTGGC CAGCATGCGC CAGGCCATCC AGCAGGAGCT GGACCGGGTG
CGCCAGTCGA TGAAGCTCAA CTATGACCGG GCGGTTGCCA ACCGCAAGGC GCTGGAAACC
CAGCTGCAAA GCCTGACGAA AACCAGTTTC GACAGCGGGG CGCGGCAGAT CACCCTGCGC
CAGCTGGAAA GCGAGGCGGA AGCCATCCGC ACCATTTACA AGGCCTTCCT CAACCGCGCC
GAGGAGCTGA GCCAGGAACA GACGATCTCC ATCAACAATT CCCGGGTTAT CACCGAGGCA
GTGGCGACAG CGAAATCGGT CACGACCCTC AAAGTGATGA TCCTTGCCGC CGCCATCCTG
TTTGGTCTGG CCTTCGGCAG CACGCTGGCG GTGGTGCTGG AACTCCTGTC GCGCAAGGAT
GTCGCGCCGC AAGCAGGCAT CGCCGCGCCT GCCACGGCGA CCTCATCGCC TCCTGGCAAA
GGCGAACCAC CCCCAGCACC GCCCATTGCG TCAGCAAGAC ATATCGCTCT GATTGCCGAC
GCTACGGAAC CTGAAAAACA GAAGTCCCGC AATCCGTTCA GCTTTATCAC CGCCTTTGGC
CGTCGGCTGG TCTCGCCTCT TGTTCCCGCA TCCAACCCGG CAACCGCATC ACAACCGGCG
GCAGGCGGCG CCTGGTCCCA TGCGGTCGCC AGCACAGCCG GTTTTCTGAT TGAATGCGGT
GAAGGCTATG CTGATCTGAC CGTTCTGTTT GTTGCAGCGG GCAGGCCGGT GGCCAGCGCC
TTTATTGGCG ATGTGGCACA GAAGCTGGCT GATCGGGACC GCGGCGTGCT GCTGGCCAAT
GGTGCCATGC TGGACCACCG CTTAGCGATC CGCTCGCACA GAAAAACCAA TGCCCGGCCA
AGCCTTGCCC AGGCATTGCA GCAGCCCGAT CTCGAAGACG CACCTCTCTC GCACATCCTG
CGCTACGAGC GCATCGCCCT GCCCCGCAAC CAGCCAGCGC GGCCAGCAGC GGGAGCATCG
GCCTCCTCCG GTCGCCCGAC CTATTCGCGA TTTGTCGAGC AAAGCCTTCA GGCGGAAACC
GATTTCACCT TGATTAATGC CTGCGGCGCT TGGATCAATG CCCGCGGCGC CGGAATCAAT
GCTAGCGGTG CTGGTGGCGA GCAGCATCTT ACTGCCCTTG CCGCCGAGGC CGATGTCATT
CTCGTCTTGA CCACGGCGCA AGACGAGGCC GCCGCCCTGG ATGAACTGCT GATGCGGCTG
GGCGAAGACG CAGAACGGGT CGTGGGTCGT ATCGTGCTCG AGACCGCCGG ATGA
 
Protein sequence
MKMDIDIFQI PGILRRRWYY LAFFAALFAG LALLYALSLK PVYVSSTQIL LDPRGLSATS 
SDSRQPTVAV QSDPASLDSQ IYVVLSSAVL GEVVNRLDLT KDSYLYAGKP SSAVSPAEVM
AATIGGLVRH VKVEREGQSF IMSITVEHRI AKTAADIANM IATVYLKQVD EARSDAARRA
SAAFQAQASE LRDRVLKAER AVEEFRSANG LASTGVTGLV IDQQLAGLNQ QLIAARGAEE
QQQAIYQQTR NLTVAAVENG NIPEAVQSTT VGLLRDRYVQ LQDRQAEASA NLGGNHPQLK
AINSQVASMR QAIQQELDRV RQSMKLNYDR AVANRKALET QLQSLTKTSF DSGARQITLR
QLESEAEAIR TIYKAFLNRA EELSQEQTIS INNSRVITEA VATAKSVTTL KVMILAAAIL
FGLAFGSTLA VVLELLSRKD VAPQAGIAAP ATATSSPPGK GEPPPAPPIA SARHIALIAD
ATEPEKQKSR NPFSFITAFG RRLVSPLVPA SNPATASQPA AGGAWSHAVA STAGFLIECG
EGYADLTVLF VAAGRPVASA FIGDVAQKLA DRDRGVLLAN GAMLDHRLAI RSHRKTNARP
SLAQALQQPD LEDAPLSHIL RYERIALPRN QPARPAAGAS ASSGRPTYSR FVEQSLQAET
DFTLINACGA WINARGAGIN ASGAGGEQHL TALAAEADVI LVLTTAQDEA AALDELLMRL
GEDAERVVGR IVLETAG