Gene Avi_5331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5331 
Symbol 
ID7380692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp333017 
End bp334312 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content58% 
IMG OID643648954 
ProductABC transporter substrate binding protein (sugar) 
Protein accessionYP_002547191 
Protein GI222106400 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.998389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA ACCGACGCAC CGTGGTGAGC GGTCTGGCTT TGGGCCTCGC TGCTGCCGGG 
CTTTCCACGC CCGTGCTGGC TGCCGACGAA GTCACGCTCA ACGTGCTTTA CAATCTGCCG
GGCTTCACGA AATTCCATCA GCCGCTGGCC GATGCGTTCA TGAAGAACAA TCCGAATGTA
AAGATCAATT TTCTGGCGCC CGCTCCAGGC TATAACGAGG GTCAGCAGCA GGTCCTGCGC
GCTGCCGTGA CCGGCAATCT GCCGGATGTT TATTTCTCAG GCTTCAACCT GACCGCGGAG
CTGGTTCACA CACTGGCACC CCGCAACCAG ATCACCGATC TGGCGCCCTT CATCGCGGCG
GAAGGCGGCC AGGCCTTCCT CGACAAAAAT TACAACCCGA AAATGGCGGC CCTCGGCCAG
ATCGATGGCA AGCAATACGG CCTCCCCGTC AATGCCTCCT CGCCAATCAT CTATATCAAT
GCTGATCTGG TAAAGAAGGC TGGCGGCGAT CCGGACAATA TGCCGAAAAC CTTTCCCGGA
CTGATCTCGC TGGCCAAGAA TATCCACGCG CTCGATCCGA AAATCTCCGG CATGGGTTAC
GACATCAATG GTTGGCCGGA TGACTGGCTT TGGCAGGCAT TGGTTCTCGA GCAGGGCGGC
ACATTGGTCA ACGAAAAGAC CAAGACTGTG GCTTTTGACA ACGAGATTGG CCTCAATGCT
CTGAAAATGG TTCGCCAGTT CGTGACCGAG GGTGGTCAGA CCCTGCTCGA CTGGGACCAG
TCCCGTCAGC AATTTGGTGC TGGTCTCACT GGTTTCATAT TCTCGACACC GGCCCATGTT
CAGACGATCG AGGGACTGGT GGGCGACCGT TTCAAGCTGA AGACGGCAAC CTTCCCGCTG
GACAACCCGG AAAAGGGTGG CGTACCGACG GGCGGCAACT CAGCCGTGAT CCTGACGCAG
GACAAGGCCA AGCAGGACGC CGCCTGGAAA TATCTGAAAT GGATCACCGG GCCTGAGGCG
CAGAACACCA TCGTGCGGAT CACCGGCTAT CTGCCGACCA ACAAGCTTGC CACCGGTGCC
GACTATCTTG CGCCTTATTA TGCCGAGCAT CCGAATGTAA AGACCGCCTC GCTCCAGGCA
GACCGGTCCT TGCCTTGGGC CGGTTACCCA GGCGGCGATT CCGTTCGCGT CTGGCGCACC
CAGCGCGACA TTATCGGCAC GGTCATGCGC GGTGAAGTGA CGCCGGAGGT TGGCCTGAAG
CAGATGGTCG ACCAGACCAA CGCCTTGTTG AAATAG
 
Protein sequence
MKINRRTVVS GLALGLAAAG LSTPVLAADE VTLNVLYNLP GFTKFHQPLA DAFMKNNPNV 
KINFLAPAPG YNEGQQQVLR AAVTGNLPDV YFSGFNLTAE LVHTLAPRNQ ITDLAPFIAA
EGGQAFLDKN YNPKMAALGQ IDGKQYGLPV NASSPIIYIN ADLVKKAGGD PDNMPKTFPG
LISLAKNIHA LDPKISGMGY DINGWPDDWL WQALVLEQGG TLVNEKTKTV AFDNEIGLNA
LKMVRQFVTE GGQTLLDWDQ SRQQFGAGLT GFIFSTPAHV QTIEGLVGDR FKLKTATFPL
DNPEKGGVPT GGNSAVILTQ DKAKQDAAWK YLKWITGPEA QNTIVRITGY LPTNKLATGA
DYLAPYYAEH PNVKTASLQA DRSLPWAGYP GGDSVRVWRT QRDIIGTVMR GEVTPEVGLK
QMVDQTNALL K