Gene Avi_5398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5398 
Symbol 
ID7381501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp399009 
End bp400220 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content59% 
IMG OID643649009 
ProductABC transporter substrate binding protein (sugar) 
Protein accessionYP_002547246 
Protein GI222106455 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.14652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCGCTG TGCTGGCTCT TGCCACGGCT TGCCCGCTTG CATCGACCGC CCGGGCGGAC 
GACGTGACAC TCAATCTCTG GTCGCTCGAC AAGGATATCC AGCCGGCACC CAATCTGGTC
AAGCAATTCA ACGCGCTCAA CAATGGCATC AAGATCGAGT ATCGGCTCCT TCAGTTCGAC
GATGTCGTGA CGGAAGCGAT GCGCGCCTAT TCAACCGGTC AGGCGCCAGA TATCATTGCG
GTCGATAATC CGGAACATGC GCTGTTTGCC TCACGGGGCG CCTTCCTTGA TCTTACCGAC
ATGATCGCCA AGTCGGATGT GATCAAGCCC GCGAACTATT TTCCGGGGCC GTTGGCGTCT
GTGACCTGGA AAGACCGCTA TTTCGGCGTG CCGAAAGCCA CCAACACCAT CGCGCTCTAC
TATAACCGCG ATATGTTCAA GGCTAAGGGT CTCGATCCAC TCAAGCCACC GCAGACCTGG
GACGAACTTC TGGCCGCTGC CCGCAAGCTG AACGACCCGG CCAAGAATGT CTACGGCCTC
GCGTTTTCCG CCAAGGCCAG TGAAGAGGGA ACGTTCCAAT TCCTGCCCTG GGCGCAGATG
GGTGGCGGCG GTTATGACCA TATCAATGCT CCGGGCGCGG TAAAGGCGCT GGAGACCTGG
AAAACCATCA TGACCGAAAA ACTGGCCTCG CCGGATACGC TAACGCGCGG CCAGTGGGAT
TCGACCGGCA CGTTTAATTC CGGCAATGCC GCCATGGCGA TTTCGGGTCC TTGGGAGCTG
GACCGGATGC TGAAAGAGGC CAAATTCGAC TGGGGTGTCG CTCTGTTGCC GGTGCCAAGT
CCGGGTGCTG AACGGTCGTC GGGCATGGGT GACTTCAACT GGGCGATCTT TTCCAGCACC
AAGCATCCGG CGGAAGCCTT CAAGGCGCTG GAGTTCTTTG CCTCGCAGGA CAAGGACATG
TTCAAGAATT TCGGACAGCT ACCGGCCCGG TCCGATATCG CCATTCCGCC ATCGGGTTCG
CCCCTGAAGG ATGCGGCGCT GCAAGTTTTC CTCGAGCAAA TGAAATATGC CAAGCCGCGC
GGGCCGCATC CGGCATGGCC AAAAATATCC AAGGCGATCC AGGATGCCAT CCAGGCGGCC
CTGACTGGCC AGATGACGCC CAAGGATGCG CTGGACCAGG CGCAAAAGAA GATCAAAGCC
GCCCTGGGTT GA
 
Protein sequence
MIAVLALATA CPLASTARAD DVTLNLWSLD KDIQPAPNLV KQFNALNNGI KIEYRLLQFD 
DVVTEAMRAY STGQAPDIIA VDNPEHALFA SRGAFLDLTD MIAKSDVIKP ANYFPGPLAS
VTWKDRYFGV PKATNTIALY YNRDMFKAKG LDPLKPPQTW DELLAAARKL NDPAKNVYGL
AFSAKASEEG TFQFLPWAQM GGGGYDHINA PGAVKALETW KTIMTEKLAS PDTLTRGQWD
STGTFNSGNA AMAISGPWEL DRMLKEAKFD WGVALLPVPS PGAERSSGMG DFNWAIFSST
KHPAEAFKAL EFFASQDKDM FKNFGQLPAR SDIAIPPSGS PLKDAALQVF LEQMKYAKPR
GPHPAWPKIS KAIQDAIQAA LTGQMTPKDA LDQAQKKIKA ALG