Gene Avi_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3888 
Symbol 
ID7388568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3247268 
End bp3248920 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content54% 
IMG OID643652635 
ProductABC transporter substrate binding protein (oligopeptide) 
Protein accessionYP_002550816 
Protein GI222149859 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.799639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGA CATTGACGCG CCGCAGGCTG ATGACGACGG CCAGTGCTGT TCTGGCATTG 
GGCGTTGCGG GGGGCGTTAC GGGCGTGCCG CGCGCTTTTG CCGCCACGCC GAAAGATACC
CTGGTCGAGG CCTGGGCGTT TGACGATATC ATTACCATGG ACCCGGGCGA GGCTTTCGAA
ATCTCTGCCG CCGAAGTCAC CGGCAATACC TATGATACGC TGGTAAAGCT GAATATTGCC
GATACCTCGA CCGTGGCTCC CGGCATTGCG GAAAGCTGGA GTGCGTCTGA AGACGGCCTG
ACCTATACGT TCAAGATCAA GTCCGGCATC AAATTCGCGT CCGGCAACCC GATCACTGCG
GAAGATGTGG CCTATTCTTT TGAACGGGCC GTCAAGCTCA ACAAGAACCC GGCCTTCATC
CTCCAGCAAT TCGGCCTGAG TGGCGAGAAT GTCGCGGAAA ATGCCAAGGC AGTCGATGCC
TCGACCTTCC AGTTCAAGGT GGACAAGCCT TATGCGACCA GCTTCGTGCT GAACTGCCTG
ACCGCAACCG TCGGTGCCAT CGTTGACAAG AAACTGGTGC AGAGCCACGC GGCAGCGGTG
ACGGTTTCCA AGGATTATCC TTATGACACC GATTTCGGCA ACGGCTGGTT GAAGACCAAT
TACGCTGGTT CCGGTCCTTT CAAGCTGCGC GAATGGCGTG CCAATGAATT GGTCGTGATG
GAGCGCAATG ATAATTATTA TGGTGAAAAG GCTAAGCTTA AGCGGGTCAT CTACCGCTTC
CTGAAGGAAA GCTCAGGCCA ACGTCTGGCG CTGGAATCCG GTGACGTCGA TGTGGCGCGT
AATCTTTCGC CGACGGACTT TACCGCGATT GCCAATGCGA AAAATATCAA GACAGATTCT
GCCCAGAAGG GAACGGTCTA TTATCTCGGT CTCAACCAGA AAAACCAGTA TCTCTCCAAG
CCGGAAGTCC GCCAGGCCAT CAAATATCTT GTCGATTACG ATGCCATCGG CGCAACCCTC
ATCAAGGGTG TCGGCACCGT GCATGAATCC TTCCTGCCCA AGGGTATTCT TGGCTCGATC
GATGACCAGC CCTACAAGTT GAATGTTGCG AAAGCCAAGG AATTGCTGGA AAAGGCTGGC
CTCAAGGATG GCTTCAAAGT CACCATGGAT GTGCGCTCCA TCGAGCCGAT GACCAGCATT
GCCCAATCCA TGCAGCAAAC TTTCGCACAG GCGGGCATTA CGCTGGAACT CATCCAAGGA
GATGGTAAGC AGACGCTGAC CAAATACCGC GCCCGCAACC ATGACATCTA TATCGGGGAT
TGGGGTGCTG ATTATTGGGA CCCGCATTCC AATGCCGAAA CCTTTACCAG CAATCCTGAC
AATTCCGACA CGCCGAAATC CAAGACACTC GCCTGGCGCA ATGCCTGGGA TGTGCCTGAA
TTGACCAAGG AAACCAGCGA AGCGCTGTTG GAGCGGGACA CGCCGAAGCG CAAGGCGATG
TATGAAGACT TGCAGCGCAA GGTTCTGGCC GACGGTCCGT TTGTGATCAT CTATCAAAAA
ACGGAAATTG CCGGCTATAG GGCCAATGTT CAAAATTATA AGCTTGGCCC CACCTTCGAC
AGCAACCTCA AAAACTTGAT TTCCAAGGAT TGA
 
Protein sequence
MTMTLTRRRL MTTASAVLAL GVAGGVTGVP RAFAATPKDT LVEAWAFDDI ITMDPGEAFE 
ISAAEVTGNT YDTLVKLNIA DTSTVAPGIA ESWSASEDGL TYTFKIKSGI KFASGNPITA
EDVAYSFERA VKLNKNPAFI LQQFGLSGEN VAENAKAVDA STFQFKVDKP YATSFVLNCL
TATVGAIVDK KLVQSHAAAV TVSKDYPYDT DFGNGWLKTN YAGSGPFKLR EWRANELVVM
ERNDNYYGEK AKLKRVIYRF LKESSGQRLA LESGDVDVAR NLSPTDFTAI ANAKNIKTDS
AQKGTVYYLG LNQKNQYLSK PEVRQAIKYL VDYDAIGATL IKGVGTVHES FLPKGILGSI
DDQPYKLNVA KAKELLEKAG LKDGFKVTMD VRSIEPMTSI AQSMQQTFAQ AGITLELIQG
DGKQTLTKYR ARNHDIYIGD WGADYWDPHS NAETFTSNPD NSDTPKSKTL AWRNAWDVPE
LTKETSEALL ERDTPKRKAM YEDLQRKVLA DGPFVIIYQK TEIAGYRANV QNYKLGPTFD
SNLKNLISKD