Gene Avi_3931 details


Gene Information

Locus tag: Avi_3931
Symbol:
ID: 7387287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 3297309
End bp: 3299189
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 60%
IMG OID: 643652671
Product: ABC transporter substrate binding protein (oligopeptide)
Protein accession: YP_002550852
Protein GI: 222149895
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
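
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3297309
END_BP = 3299189
GENE_LENGTH_BP = 1881
PROTEIN_LENGTH_AA = 626


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of codons in an unspliced CDS, excluding the final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 3299189 - 3297309 + 1 = 1881 bp, and 1881 / 3 = 627 codons, of which one is the stop codon, leaving 626 amino acids.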


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCG CCTTTCTGCC CCGCCCGCTT CTGGCGGCGT CTCATCCAAG TGAACCGGAA 
CTACTGAAAG CCAAGGTCGA GGCTGGTCAA CTACCACCAA TGGCTGAGCG CCTGCCGAAA
GTGCCGCGTG TCGTGCCCCT TGAAGGGCCT GAGCGAGCGC CCGGACGCTA TGGCGGGACG
ATCCGCATGT TGATCGGCGG CCAGCGCGAT ATCAGATATA TGCCGATCAA TTGTTACTGT
CGGCTGATCG GCTATGATCT CGCCTTCAAT TTCCAGCCGG ATCTGCTGGA ACGGTTCGAG
GTGGTCGAGG AGCGGATCTT TACCTTCCAC CTGCGCGACG GCCATCGCTG GTCGGACGGT
TCGCCGTTTA CGGCAGAGGA TTTCCGCTAT GTCTGGGAGG ATATGTTCCA TGACAAAAAG
CTTTACAAGG GAGGAATTCC CACGGTCTTC CGGGTCAATG ACAAGGAGCC GGTCTTTGAG
GTGCTGGACG CGCTGACCGT TCGTTATACC TGGGAGGACC CTAATCCGGA TTTCCTGGCC
GAGCTTGCCG CCCCGACCGC AACGCGTCTG ATGATGCCCG GCGCCTATCT CAAGCAGTTC
CACCACAAAT ACCAGACCAA GGAAAAGCTG GAGGAGCTGG CCATACGAAA CGGCGTGCAG
GAATGGGTGG CGCTGCATCA GAAAATGTCG CGCATCGTGC GCCCGGAAAA TCCCGATCTG
CCGACGCTGG ATGCCTGGAT CCCCCGGACC GCCCCACCAT CGGGCCAGTT CATCTTCGAG
CGCAATCCTT TCTATCACCG GGTGGACGAA AACGGGGTGC AATTGCCCTA TATCGACAAG
GTGGTGATGG GCGTTGGTTC CGGCGACCTG ATTTCGGCCA AGACCGGCAC GGGCGAAAGC
GACCTGCAAT TCACCAATCT GGATTTTGCC GATTATACCT TCCTGAAAAA CGCCGAGAAA
CGCTATCCGA TCAAGGTCGA TCTGTGGAAG CGCACCCAAG GGGCGCGCAT TGCGCTGATG
CCGAACCTGA ATTGCAAGGA TCTGGTCTGG CGACAGGTGC TGCTTGATGT CCGGGTGCGC
CGGGCGCTGT CCTTGGCAAT CAACCGCGAA GAGATCAACA AGGCGGTGTT TTTCGGCCTG
GCTCAGGCTT CGGCCAATGC CATTCTGCCC GAAAGTCCGC TGTTCAAGCC GGAATATCGC
GATGCTTGGG CTGGGTTCGA CCCCGATCAG GCCAATGCGC TGCTGGATGA GGCGGGCCTT
GAGCGCCCTG ACCGGCATGG CCTTCGCCTG TTGCCGGATG GGCGCGAGGC GGTGCTGATT
GTCGAGAGCA CCGGGGAAAG CTCCTTCGAT TCCGACGTGC TGGAACTGAT CACCGATCAC
TTCCGCAAGA TCGGTTTCCG GCTTTTCGTG CATATCTCGC ATCGCGAGCT GTTTCGCCGC
CGCATCACCA ATGGCGAGAC TGTGATGGCC GTCTGGCAGG GATTGGACAA TGGCGTGCCG
ACCGCCGACA TGTCGCCACG CGAATTGGCG CCGACCAGTG ACGATCAGTT GCAATGGCCG
CTCTGGGGCC TGCACACGCT GTCGGGCGGT GGCGATGGCC GGGCACCGGA AATTGCCGAG
GCGCGGGCGT TGCTCGACCT GTTTCGGGCC TGGCGGCAGT CCGATCTGCT GGAGGAGCGC
ACGGCGATCT GGCATGACAT GCTGCGGCTC TATACCGATC AGGTCTTTAC CATCGGCATC
GTCAATTCCG CCTTGCAGCC GGTGGTGAAA TCGCGAAAGC TCAGGAACCT GCCGGAAAAA
GGCCTGTTTG GCTTTGTCCC GACCTCCCAA CTCGGCGTCT ATATGCCGGA TACATTCTGG
TATGATGAGG ATGCGACATG A
 
Protein sequence
MSAAFLPRPL LAASHPSEPE LLKAKVEAGQ LPPMAERLPK VPRVVPLEGP ERAPGRYGGT 
IRMLIGGQRD IRYMPINCYC RLIGYDLAFN FQPDLLERFE VVEERIFTFH LRDGHRWSDG
SPFTAEDFRY VWEDMFHDKK LYKGGIPTVF RVNDKEPVFE VLDALTVRYT WEDPNPDFLA
ELAAPTATRL MMPGAYLKQF HHKYQTKEKL EELAIRNGVQ EWVALHQKMS RIVRPENPDL
PTLDAWIPRT APPSGQFIFE RNPFYHRVDE NGVQLPYIDK VVMGVGSGDL ISAKTGTGES
DLQFTNLDFA DYTFLKNAEK RYPIKVDLWK RTQGARIALM PNLNCKDLVW RQVLLDVRVR
RALSLAINRE EINKAVFFGL AQASANAILP ESPLFKPEYR DAWAGFDPDQ ANALLDEAGL
ERPDRHGLRL LPDGREAVLI VESTGESSFD SDVLELITDH FRKIGFRLFV HISHRELFRR
RITNGETVMA VWQGLDNGVP TADMSPRELA PTSDDQLQWP LWGLHTLSGG GDGRAPEIAE
ARALLDLFRA WRQSDLLEER TAIWHDMLRL YTDQVFTIGI VNSALQPVVK SRKLRNLPEK
GLFGFVPTSQ LGVYMPDTFW YDEDAT
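
The relationship between the two sequences can be spot-checked with a small translation sketch. It is an illustration only, not the tool that produced this record. It builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons it allows. The `translate` helper is a hypothetical name, and it stops at the first in-frame stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_line = "ATGTCCGCCG CCTTTCTGCC CCGCCCGCTT CTGGCGGCGT CTCATCCAAG TGAACCGGAA"
print(translate(first_line))  # MSAAFLPRPLLAASHPSEPE
```

The result matches the first 20 residues of the protein sequence. Applying the same function to the final gene sequence line gives YDEDAT, the last six residues, followed by the TGA stop codon.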