Gene Avi_5357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5357 
SymboldppA 
ID7380712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp356002 
End bp357510 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content57% 
IMG OID643648973 
ProductABC transporter substrate binding protein (dipeptide) 
Protein accessionYP_002547210 
Protein GI222106419 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0991682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGA ACAGAGTTTT GACCGCGCTG CTCGTCACCG CCGCTTTTGC CGCACCTGTG 
ATGGCCGCTG ACCTGAAGAT CGGCTTGTCG GAAGACCCTG ACGTGCTCGA TCCGGCGCAG
TCGCGCACGT TCGTGGGGCG TATTGTCTAT ACGGCCATGT GCGACAAACT GGTTGATGTC
TCGCCCGACT TGAAAATCAT CCCGCAGCTT GCCACCGGCT GGGCCTGGAG CGAGGGTGGC
AAGGTTTTAA CCATGACGCT GCGCCAGGGC GTGAAATTTC ACGACGAAAC ACCCCTCAAC
GCCGAAGCTG TTGTCGCCAC CATCCAGCGC AACATGACCA TGCCGGAATC GCGCCGCAAG
AGCGAGCTTG CCTCGGTTGA AAAGGTCGAG GCTATCGGCA GCGATACGGT GAAATTTACC
CTCAAGGCCC CGGATTCGAC CCTGCTGGCG CAATTGTCCG ACCGGGCCGG GATGATCGTC
TCGCCCAAGG CCGCCAAGGA ATTGGGCGCT AATTTCGGCA GCCATCCCGT CTGCGCCGGT
CCATTCAAAT TCGTCGAGCG GGTGCAGCAG GACCGGATCG TGCTGGAGAA ATTTGCCGAT
TACTGGAACA AGGATCAGAT CTTCATCAAC AAGCTCACCT ATCTGCCGAT TGTCGATAGC
ACGGTGCGCC TCGCCAATTT GCGCTCCGGT GATCTCGATA TGATTGAGCG GGTGGCGCCG
ACCGATGCCG CCTCCATCAA ATCCGATGGC AAGCTGGACT TTGAACAGGC AGTGGGCGTT
GGTTACATGG CGATGTATGT TAATATCGGC AATGGTCCTC GCGCCAATAA CCCGCTTGGC
AAGGACAAGC GCCTGCGGCA GGCCTTCTCG CTGGCCATCG ATCGCGAAGC ACTGAATCAG
ATCGTCTTTG AGGGGACGGC TCTGGCGGGC AATCAGCCAT TTCCGCCTGT GAGCCCCTGG
TATGACAAGC GGATTCCCGT TCCTGCCCGC GATGTCGAAA AGGCAAAAGC CCTGGTCAAA
GCCGCTGGAT TTGACCGGGT GCCGATTGAG TTGCAGGTGT CCAACAGCGC GACGATGTTG
CAGATGATGC AGGTCGTGCA ATCGATGGTC GCTGAGGCTG GTTTTGACGT GACGCTGAAG
ACGATGGAAT TTGCCACCAT GCTCAACGAG CAGACGACTG GAAATTACCA AATCAGCCGT
TCGGATTGGT CCGGGCGTGT GGACCCGGAT GGCAATCTGC ATCAGTTCGT CACCTGCAAG
GGTGGCATCA ACGATACCAA ATATTGCAAT CCCGCCGTCG ATACGCTGTT GAACGAAGCG
CGGCAATCCA CCGACGACGC CGTGCGCAAG CAGAAATATG ATGCGGCCGA TGAAATCCTG
AATGACGACC TGCCGATCAT CTATCTCGGT CATCAGTCCT GGCTCTGGGC GTCGAGCAAG
AAGATCACCG GTTTTGTGCC GTCACCGGAT GGAATGATCC GGCTGACGGG CATGCAAAAG
GCCAATTGA
 
Protein sequence
MKLNRVLTAL LVTAAFAAPV MAADLKIGLS EDPDVLDPAQ SRTFVGRIVY TAMCDKLVDV 
SPDLKIIPQL ATGWAWSEGG KVLTMTLRQG VKFHDETPLN AEAVVATIQR NMTMPESRRK
SELASVEKVE AIGSDTVKFT LKAPDSTLLA QLSDRAGMIV SPKAAKELGA NFGSHPVCAG
PFKFVERVQQ DRIVLEKFAD YWNKDQIFIN KLTYLPIVDS TVRLANLRSG DLDMIERVAP
TDAASIKSDG KLDFEQAVGV GYMAMYVNIG NGPRANNPLG KDKRLRQAFS LAIDREALNQ
IVFEGTALAG NQPFPPVSPW YDKRIPVPAR DVEKAKALVK AAGFDRVPIE LQVSNSATML
QMMQVVQSMV AEAGFDVTLK TMEFATMLNE QTTGNYQISR SDWSGRVDPD GNLHQFVTCK
GGINDTKYCN PAVDTLLNEA RQSTDDAVRK QKYDAADEIL NDDLPIIYLG HQSWLWASSK
KITGFVPSPD GMIRLTGMQK AN