Gene Avi_5349 details

Gene Information

Locus tag: Avi_5349
Symbol: dppA
ID: 7380705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 347965
End bp: 349584
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 57%
IMG OID: 643648966
Product: ABC transporter substrate binding protein (dipeptide)
Protein accession: YP_002547203
Protein GI: 222106412
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
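
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 347965
end_bp = 349584
gene_length_bp = 1620
protein_length_aa = 539


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

If either comparison failed for another record, the likely causes would be a different coordinate convention or a partial or spliced gene.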


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.60385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTC GTCTATTCGG CGGAGCTGTG GCCGTGTCGG TTGCTGCTTT GCTCACCAGC 
CCGGCTGTTG CTTTTGAAGG CAGAAGCGTT GTGGCGCCCG ATTGCAACTA TGGTGGCAAG
ATCAAGTCCA TCGTGGCGAC CGATGAGCAC ACGGTGACCT TCTCCATGTG CTCGCCCGAT
CCGGCCTTCA AAGCCAAGGC GGCTTTCGTA CCCTTCGGCA TCCAGCCGGC CAAGCATATC
GAAGAGGCTG GTCCGAAGAA GAAGCTGCTC GACAATCCGA TCGGCACTGG GCCGTTCAAG
CTGGAAAGCT GGAATCGCGG CGATTCCATC ACCATGACCC GCAACGAGAA TTACTGGGGT
GCCAAGCCGG CTTTCGACAA GCTGGTGTTT CGCTGGAATC AGTCCGGCGC GGGCCGCCTG
AATGAATTGC GCTCCGGCAC GGTCGATGAA ATCACCAATA TCAGCCCGGA TGATTTCGAC
AGTGTCAAGA ACGATCCGGA CCTGCAATTC CTGCCGCAGG AAAGCCCGAA CATTCTCTAT
CTCGGCATGG TCAACACCGC CAAGCCTTTT GACAATGAGA AGGTGCGCCA GGCGATTGCC
ATGGGCATCG ATCGCCAGCG CATCGTCGAT AATTTCTATC CAAAAGGTTC GGTCGTCGCC
AGCCATTTCA CGCCCTGTTC GCTGCCCAAT GGCTGCGCTG GCAAGGATTG GTACGGGTTT
GATGCGAGCG CGGCTAAAAA ACTGCTGGCC GATGCCGGAT ACCCGAATGG GTTCAAGACC
AAGATCTACT ACCGCGATGT GTTCCGCGCT TACCTGCCGG AACCAAGCGT CGTGGCTGTC
GAATTCCAGA CGCAGCTGAA GAAAAATCTC GGCATCGATG CGGAAGTGGT TCCGATTGAA
TCGGGTAAAT TCATTGATGA TACCTCCGCT GGCCGGATCG ATGGGCTCTA TCTGCTGGGT
TGGGGGGCTG ACTATCCGCA TGTCACCAAC TTCCTCGATT ATCACTTCGG CAAGACATCG
AAAATGTTCG GCACCACTTT CCCGGAAATT ACCGAGGGGT TGACCAAAGG CGGAACGATT
GCTGAGACAA AGACCGCCGA ACCGATCTAT GCGGCCGTCA ACGATGCCAT TCGCCAGCAT
GTGCCGATGG TGCCGATTGT CCATGGCGCC GCCGCCTATG CCGCTCGGGC GACCTTGAAG
AATGCCATCG TCCGCCCCTT TGGCTCGCCG TTGTTGCAGG ATTCCAATCC GGGTAAGGAT
ACGCTGGTCT TCATGCAGAA TGCCGAGCCG ATCAGCCTCT ATTGCGGCGA TGAAACGGAT
GGCGAAACGC TGAATGCCTG CACGCCGATT ACGGAAGCGC TGCTGGATTA TGCAAAGGAC
AGCGGCGATA TTGTTCCCGC TCTGGCCACC AGCTGTGATG CCAATGCGGA TTCGACCGTT
TGGACCTGCA AGCTGCGGAC CGGCGTGAAA TTCACCGACG GCTCTGATTT TACCGCCAAT
GACGTGGTGG TATCCTGGGC GGCGGGCATT GATGCATCCA ATCCGGCCCA TGTCGGCAAT
ACCGGCTCCT TCGACTATTT CTCCTCCCTC TGGGGCGGAT TGATGAACGC CAAGAAGTAA
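
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation such as the GC content reported in the Gene Information section. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input. Pasting the full block into `clean_sequence` would give the figure to compare against the GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGAAATTTC GTCTATTCGG CGGAGCTGTG GCCGTGTCGG TTGCTGCTTT GCTCACCAGC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```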
 
Protein sequence
MKFRLFGGAV AVSVAALLTS PAVAFEGRSV VAPDCNYGGK IKSIVATDEH TVTFSMCSPD 
PAFKAKAAFV PFGIQPAKHI EEAGPKKKLL DNPIGTGPFK LESWNRGDSI TMTRNENYWG
AKPAFDKLVF RWNQSGAGRL NELRSGTVDE ITNISPDDFD SVKNDPDLQF LPQESPNILY
LGMVNTAKPF DNEKVRQAIA MGIDRQRIVD NFYPKGSVVA SHFTPCSLPN GCAGKDWYGF
DASAAKKLLA DAGYPNGFKT KIYYRDVFRA YLPEPSVVAV EFQTQLKKNL GIDAEVVPIE
SGKFIDDTSA GRIDGLYLLG WGADYPHVTN FLDYHFGKTS KMFGTTFPEI TEGLTKGGTI
AETKTAEPIY AAVNDAIRQH VPMVPIVHGA AAYAARATLK NAIVRPFGSP LLQDSNPGKD
TLVFMQNAEP ISLYCGDETD GETLNACTPI TEALLDYAKD SGDIVPALAT SCDANADSTV
WTCKLRTGVK FTDGSDFTAN DVVVSWAAGI DASNPAHVGN TGSFDYFSSL WGGLMNAKK
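
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact amino acid string that NCBI publishes for translation table 11, whose amino acid assignments match the standard code; `translate` is a hypothetical helper, and the sample input is only the first line of the gene sequence.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the table 11 amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence; the result matches the first 20 residues above.
first_line = "ATGAAATTTC GTCTATTCGG CGGAGCTGTG GCCGTGTCGG TTGCTGCTTT GCTCACCAGC"
print(translate(first_line))  # MKFRLFGGAVAVSVAALLTS
```

This sketch does not handle the alternative start codons that table 11 permits (for example GTG or TTG read as methionine at the first position); this gene starts with ATG, so that case does not arise here.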