Gene Avi_7212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_7212 
Symbol 
ID7380370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011981 
Strand
Start bp181186 
End bp182781 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content55% 
IMG OID643641317 
ProductABC transporter substrate binding protein (oligopeptide) 
Protein accessionYP_002539614 
Protein GI222102575 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCA CCAGACGCGA CTTTCACCGC ATTGCCCTTG CCGCAGGGGC CTTCACGGCG 
CTGCCCGGCG GGGCCTTTGC CGCCGCAGAA GAGCCAATTT CCGGCGGAAC GCTGAAGATT
GTTTATTTTC CCGAGCCAAC CCAGTTGGTG GCCATCAACA CCAGCTCGGG CGGTGCGCAA
TTCATCGGAG CCAAAATTTT CGATGGTTTG CTGACCTATG ATTACGACCT GACGCCCAAA
CCAGCGCTGG CCAAGGAATG GAGCATTTCA CCTGACGGTA AGACCTATAT CTTCCATCTG
CGCCCCAATG TAACCTTCCA TGATGGCAAG CCGTTGACAT CGGCGGATGT CGCCTTTTCG
ATTTTCCGCC TCAAGGAAGC CCATCCGCGG GGACGGGCTA TTTTTGCCAC CGTGGAAAGC
ATTGATACCA GCGATCCGCT GGTGGCAAAG CTCATCCTTT CCAAGCCGAC GCCAGCGCTG
ATCACCGCCC TGTCTGGCTC TGAATCACCC ATTGTGCCAA AGCACATTTT CGAGACCTTC
AAGCCCACCG AAAATCCAAA GCCGCAGCAG ATTATTGGCA GCGGCCCCTT TGCGCTGAAG
GAATGGGTGC CGGGCAGCCA TATTCTGCTG GAACGCAATG CTGCCCATTG GGATGCGCCA
AGGCCCTATC TCGACCGCGT TATTCTGAGG CTGATCAATG ATGCCAGCGC TCGCTCGGCA
GCCCTTGAAA CCGGGGAGGC CGATATTGGT CCCAATCCGG TTCCGCTTTC TGATCTGGAT
CGGTTGAAAA GCGTTGCGAC ACTCAAGGTC GATGATCAGA TCTATGCCTA TGCGGGCCAG
CAGAACCAGT TGGTGATCAA TCTGGAAAAC AGCTACCTGA AGGAGCAGAA AGTCCGCCAA
GCCATTGCCC ATGCCATCGA TGTGAAGGCG CTGATCAACA TCGTGCTCTA TGGCTATGGC
ATCCCCTCAC CCACGCCGAT CAGCCCCGGT CTGGCGAAAT TTCACAATCC CGATATTGGC
TTTGCCAAAT ATGATGTGGC CTTAGCCGAA AAACTGCTGG ATGAAGCGGG CTTTCGCCGT
CAAGCAAACG GCAAGCGTTT CAAGCTAAGG GTCACGACCA ACCCGTTTAA TCCGGCGAGC
TATTCAGACT TTATCGCACA AGCGCTGATC AAGATCGGCA TTGAGGCGGA TATTCAAAAA
TTCGATTTCG GCACCTATGT CAAAGTGGTC TATACGGACC GCGCCTGGGA TCTTTCGGTG
GAATCGCTCT CCAACACATT TGATCCCACC GCTGGCGTGC AGCGGGTCTA TTGGAGCAAG
AATTTCAAAA TCGGCCTGCC CTTTTCCAAC GCCAGCCACT ACGAAAACCC GGAGGTGGAC
CGCCTGCTGG AAGCGGCTGC GGTGGAGCCG GACATTGAAA AACGCGCCCA ATTGTTTAGG
GATTTTCAGG TGATCATCGC CCGCGATCTT CCCGTGATCA ATCTCGTTTC CCCCATCCAA
CCCGTGGTGG GCAACAAGCG CGTGCGCAAT TACGCTTATG GCGCGGAAGG TCTGATTGGC
AACCTTGCCT ATGCCAGCCT TGCAAAAAAC AGCTAA
 
Protein sequence
MTLTRRDFHR IALAAGAFTA LPGGAFAAAE EPISGGTLKI VYFPEPTQLV AINTSSGGAQ 
FIGAKIFDGL LTYDYDLTPK PALAKEWSIS PDGKTYIFHL RPNVTFHDGK PLTSADVAFS
IFRLKEAHPR GRAIFATVES IDTSDPLVAK LILSKPTPAL ITALSGSESP IVPKHIFETF
KPTENPKPQQ IIGSGPFALK EWVPGSHILL ERNAAHWDAP RPYLDRVILR LINDASARSA
ALETGEADIG PNPVPLSDLD RLKSVATLKV DDQIYAYAGQ QNQLVINLEN SYLKEQKVRQ
AIAHAIDVKA LINIVLYGYG IPSPTPISPG LAKFHNPDIG FAKYDVALAE KLLDEAGFRR
QANGKRFKLR VTTNPFNPAS YSDFIAQALI KIGIEADIQK FDFGTYVKVV YTDRAWDLSV
ESLSNTFDPT AGVQRVYWSK NFKIGLPFSN ASHYENPEVD RLLEAAAVEP DIEKRAQLFR
DFQVIIARDL PVINLVSPIQ PVVGNKRVRN YAYGAEGLIG NLAYASLAKN S