Gene Avi_9144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_9144 
Symbol 
ID7367694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011984 
Strand
Start bp108061 
End bp109584 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content58% 
IMG OID643644339 
ProductABC transporter substrate binding protein (oligopeptide) 
Protein accessionYP_002542636 
Protein GI222083233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.215146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCT CTCGTCGTGG TTTCGTAGGC GGTTCGCTTA CACTTCCATT TGTCGGAATG 
GCCGCCAAGA CCGCGTTGGC GCAGACGCCC GGCAAGTATC TTCGTTATGG CCTCAACAAT
TTTCCGGCCA ACCTGACGCC ATGGGTGAAT GCCGGCGCCG CCGCCGGCAC GTTCATGAGC
CTTGTCCACA GGGGCCTGTT CTCATTTGCC CCGGATGGAA GTCTGCAGGG AGAGCTTGCC
GAGAGTTTTG AAAATGACGG CAACAAGGTC TTCACGTTCA AGCTGCGCAA GGCGACCTAT
CACGACGGTC AGCCGCTGTT GGCAGAAGAC GTAAAGTGGA CGTTGGAACA GGTCGCAGCC
AAGGATTCGA CCGCTTACTT CCGCGCTCAG TTTCAGGAAG TCGCCTCGAT CGAGACTCCG
GATGAGCGGA CCGTCAAGGT CATCATGAAA AACCCGTCGG TGACGGTGAC CCAACTCCTG
GCGACCTACT ATATGCCGAT CCTGAAAAAG GGGACGACGA AGGAAAACAA CGCCAACGGG
ACGGGTCCCT TCAAGCTCGT CAACATTGAG CGTGGCTCCT ACATCGACGT CGAAGCCTTC
GACAAGTATT ACAAGCCGGG CCTGCCGAAA GTTTCGAAGA TCCGGCTGCA GGCATTTCCG
GACGAAAACC TGCGTGTCGC GGCCCTCCAG ACAGGTGACA TTGACATGAC GGAATACGCC
CCATGGTGGG CGATCGACAA CCTTGAGAAA GATCCCAATC TCAAGCTGGA TACGACACCG
GGCGCCTTCA TGTACCTGAT GTTCAACGGC ACGCAGGGGC CGATGGCCAA CCCACGCGTT
CGCCAGGCTG TTGCCTTCGC GATGAAGCGT GAGGCCATGG TCCAGGCGGC ATTCTATGGG
CACGGTGAAC CCCTGAGGGG TCTTCCTTTC TACAAGGGCA CTCCCTATTT TGATGCCCTG
CGCTCGAATT TCTGGACCGA AGACCTGGCG AAGGCCAAGG CACTCCTGTC GGAAGCCGGC
TTTCCGAACG GATTTTCGTG CAATCTCCTG TCAGCATCCG ACGTGGCGAT CCAGAAGGCC
ACAGCGGAGG TTGTGCAACA GGGTCTGGCG GCAATCGGCA TCCAGGCGCA GCTCAACCTT
CCGGACTTTG CAACGCGTGT TTCGCTCGGC AACAAGGGTC AGTTCGATAT CGGCGTGAAC
GGAACCGCTT GCGACAACAA TGACCCGGAC GGGATCACAT CCATCGTCGA CGATTCCCTG
TCGCCCGCCT ATACCCGCAG CCTCAACATT AAGACGCCGG GCCTGGCTGA GATGTTGGCC
AGGGGTCGTG GGGAAAGCAA TCTCGAGGCC CGCAAGGGTA TCTATGCCGA GGTCGAAAAA
CTCGTCTACC AAAACACCCC CTATGTGGGC TTGACCTGGC GCAACCAGTC CTATGCGCTG
CGCAAGAATG TCCAGGGCTT CAACAACCTT CCCGGAGCGC TCACATTCTT CTCCGGAAGA
ACCCTGGAGA GCACGTCGCT TTAG
 
Protein sequence
MNISRRGFVG GSLTLPFVGM AAKTALAQTP GKYLRYGLNN FPANLTPWVN AGAAAGTFMS 
LVHRGLFSFA PDGSLQGELA ESFENDGNKV FTFKLRKATY HDGQPLLAED VKWTLEQVAA
KDSTAYFRAQ FQEVASIETP DERTVKVIMK NPSVTVTQLL ATYYMPILKK GTTKENNANG
TGPFKLVNIE RGSYIDVEAF DKYYKPGLPK VSKIRLQAFP DENLRVAALQ TGDIDMTEYA
PWWAIDNLEK DPNLKLDTTP GAFMYLMFNG TQGPMANPRV RQAVAFAMKR EAMVQAAFYG
HGEPLRGLPF YKGTPYFDAL RSNFWTEDLA KAKALLSEAG FPNGFSCNLL SASDVAIQKA
TAEVVQQGLA AIGIQAQLNL PDFATRVSLG NKGQFDIGVN GTACDNNDPD GITSIVDDSL
SPAYTRSLNI KTPGLAEMLA RGRGESNLEA RKGIYAEVEK LVYQNTPYVG LTWRNQSYAL
RKNVQGFNNL PGALTFFSGR TLESTSL