Gene Avi_5039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5039 
Symbol 
ID7381200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp31437 
End bp32531 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content58% 
IMG OID643648714 
ProductABC-type sugar transport system, periplasmic component 
Protein accessionYP_002546951 
Protein GI222106160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.797463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCA GACATATCCT GCAAGCAACC GGTGCCTTGA TACTGCTCAC GGCGGGCAAC 
GCCTGGGCCG ATCCGATGGC AGAAGCCAAG GCGGTGGTCG ATAAATATGC CAGCAAGGTC
ACGACTTGGG ACGGACCGAA AAGCGCCCCA AAACCACAGC CGGGAAAAAC CATTGTTGTT
CTGGCTGGTG ACCTGAAAAA TGGTGGCATT CTGGGTGTCA GCAATGGCGT CGAGGAAGCC
GCCAAGGCCA TAGGCTGGCA GGTCAAGGTG CTGGATGGCG CAGGCTCGAT CGGTGGGCGT
ACCGCAGCTT TCGGTCAGGC CATGGCCTTG AAGCCTGATG CCATCATCAT CGACGGTTTC
GATGCAGTAG AACAGGCACC GGCGCTGGAA CAGGCGAAAG CCAATAAAAT CCCGCTGGTC
GCCTGGCATG CCGGTCCCAC CATCGGACCG GACGAAAAGA ACGGCCTGTT CGCCAATATC
AGCACTGACG CTATGGAAGT CTCCAAGGCC GCCGCCAACT GGGCCTATGT CGATGCCAAA
GGCAAGCCGG GCGTTATCAT CTTTACCGAC TCCACCTATG CCATCGCCAT CGCCAAGGCT
GACAAGATGA AGGCCGAAAT CGAGCGGCTG GGCGGCAAGG TTCTGGCCTA TGTCGATACG
CCGATTGCTG AAACCTCGCA GCGCATGCCA CAGCTGACCA CCTCGCTGTT GCAGAAATAC
GGCGACAGCT GGACCCATAC ATTGGCCATC AACGACCTCT ACTTCGATTT CATGGGACCA
TCGCTGGCCT CGGCAGGGAA GGGGGGAACC GATGCACCGA TCAATGTCGC TGCAGGGGAT
GGTTCCGAAT CCGCCTATCA GCGCATCCGC GCTGGCCAGT ACCAGAAGGT GACGGTGGCC
GAACCCTTGA ACCTGCAAGG CTGGCAATTG GTGGATGAGT TGAACCGCGC CCTCAATGGC
GAAAAATGGT CCGGCTATAT GTCGCCGCTG CATGTGGTGA CAGCCGACAA TGTCGAATTC
GATGGCGGAC CGAAAAACAG CTTCGATCCC GACAATGGCT ATCGGGATGC CTATAAGAAA
ATCTGGGGCA AATGA
 
Protein sequence
MKRRHILQAT GALILLTAGN AWADPMAEAK AVVDKYASKV TTWDGPKSAP KPQPGKTIVV 
LAGDLKNGGI LGVSNGVEEA AKAIGWQVKV LDGAGSIGGR TAAFGQAMAL KPDAIIIDGF
DAVEQAPALE QAKANKIPLV AWHAGPTIGP DEKNGLFANI STDAMEVSKA AANWAYVDAK
GKPGVIIFTD STYAIAIAKA DKMKAEIERL GGKVLAYVDT PIAETSQRMP QLTTSLLQKY
GDSWTHTLAI NDLYFDFMGP SLASAGKGGT DAPINVAAGD GSESAYQRIR AGQYQKVTVA
EPLNLQGWQL VDELNRALNG EKWSGYMSPL HVVTADNVEF DGGPKNSFDP DNGYRDAYKK
IWGK