Gene Avi_7341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_7341 
Symbol 
ID7380553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011981 
Strand
Start bp321179 
End bp322459 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content56% 
IMG OID643641416 
Productsugar ABC transporter sugar-binding protein 
Protein accessionYP_002539713 
Protein GI222102674 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAGGC AGAAATTTGG GATATCGCTG ACCATTGCCG CTGCGGCACT GGGAATGGTC 
ACTGTCATGG CAGGCGGCGC GTTCGCACAA TCGGCAGCCC CCGTCACCTT GAAATGGGCG
CTGTGGGACT GGGACAAGGT GGCCTATTAC AAGCCGCTGA TAGAAGCCTA TCAGGCCAAG
CATCCGAATG TGAAATTCGA GCCGGTGGAT CTCGGTTCGC AGGACTACAC GCAGATGATC
GCCACCCAAC TGACCGGTGG CGCCAAGGAT ATCGACGTCG TCACCATCAA GGACGTGCCG
GGCTACGCGA CCCTGGTGCG GGCCAACTCT ATCGGCGATC TCTCCGGTTT CATGACCGAG
CAGAAGATCG ACAAAGCTAG ATATGGCGGG CTTATCGAGG AGCTGAGCAT TGACGGCAAG
GTCTATGCAA TACCCTTCCG CTCTGACTTC TGGGTGGTCT ATTATAACAA GGACATCTTC
GACAAAGCGG GCGTTTCCTA CCCGACCAAT GACATGACCT GGACACAGTT CGACCAGATC
GCAGTTAAGC TGAAGGGCGG CATGGGGGTC AACAAGACCT ACGGCGCATT GCTGCACACA
TGGCGTTCGA CCGTTCAGCT TCCGGGCATC ATGGATGGTC AGCATACGCT GGTCGGTGGC
GATTACGCTT TTCTGAAGCC CTGGTATGAG CGGGCGCTGA AGCTTCAAAA GGAAGGTGCG
ATCCCGTCCT ATGCATCGCT GAAAACCTCC AATACCCATT ATTCGGCGCT GTTCTTCAAC
GGGACGGTCG GTATGCTGCC GATGGGGACC TGGTTCATCG GCACCCAGAT CGCCAAGGTA
AAGTCCGGTG AATCCAAGAG CAAGAATTGG GGCATCGTCA AATTCCCGCA CCCAGACGGC
GTTGCAGCCG GCACAACGGC GGCGCAGATT GCGGCTCTTT CGGTCAATAA CAACTCAGCC
CACAAAGACG TGGCGCTTGA CTTCATCAAG TTCGTGACTG GACCTGAAGG TGCGGCAATC
ATTGCCGATA CGGGAACTTT GCCAGCGGTG CGCACAGACG ATGTCAGCAC CAAGATCACC
TCGCTGCCCG GCTTCCCGCA GGACGAAAAC AGCAAGGCGG CGCTTAAAGC CGGCAAGTCC
TATCTGGAAA TGGCGGTCAG TCCCAATGCA GCAAAAATCG AGGTCGTGCT GAACCGTGTG
CATGACGCGA TCATGACAGA CAACACATCT ATCGACGATG GCCTGAAGGA GATGAACGAC
GGCGTCAAGG CGATCAAATA G
 
Protein sequence
MYRQKFGISL TIAAAALGMV TVMAGGAFAQ SAAPVTLKWA LWDWDKVAYY KPLIEAYQAK 
HPNVKFEPVD LGSQDYTQMI ATQLTGGAKD IDVVTIKDVP GYATLVRANS IGDLSGFMTE
QKIDKARYGG LIEELSIDGK VYAIPFRSDF WVVYYNKDIF DKAGVSYPTN DMTWTQFDQI
AVKLKGGMGV NKTYGALLHT WRSTVQLPGI MDGQHTLVGG DYAFLKPWYE RALKLQKEGA
IPSYASLKTS NTHYSALFFN GTVGMLPMGT WFIGTQIAKV KSGESKSKNW GIVKFPHPDG
VAAGTTAAQI AALSVNNNSA HKDVALDFIK FVTGPEGAAI IADTGTLPAV RTDDVSTKIT
SLPGFPQDEN SKAALKAGKS YLEMAVSPNA AKIEVVLNRV HDAIMTDNTS IDDGLKEMND
GVKAIK