Gene Veis_2011 details


Gene Information

Locus tag: Veis_2011
Symbol:
ID: 4692688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 2285200
End bp: 2286642
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 63%
IMG OID: 639849777
Product: extracellular solute-binding protein
Protein accession: YP_996781
Protein GI: 121608974
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
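
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values recorded here, but the page does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2285200, 2286642)
print(length)                              # 1443
print(expected_protein_length_aa(length))  # 480
```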


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.427162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAG CACTCAAAGC CCTCGCACGG GCATGCCACG ATCGGCGGCT GGAGCGTCGC 
GAGTTTCTGG CGCGCGCCAG CGCCCTCGGT TTTGGCAGCA GCGCCGCCGG CCTGATGCTC
AATGCCGTGT CGACCCGGGC CCTCGCCCAG GACGGCGGCG TCGACTTCAT GAAGCACAAG
GGCAAGACCG TCAAACTGCT GCTGAACAAG CATCCGTATG TGGATGCGAT GGTCAAGAAC
ATCGAGAACT TCAAGGCCCT GACCGGCTTG AATGTCAGCT ACGACATCTT TCCGGAAGAT
GTCTACTTCG ACAAGGTGAC CGCAGCCCTG GCCAGCAAGA GCAGCCAGTA CGACGCTTTC
ATGACCGGCG CCTACCAGAC CTGGAAGTAC GGCCCGGCGC GCCAGATCGT CGACCTGAAC
CAGTACCTGC AAGACCCCAA GCTCACCTCG GCCAACTACG CCTGGGAGGA TATCTACCAG
AACCTGCGCG CCGCCACGTC CTGGGACGGC AAGGCCGGCT CCGCACTCGG CGGCCCGGGC
GCCAAGCAAT GGGCCTTGCC CTGGGGCTTC GAGCTCAACA GCCTGGCCTA CAACAAGCGC
CTGTTCGATG CGCTGAAACT GGGCGTGCCG ACCCACCTGG CGGACCTGGC GGACAAGGCC
GCCAGCATCA GCAAGAGCGG CAAGGGCTAC GGCATCGGCG TGCGCGGCTC GCGCAGTTGG
GCCACGATCC ATGCCGGCTT TTTGTCGGCG TACACCAACT TCGGCAACAA GGACTTCCAC
AGCGCGGGCG GCAAGCTGAC GCCGGCAATG AACACGCCGC AGAGCAAGCA GTTTCACCAG
CAGTGGATCG ACATGATCAA GAACGGCGGG CCGAAGAACT GGACCAACTA CACCTGGTAT
GAGGTCGGCA ACGATCTGGG CGCGGGCAAT AGCGCGATGA TCTACGACGC CGACATCATG
GGCTACTTCT TCAACAGCGG CAGCAACAAG GAAGCCGGCA ACCTGGCCTA CGCCGCGTTC
ACGCCGAACC CGGCCGCCAA GGCGCCCACG CCCAATATCT GGATCTGGTC GCTGGCGATG
AGCGAGTTCT CCAAACAAAA GGAGGCCGCC TGGTTCCTGC TGCAATGGGC CACCGGCACG
CAGAACACCA CCTTCGGCGC CACCCAGGGC GACTTGGTGA ACCCGGTGCG CAAATCGGTC
TGGGAAAACG CCCAGTTCAA GGAGCGGCTG GACAAGTCCT ACCCCGGCTA CCTGCGGCAA
TACCAGGCCA GCGTGGAGGG CGCGAAGATC TACTTCACGC CGCAGCAGTT GTTTCCCGAA
TTCACCACCG AGTGGGCGTC GATGCTGCAA CAGATGTACG GCGGCACGGT GCCGGTCGGC
GAGGGGCTGG ACAAACTGGC CGAGACGCTG ACCCGCAAGC TCAAGGGCGT GGGCCTGGCC
TGA
 
Protein sequence
MSEALKALAR ACHDRRLERR EFLARASALG FGSSAAGLML NAVSTRALAQ DGGVDFMKHK 
GKTVKLLLNK HPYVDAMVKN IENFKALTGL NVSYDIFPED VYFDKVTAAL ASKSSQYDAF
MTGAYQTWKY GPARQIVDLN QYLQDPKLTS ANYAWEDIYQ NLRAATSWDG KAGSALGGPG
AKQWALPWGF ELNSLAYNKR LFDALKLGVP THLADLADKA ASISKSGKGY GIGVRGSRSW
ATIHAGFLSA YTNFGNKDFH SAGGKLTPAM NTPQSKQFHQ QWIDMIKNGG PKNWTNYTWY
EVGNDLGAGN SAMIYDADIM GYFFNSGSNK EAGNLAYAAF TPNPAAKAPT PNIWIWSLAM
SEFSKQKEAA WFLLQWATGT QNTTFGATQG DLVNPVRKSV WENAQFKERL DKSYPGYLRQ
YQASVEGAKI YFTPQQLFPE FTTEWASMLQ QMYGGTVPVG EGLDKLAETL TRKLKGVGLA
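
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below, in Python, shows one way to strip the whitespace, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon table is the standard one, which NCBI translation table 11 shares for elongation; table 11 differs only in the start codons it permits, and this sketch does not model those. Input containing characters other than A, C, G, and T raises a `KeyError` in `translate`.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11
# for elongation). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCTGAAG CACTCAAAGC CCTCGCACGG"
print(translate(first_blocks))  # MSEALKALAR
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the recorded GC content and protein sequence.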