Gene Veis_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_0944 
Symbol 
ID4693580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1057692 
End bp1058960 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content63% 
IMG OID639848722 
Productextracellular solute-binding protein 
Protein accessionYP_995740 
Protein GI121607933 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.544942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCCG CACCACTTTG CAGCGCAGCG CTGGCCTTCG GGCTGGCGAC GGCAGCGCTC 
GCCCAAGAGC CCAAGGTGAT TACCGAATGG GACATTCAGA CCCAGCCCGG GGGCTCCAAG
CTGATACAGG AGGCGCAGGC GCGCTTCGAG AAGGCCAACC CCGGCTTCAA GGTGCAGCGC
ACGCAAATCC CCAACGACGC CTACAAGACC AAGCTGAAGA TCGCGATGGG GGCCAATGAG
CCGCCATGCG TGTTCACGAG TTGGGGCGGC GGGGTGCTGC GCGAGTACAT CAAGGCCGGT
CAAGTCGTCG ACCTCGGTCC TTACCTGGCC AAAGATCCCG CGTTTCGTGA GCGCTTCCTG
CCCAGCGCCT TCGACGCCAT CACCTGGCAG GGCAAAACCT ACGGCCTGCC GGGGGAGAAC
ACCACCGCAG CCGTGATTTA CTACAACACC GAGATCTTCG CCAAGTTCGG GCTCGCGCCG
CCCAAGACCT GGCCCGAACT GATGAAGCTC GTCGAGACGC TCAAGGCCAA CGACGTGGCC
CCGTTTGCCC TGGCCAACAA GGCCAAGTGG CCCGGTTCGA TGTACTACAT GTACCTCGTC
GACCGCATTG GCGGACCGGA GGTGTTCCGC AAAGCCATTG CCCGCGCGCC GGGTGGCAGC
TTTGCCGACC CGGCCTTCGT CGAGGCCGGC AAATATCTGC AAGAACTGGT CAAGGCCGGC
GCCTTCGCGC AGGGCTTCAA CGGCCTGGAC TACGACATTG GCGCAGCGCG CAGATTGCTG
TACTCGGGCA AGGCCGCCAT GGAACTGATG GGAACCTGGG AATCATCGAA CATCAAGAAC
GAAAACCCGG AATTCGCCAA AAAGGTGGAC TTCTTCCCGT TCCCCGGCGT GCCGGGCGGC
AAGGGGCAGG CGGGCAATGT CGTCGGCTCC GTGGGGCAAA ACTTCTACAG CATATCGACG
GCCTGCAAGA CGCCCGAGGC GGCCTACCAG TTGATCACGA CGATGCTCGA CGAGGCCTCG
GTCAAGGCGC GCCTGGCAGA CAAGCGCCTG GTGCCGGTCA AGGAACTGAC GATCGCCGAT
GCCCCGATGC AGCGGGTGAT GCAACTGGTG GCCGACGCGC CGGCCGTGCA ACTGTGGTAC
GACCAGGAAC TGCCGCCGCA GTTGGCCGAA CTGCACAAGG ACACGGTGCA GGCCCTGTTC
GGGCTGTCGA TCACGCCCGA AGAAGCAGCG CAAAAGATGC AAGCGCTGGC CGCGCAAATC
CTCAAGTAG
 
Protein sequence
MNPAPLCSAA LAFGLATAAL AQEPKVITEW DIQTQPGGSK LIQEAQARFE KANPGFKVQR 
TQIPNDAYKT KLKIAMGANE PPCVFTSWGG GVLREYIKAG QVVDLGPYLA KDPAFRERFL
PSAFDAITWQ GKTYGLPGEN TTAAVIYYNT EIFAKFGLAP PKTWPELMKL VETLKANDVA
PFALANKAKW PGSMYYMYLV DRIGGPEVFR KAIARAPGGS FADPAFVEAG KYLQELVKAG
AFAQGFNGLD YDIGAARRLL YSGKAAMELM GTWESSNIKN ENPEFAKKVD FFPFPGVPGG
KGQAGNVVGS VGQNFYSIST ACKTPEAAYQ LITTMLDEAS VKARLADKRL VPVKELTIAD
APMQRVMQLV ADAPAVQLWY DQELPPQLAE LHKDTVQALF GLSITPEEAA QKMQALAAQI
LK