Gene Veis_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4044 
Symbol 
ID4694163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4433010 
End bp4434047 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content64% 
IMG OID639851791 
Productextracellular solute-binding protein 
Protein accessionYP_998767 
Protein GI121610960 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACC AATGCCTGCT CGCCTGCGCC GCCCTGTGTG CCGGCATGGC CCTGCCCGGT 
CTGGCGCAGC AGTCCATCAC CGTCGTGAAC TTTGGCGGCG CCGCCGCCAA CGCCCAGAAG
AAAGCCTATT ACGAGCCTTA CGAGAAGCAG ACCGGCAGCA AGATCGTGGC GCTGGAGTAC
AACGGCGAGC AGGCCAAGCT CAAGGCCATG GTCGAGGCCA AAAAAGTCAC TTGGGATGTG
CTCGAAGTCG AGACCCCCGA CGCCGTGCGC GGCTGCGATG AAGGGCTGTT CGAGAAGATC
GACTACAGCC GGATCGCCAG CAAGAACGAG CTGATGCCTG ACGCCATCAC CGACTGCGCC
GTGGGTTTCC TGGTGTGGTC GACCGTGATG GCCTACAACG GCGACAAGCT CAAGACCGCC
CCCGGCGGTT GGGCCGACTT CTTCGACACG CAAAAGATTC CCGGCAAGCG CGGCATGCGC
AAGGGCGCCC GCTACAACCT CGAATTTGCG CTGCTGGCCG ATGGCGTCAA GCCCGCCGAT
GTGTACCCGC TGCTGGCCAC CCGGGAGGGC GCCGACCGGG CCTTCAAAAA GCTCACCGCG
CTCAAGCCCC ATATCCAGTG GTGGGCCGCC GGCGCGCAGG TGCCGCAGTT CCTGGTGGCC
GGCGATGTGG TGCTGAGCAC GGCCTACAAC GGGCGCATCG ACGCGGCCAA CCGCGAAGGG
CGCAACCTTC GCATCCATTG GCCCGGCAGC ATCTACGACC TGGAATACTG GACCATCCCC
AAAGGCGCGC CGAACAAGGA TGAGGCGCTG AAATTCATCG CCTTCAGCCT GCAGGCCGAC
AACCAGGCGG TGTACGTGCG GCAAATCGCC TATGGCCCGA CCAACACCAA GGCCATGGCC
CAACTCGACG CAAAGACCCT GGAACGACTG CCCACCTCGG CCAACAATGC CCGGCAAGCG
CTGCGGTTCG ACGTGGGTTT CTGGGCCGAC CAGGGCGAGA TGCTGGAAAA GCGCTTTGCC
GTCTGGGCCA CACAGTGA
 
Protein sequence
MKNQCLLACA ALCAGMALPG LAQQSITVVN FGGAAANAQK KAYYEPYEKQ TGSKIVALEY 
NGEQAKLKAM VEAKKVTWDV LEVETPDAVR GCDEGLFEKI DYSRIASKNE LMPDAITDCA
VGFLVWSTVM AYNGDKLKTA PGGWADFFDT QKIPGKRGMR KGARYNLEFA LLADGVKPAD
VYPLLATREG ADRAFKKLTA LKPHIQWWAA GAQVPQFLVA GDVVLSTAYN GRIDAANREG
RNLRIHWPGS IYDLEYWTIP KGAPNKDEAL KFIAFSLQAD NQAVYVRQIA YGPTNTKAMA
QLDAKTLERL PTSANNARQA LRFDVGFWAD QGEMLEKRFA VWATQ