Gene Veis_0139 details

Gene Information

Locus tag: Veis_0139
Symbol:
ID: 4690882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 159578
End bp: 161086
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 65%
IMG OID: 639847924
Product: extracellular solute-binding protein
Protein accession: YP_994949
Protein GI: 121607142
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
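
The coordinates and lengths in this record agree with one another if Start bp and End bp are taken as 1-based, inclusive positions and the final codon is a stop codon. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(159578, 161086)
print(length_bp)                  # 1509
print(protein_length(length_bp))  # 502
```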


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.184027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAACC GCCGCACCGT TCTTGCCACT GGCGTCTCGG GCGTCACTGG CGCCATCATC 
GCCTTGCCGC TGACGGCCCC GCTCACCGCC CTGGCGCAGG GCCGCAAAGA CGCCGTGGTG
CTGGGCATGA CGCTGGAACC CCCGAGCCTG GACCCGACCA CCAGCGCCGC ATCGGCAATC
GCCGAAGTCG TTCACTACAA CATCTTCGAG ACGCTCACCA AGATCAACAC CGACGGCAGC
GTCGCGCCTT TGCTGGCCGA GAGTTGGGAG GTCTCGCCCG ACCTGAAGAC CTACACCTTC
AAACTGCGCC GCGGCGTGAA GTTCCAAAAC GGCGAGCCTT TCACGGCGGC CACCGTGAAG
TTCGCGTTCG ACCGCTCGGG CAGCGACAAG AGCAGCAACA AGAGCAAGAG CACCTTCGCC
AGCCTCACGA CCCAGGTGGT CGATGACTAC ACCGTGGTGC TGCTGAACAA GGACATAGAC
CCGGACCTGC TGTTCGTGCT CGGCCAGGCC CCGTTCGCCA TCGTCGAGCC CAAGAGCGCA
GACCGCAACG CCAGCCAGCC CGTGGGCACC GGCCCTTACC GGCTCGGCAG TTGGAACCGG
GGCTCTGCGG TGCTGCTGAC CGCATGGGAG GGCTTTCGCA ACCCCGCTGC GATCAAGATC
AAGCGCGCCA GCTTTCGCTT CATTTCAGAC CCCGCCGCCC AGGTGGCCGC GCTGCTGGCC
GGCGACGTCG ACGCCTTCCC GCGCATCACG AATCGCGGCG TGGCGCAGTT CAAGAGCGAC
CCGCGCTTTC AGGTCGTGAT CGGCGGCTCA CGCGCCAAGA CCATCCTGGC CATCAACAAC
GCCAGAAAGC CGCTCGATGA TCTGCGCGTG CGCCGCGCCA TCGCCGCCGC CATCGATCGC
AAGGCGGTGA TCGCAGCGGC CACCGAGGGC TATGGCGTAC CCATCGGCAG CCACTATGTG
CCCGGCGCCT TCGGCTATGT CGACACCACC GGTGTGAACC CCTACGACCC TGGCAAGGCC
ATCCGGCTGT TGGCCGAAGC CGGCGTCAAA ACACCGCTGA CGCTGGGCAT GACGCTGCCC
CCCACCCCCT ATGCCCGCCT GGGCGGCGAG GTGATCGCAG CGCAACTGGC CAAGGTGGGC
ATCATTGCCA AGATGGAGAA CGTGGAATGG GCGCAGTGGC TCAGCGGCAC CTATGGCGGC
AAGAACTACG ACCTGACCGT CATCGCGCAT GTGGAGCCGT TCGATCTGAA CAACTTCGCC
AACCCCGACT ACTACTGGGG CTACCAATCG CCCGCATTCA ATGCGCTGTT CGACCAGATC
AAGAACACCG CCCGGCCCGC CGAGCGCGCG CGCCTGCTCG GCCAGGCGCA ACGCCTGCTG
GCCGATGACG CAGTGCATGC CTTTTTGTAC CAGGGCCAGT GGGTCACCGT GGCGAACAAA
AACCTCAAGG GGCTGTGGAA AGACATGCCG GTGTTCGTGA ACGACATCTC GGCGCTCTCC
TGGAGCTGA
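
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming the figure is the plain percentage of G and C bases over the whole CDS. The helper name `gc_content` is hypothetical, and the example call uses only the first 10-base block of the sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence only; pass the full sequence
# (spaces and newlines are tolerated) to check the reported figure.
print(gc_content("ATGCTGAACC"))  # 50.0
```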
 
Protein sequence
MLNRRTVLAT GVSGVTGAII ALPLTAPLTA LAQGRKDAVV LGMTLEPPSL DPTTSAASAI 
AEVVHYNIFE TLTKINTDGS VAPLLAESWE VSPDLKTYTF KLRRGVKFQN GEPFTAATVK
FAFDRSGSDK SSNKSKSTFA SLTTQVVDDY TVVLLNKDID PDLLFVLGQA PFAIVEPKSA
DRNASQPVGT GPYRLGSWNR GSAVLLTAWE GFRNPAAIKI KRASFRFISD PAAQVAALLA
GDVDAFPRIT NRGVAQFKSD PRFQVVIGGS RAKTILAINN ARKPLDDLRV RRAIAAAIDR
KAVIAAATEG YGVPIGSHYV PGAFGYVDTT GVNPYDPGKA IRLLAEAGVK TPLTLGMTLP
PTPYARLGGE VIAAQLAKVG IIAKMENVEW AQWLSGTYGG KNYDLTVIAH VEPFDLNNFA
NPDYYWGYQS PAFNALFDQI KNTARPAERA RLLGQAQRLL ADDAVHAFLY QGQWVTVANK
NLKGLWKDMP VFVNDISALS WS
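
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this using the amino acid assignments that translation table 11 shares with the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative. The sketch does not handle alternative start codons, which table 11 would read as M at the first position; that does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCTGAACC GCCGCACCGT TCTTGCCACT"))  # MLNRRTVLAT
print(translate("TGGAGCTGA"))                         # WS
```

Both outputs match the start and end of the protein sequence listed in this record.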