Gene Veis_4121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4121 
Symbol 
ID4695052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4520121 
End bp4521686 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content62% 
IMG OID639851868 
Productextracellular solute-binding protein 
Protein accessionYP_998844 
Protein GI121611037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.157439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACTCA AGCTATCGAT GCTGTTCGCC GTCGTCTTGG CCGCGGCATC GGGCGGCACT 
TGCGTGCTGG CCAAAACCGC CAAAGACATA TTGGTGATCG GAAAATCGGC CGATCCGCAA
AACCTGGATC CGGCCGTCAC GATGGACAAC AACGACTGGA CGGTCACATA CCCGGCGTAC
CAGCGTCTGG TGCGCTACAA GGTCCATGAC GGCAAGGCTT CCAGCGAGTT GGAAGGCGAT
TTGGCGCAAA GCTGGAGTAG TTCGCCAGAC GCGATGACCT GGGAATTCAA GCTCAAGCCG
GGCAGCAAAT TCGCCGATGG TTCGCCGGTC GATGCCCATG CGGTGAAGTT CTCCTTCGAG
CGTTTGCTGG CCTTGAAAAA AGGCCCTTCC GAACCCTTCC CCCCGGGCAT CGAGGTCAGC
GCGCCCGACG CGTCGACCGT GCGCTTCAAG CTCAAGACCG GCTTTGCGCC TTTCCTGTCG
ATCCTGGCCA TCGATGGCGC GTCGGTGGTC AACCCGAAAG TGATGCAATA CGAACAAAAC
GGCGACAAGG CCCAGGGCTG GCTGGCGGGG CACACCATGG GCAGCGGCGC CTTCCAACTG
AGCAGTTGGC AAAAGGGCCA AAGCATCGTG ATGGACAAAA GCCCGCATCC GAATGGAGCG
GCGCCGGCCT TCAACAAAGT GATCATCAAG TTCGTGCCCG AGGCCTCGGC GCGCCGCCTG
CAACTGCAAG GCGGCGACAT GGACATTGCC GAAGACCTGC CGCCCGACCA GATCGAAAGC
CTGAAAGCGC AACAGGGCCG CCAGGGCGTC GTGGTGGGCG ACTACCCGAG CTTGCGCGTC
ACCTACCTGT ACCTGAACAA CAAAAAAGCG CCGCTGGACA AGCCCGAGGT GCGCCGCGCC
ATCATCGCCG CCGTCGATGT GCGCGCCATC ATCGACGGCA TTTTCTCGGG CAAGGCCAAG
GCCATGAACG GGCCCATTCC CGAAGGCATG TGGGGGCACG ACGCGCAGGC TGCGCCCGCA
GCCTTTGCGC CGGCCAAGGC CAGGGAACTG CTGGCCAAAG CCGGGCTGCG CAATATCCGG
CTGGGCTTTT TGCTGTCGGA CAAAGACCCT TCGTGGAGCC CGATCGCGCT GGCCACGCAG
TCCAACCTGG CCGATGTCGG CATCCAGGTG CGCCTGGAAA ACATGGCCAA TGCCAGCTTT
CGCGAACGTG TCGGCAAGGG CGACTTCGAT ATCGCCATCG GCAACTGGAG CCCCGACTTT
GCCGACCCCT ACATGTTCAT GAACTACTGG TTCGAGAGCG ACAAGCAGGG GGCCGCCGGC
AACCGCTCCT TCTACTCCAA CCCCCGGGTC GATGCGCTGC TGGCCAGAGC GGCCCATGCG
AGCGCCTTGT CCGAGCGCAG CAGGCTGTAC CAGGAGGCGC AAAAAATCGT GGTCGACGAT
GCGGTCTATG TCTACCTGTT TCAGAAAAAC ACCCAGATCG CCGCGCGCAG CAGCGTCAAG
GGGCTGGTGT TCAACCCGAT GCTCGAGCAG ATCTACAACG TCCAGCAGAT GTCCAAGTCC
GAGTAG
 
Protein sequence
MKLKLSMLFA VVLAAASGGT CVLAKTAKDI LVIGKSADPQ NLDPAVTMDN NDWTVTYPAY 
QRLVRYKVHD GKASSELEGD LAQSWSSSPD AMTWEFKLKP GSKFADGSPV DAHAVKFSFE
RLLALKKGPS EPFPPGIEVS APDASTVRFK LKTGFAPFLS ILAIDGASVV NPKVMQYEQN
GDKAQGWLAG HTMGSGAFQL SSWQKGQSIV MDKSPHPNGA APAFNKVIIK FVPEASARRL
QLQGGDMDIA EDLPPDQIES LKAQQGRQGV VVGDYPSLRV TYLYLNNKKA PLDKPEVRRA
IIAAVDVRAI IDGIFSGKAK AMNGPIPEGM WGHDAQAAPA AFAPAKAREL LAKAGLRNIR
LGFLLSDKDP SWSPIALATQ SNLADVGIQV RLENMANASF RERVGKGDFD IAIGNWSPDF
ADPYMFMNYW FESDKQGAAG NRSFYSNPRV DALLARAAHA SALSERSRLY QEAQKIVVDD
AVYVYLFQKN TQIAARSSVK GLVFNPMLEQ IYNVQQMSKS E