Gene Veis_4734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4734 
Symbol 
ID4691379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp5223936 
End bp5225501 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content60% 
IMG OID639852477 
Productextracellular solute-binding protein 
Protein accessionYP_999447 
Protein GI121611640 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.676308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAA ATCTCATGAC GAAGGTCGCC CATCGGTTCT GGCTGAGGAT GCTGCCGCAG 
GGCATGGCGT GCTGCGCCTT GCTGCCGGCC TTCGTGCCGG CGCCTGCGTT GGCGGGCAAG
GCCAATGACA CGCTGGTGTA TGCCTCTGAC AGCGAGGCGC CCAACATCAG CCAGTACCAC
AATAACGTGC GCGAAGGCGT GATTCTGGCG CACCTGATCT GGGATACGCT GGTCTGGCGC
GACCCCCGCA CCGGCGACTA CAAGCCGCAG TTGGCCAGCG CCTGGAAGTG GGAGTCGCCG
ACGGCGCTGG TGATGGACTT GCGCCAGGGT GTGCAATTCC AGAATGGCGA TCCGTTCACG
GCGGAGGATG TTGCCTTCAC CTTCAACTAT GCGGTGTCGA GCGAATCCAG GGTCATCACA
CGGCAAAACG TCGATTGGAT CAAGAGCGTC GACAAACTCG GCGACTACAA GGTGCGCATC
AACCTCAAGC AGCCCTTTCC GGCTGCGCTG GAATACCTGG CCGGTCCCTT GCCGATTTAT
CCCGGCGCGT ATTTCAGGAA AGTCGGCCTG GAAGGATTCG CCAAGGCGCC GATCGGCACC
GGCCCTTACA AGGTCGTCAG CGTGACGCCG GGGCGAGGCG TGAGCATGGT CAGGAACGGC
AATTATTTCA AGGACAGTCC GCAAGGCCAG CCGAAGATTG GCAATATCAA ATTTGTCGTC
ATTCCCGACC CTGAAACACG CTCGGCACAA CTGATGACCG GCGCGATCGA CTGGATTTGG
CGCGTGCCCG CCGATCAGGC CGAGTCGCTC AAGAGCACGC CCGGGATCAC CGTGCAAAGC
GGCGAAACCA TGCGCGTCGG TTTTCTGGTG ATCGATGCGG CGGGCAACTC CTCGCCCCAT
TCGCCGTTCA AGGATGTGCG TGTGCGCCAG GCGGTCAACC ATGCGATCAA CCGCCAGGGT
ATCGCCGACA ATCTGGTGCG CGGCGGCAGC AAGCCGGTCT ACACCGCCTG TTTCCGCACC
CAGTTCGGTT GCGATGACAA GGTGGTGGTC CACTATGACT ACAACCCCGC CAAGGCGAAA
GAGTTATTGC GCGCCGCTGG TTACGCCAAC GGTTTCGACA CCGATTTGTA TGCCTATCGC
GAGCGCGAGT TCGCCGAAGC CATCGTCGGC GATTTGCACA AGGTGGGCAT TCGCGCACGG
CTGCATTACA TGAAGCACGA TGCGATGCAG GTCGAGTACC GCGGCGGCAA GGCGCCGATG
ACGTTTTATG CCTGGGGCTC GTACTCGATC AATGACACGT CCGCCTTTAC CGGCGTGTAT
TTCAAGGGCA GCAGCGACGA CATCGTCAAG GACCCGCAAC TGCGCCAATG GCTGGAAACG
GCCGACACCT CGACCGACCC TGCCGTGCGC AAAACGAATT ACGCCAAGGC ATTGGCGTTG
ATTTCCCGGC AGGCTTATCT GGCGCCGATG TTTTCTTATT CCACTTACTA CGCCCATAGC
TCGGCGCTCA GGTTTCAGGG ATATCCCGAC GAGTTGCCGC GTTTTCATGA GGCCAGTTGG
AAGTAG
 
Protein sequence
MNANLMTKVA HRFWLRMLPQ GMACCALLPA FVPAPALAGK ANDTLVYASD SEAPNISQYH 
NNVREGVILA HLIWDTLVWR DPRTGDYKPQ LASAWKWESP TALVMDLRQG VQFQNGDPFT
AEDVAFTFNY AVSSESRVIT RQNVDWIKSV DKLGDYKVRI NLKQPFPAAL EYLAGPLPIY
PGAYFRKVGL EGFAKAPIGT GPYKVVSVTP GRGVSMVRNG NYFKDSPQGQ PKIGNIKFVV
IPDPETRSAQ LMTGAIDWIW RVPADQAESL KSTPGITVQS GETMRVGFLV IDAAGNSSPH
SPFKDVRVRQ AVNHAINRQG IADNLVRGGS KPVYTACFRT QFGCDDKVVV HYDYNPAKAK
ELLRAAGYAN GFDTDLYAYR EREFAEAIVG DLHKVGIRAR LHYMKHDAMQ VEYRGGKAPM
TFYAWGSYSI NDTSAFTGVY FKGSSDDIVK DPQLRQWLET ADTSTDPAVR KTNYAKALAL
ISRQAYLAPM FSYSTYYAHS SALRFQGYPD ELPRFHEASW K