Gene Veis_4050 details


Gene Information

Locus tag: Veis_4050
Symbol:
ID: 4691086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4441076
End bp: 4442047
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 63%
IMG OID: 639851797
Product: membrane dipeptidase
Protein accession: YP_998773
Protein GI: 121610966
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
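
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4441076
end_bp = 4442047

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1      # 972 bp

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3                # 324 codons
protein_length = codons - 1              # 323 aa

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 972 bp and protein length of 323 aa.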


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.36145
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACC TGCACCAGAA CGCATTGGTC TTCGACGGCC TGATCATTTC CAACTGGGAC 
CGCTCGGTCT TCGAGGACAT GCGCAAAGGC GGCCTGAGCG GCGCCAATTG CACGGTCTCG
GTGTGGGAGG ACTTCAAGGG CACGGTGGCC AACATCGCCC GCATGAAGCG GCTGATCCGC
GACAACGGCG ATTTGCTGGC CCTGGCGCGC AACACGGCCG ACATCGAGCA GGCCAAGCAG
GACGGCAAGA CCGCCATCGT GCTCGGATTC CAGAACGCCC ATGCGTTCGA GGACCAGCTC
GGCTACATCG AAGCCTTCCA CGACATGGGC GTGCGCGTGG TGCAGCTTTG CTACAACACG
CAGAACCTGA TCGGCACCGG CTGCTACGAA CGCGATGGCG GCCTGTCGGG CTACGGCCAC
GAGGTGGTGG CCGAGATGAA CCGCGTCGGC ATCATGATCG ACCTGTCGCA TGTCGGCGCC
AAGACCTCCG ACGAGGCGAT CCGCGCCTCC AAAAAGCCGG TCACCTACTC GCATTGCCTG
CCGGCGGGTC TGAAGAAACA CCCGCGCAAC AAGAGCGATA CGCAGCTCAA ATTCATCGCC
GACCAGGGCG GTTTCATCGG GGTGACCATG TTCCCGCCCT TTCTCAAGCG CGGCATCGAT
GCCACGGTCG AGGACTATGT GCAGGCGCTG GACTATGTAG TCAACCTGGT CGGCGAGGAC
TGCGTGGGCA TAGGCACCGA CTTCACCCAG GGCTACGGCC AGGCGTTCTT CGATTGGCTC
ACGCATGACA AGGGCGTGCA CCGCCGCCTG ACCGAGTTCG GCGTGGTCCA GAACCCGCAG
GGCATACGCA CCATCGGCGA GATGCCCAAC CTGACCGCAG CGATGGAACG GGCGCGCTGG
CCGGCACGCA AGATCACCAA GGTGATGGGG CGCAACTGGC TGCGGGTTTT CAACGAAGTC
TGGAGTGTGT GA
 
Protein sequence
MSNLHQNALV FDGLIISNWD RSVFEDMRKG GLSGANCTVS VWEDFKGTVA NIARMKRLIR 
DNGDLLALAR NTADIEQAKQ DGKTAIVLGF QNAHAFEDQL GYIEAFHDMG VRVVQLCYNT
QNLIGTGCYE RDGGLSGYGH EVVAEMNRVG IMIDLSHVGA KTSDEAIRAS KKPVTYSHCL
PAGLKKHPRN KSDTQLKFIA DQGGFIGVTM FPPFLKRGID ATVEDYVQAL DYVVNLVGED
CVGIGTDFTQ GYGQAFFDWL THDKGVHRRL TEFGVVQNPQ GIRTIGEMPN LTAAMERARW
PARKITKVMG RNWLRVFNEV WSV
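
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a rough sketch rather than the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are the same as those of translation table 11 (table 11 differs in its permitted start codons, which this sketch does not handle).

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not given special treatment here.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGCAACC TGCACCAGAA CGCATTGGTC"
print(translate(first_blocks))  # MSNLHQNALV
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.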