Gene Veis_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4149 
Symbol 
ID4691505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4564906 
End bp4566357 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content68% 
IMG OID639851896 
Productphenylhydantoinase 
Protein accessionYP_998872 
Protein GI121611065 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.435965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAAG CCCTTGTGAT TCGCGGCGGC ACCGTGGTCA ACGCCGACCG GGAACAAACT 
GCCGATCTGC TGTGCGTCGA TGGCCGCATC GTGGCGCTGG GCGCCGATGC GGCGGCGCAG
GCGCCCGCCG GCGCGCAGAC CCTCGATGCC AGCGGCCAGT ACATCCTGCC CGGCGGCATC
GACCCCCACA CCCACATGCA ACTGCCGTTC ATGGGCACCG TGACCGCAGA CGACTTTTTC
ACCGGCACGG CAGCCGGCCT GGCCGGCGGC ACGACCAGCA TCATCGACTT CGTCATCCCC
GATCCGCAGG AGCCGCCCAT GGCCGCCTAC CGCAAGTGGC GCGGCTGGGC CGAAAAGTCT
GCGGCCGACT ACGGCTTTCA TGTGGCCATC ACCTGGTGGA GCGAGCAGGT GCACGCCGAC
ATGGGCCAAC TGGTGCAAGA AGAAGGCGTG AACAGCTTCA AGCACTTCAT GGCCTACAAG
AACGCCATCA TGTGCGACGA CGAAACGCTG GTAAACAGCT TCCAGCGCGC GCTGGAACTG
GGCGCCATGC CCACGGTGCA TGCCGAAAAC GGCGAACTGG TCTACCGGCT ACAGCAGGAC
GTGGCCAAAA AAGGCATCAC CGGCCCCGAA GGCCATCCGC TGGCCCGCCC GCCGCTGGTC
GAGGCCGAGG CCGCCCAGCG CGCCATCGCC ATTGCCGAGG TGCTCGGAGT GCCGATCTAT
GTGGTGCATG TCAGTTGCCA GGAAGCCGCC GACGCGATAG CCCGCGCCCG CGCGCGCGGC
CAGCGCGTGT ACGGCGAAGT GCTGGCCGGG CACCTGCTGA TCGACGACAG CGTGTACCGC
GACCCGGACT TCGCCCGGGC CGCAGCGCAT GTGATGAGCC CGCCGTTTCG CCCCAAAGCC
CACCAGGAGG CGCTCTGGCG CGGCCTGCAA TCGGGCCAGT TGCAAACCAC GGCCACCGAC
CACTGCGTGT TTTGCGCCGC GCAAAAAGCC ATGGGCCAAA AGAACTTCGC CCACATCCCC
AATGGCACCG GCGGCGTGGA AGAGCGCATG GCCGTCATCT GGGACGCCGG CGTGAATAGC
GGGCGCCTGA CGCCCAGCGA ATTCGTGGCC ATCACCTCGG CCAACGCGGC CCGCCTGTTC
AACATCTACC CGCGCAAAGG CTTCATCGGC GCCGGCGCCG ACGCCGACCT GGTGCTGTGG
GACCCCGAGG GCACGAAAAC CATCTCGGCC AAGACCCAGC ACAGCAAGGG CGACTTCAAC
ATCTTTGAAG GCCGCAGCGT GCGCGGCATC GCCGCCCATA CCGTGAGCCA GGGCCGCGTG
GTCTACGCCA ACGGCGAACT ACGCGCCGAG CCAGGCCGGG GCCGCTACAT CGCGCGCCCG
GCGTTTGGCG CCAACTTCCA GGCCCTGCAA AAACGCGCCC GGCATTTGGC CCCGGCCGCC
GTGGCCCGCT GA
 
Protein sequence
MNQALVIRGG TVVNADREQT ADLLCVDGRI VALGADAAAQ APAGAQTLDA SGQYILPGGI 
DPHTHMQLPF MGTVTADDFF TGTAAGLAGG TTSIIDFVIP DPQEPPMAAY RKWRGWAEKS
AADYGFHVAI TWWSEQVHAD MGQLVQEEGV NSFKHFMAYK NAIMCDDETL VNSFQRALEL
GAMPTVHAEN GELVYRLQQD VAKKGITGPE GHPLARPPLV EAEAAQRAIA IAEVLGVPIY
VVHVSCQEAA DAIARARARG QRVYGEVLAG HLLIDDSVYR DPDFARAAAH VMSPPFRPKA
HQEALWRGLQ SGQLQTTATD HCVFCAAQKA MGQKNFAHIP NGTGGVEERM AVIWDAGVNS
GRLTPSEFVA ITSANAARLF NIYPRKGFIG AGADADLVLW DPEGTKTISA KTQHSKGDFN
IFEGRSVRGI AAHTVSQGRV VYANGELRAE PGRGRYIARP AFGANFQALQ KRARHLAPAA
VAR