Gene Veis_0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_0416 
Symbol 
ID4691170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp471845 
End bp473044 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID639848197 
Productcytosine deaminase 
Protein accessionYP_995222 
Protein GI121607415 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.156504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGCG ATCAAGTCAT ACGCAATGTG CGTCCGCTGG GCGCGGCTGC CGTCGATATC 
GTGCTGCATC GGGCAACGAT AGGCGCCATC GTGCCCGCCG GCAGTGCCGC AGCGGATATC
CCCTGCCTGC TGGATGGCGG CGCTCGGTTG CTATTGCCCG GATTGGTGGA AAGCCATGTC
CATTTCGACA AAACGCTGTG GGGCATGGCT TGGCATTCCA ACACCTCGGG GCCAGGCTTG
AGCGACAAGG TTGCCAATGA GCGGCGCGTG CTCAGGGCCA TCGACGTGCC GCTGGCAGAG
CGCGCCGGCC CCTTGATCGA GCATTGCATT GCGCGTGGCT CGCTGTATTT TCGCTGTCAC
ATCGATATCG ACCCGGAGTT GAAGCTGGAG CGTGTCCACG CGATGCTGGC GCTGCGCGAA
CGCTACCGCG ATGTCATCGA CATGCAGTTT GTCGCCTTTC CCCAAGCCGG CATGCTGATT
CAGCCGGGCA CCCTGGAGTT GATGCGCGCA GCGCTGGAAC TGGGCGTGGA GCATGTCGGC
GGGCTGGACC CGGCGGGCAT GGACGGCGAC CCCATCCGGC ATCTCGAAGG CATATTTGCA
TTGGCAACGC GCTACGACCG AGGCATCGAC ATCCATCTGC ACGACCGTGG CAGCCTGGGG
CTGTGGCAGG TGGAACGCAT CGCCGACTTC ACCAAAGCCT GCGGCCGGTC TGGCCGCGTC
ATGATCAGCC ACGCCTTCTG CCTTGGCATG CTGCCCACCG AGCAATTGCT GCCGCTGGCC
GACCGGCTGG CCGAGTTGGG CATCTCGATC ATGACCTCCG GCAACGCAGG CATCGACGTG
CCGCCGGTTG CCTTGCTGCG CAGCCGTGGC GTCAATGTCT GCTCCGGCTC GGATGGCATA
CGCGACCCCT GGAGCCCCAT GGGCAACGGC GACATGCTGG AGCGGGCGAT GCTGATTGCG
CTGCGCTATC GCTGGAACAA AGATCAGGAA CTGGCCATGG CCTTCGACAT CGTGAGCGCC
GGCGGTGCCC GGGCGCTGGG AATCGCCAAC TACGGCCTGG ACGTTGGATG CCCCGCCAAT
TTCGTGCTGG TGCCGGCAGA AAATCCAAGC GAAGCACTGA TCAATCGTCC CGCCATGCGC
ACCGTCATCA GCCGTGGGCG GATAGTGGCG CGCGCCGGCC AATTTGCAGG ACAGCAATGA
 
Protein sequence
MTGDQVIRNV RPLGAAAVDI VLHRATIGAI VPAGSAAADI PCLLDGGARL LLPGLVESHV 
HFDKTLWGMA WHSNTSGPGL SDKVANERRV LRAIDVPLAE RAGPLIEHCI ARGSLYFRCH
IDIDPELKLE RVHAMLALRE RYRDVIDMQF VAFPQAGMLI QPGTLELMRA ALELGVEHVG
GLDPAGMDGD PIRHLEGIFA LATRYDRGID IHLHDRGSLG LWQVERIADF TKACGRSGRV
MISHAFCLGM LPTEQLLPLA DRLAELGISI MTSGNAGIDV PPVALLRSRG VNVCSGSDGI
RDPWSPMGNG DMLERAMLIA LRYRWNKDQE LAMAFDIVSA GGARALGIAN YGLDVGCPAN
FVLVPAENPS EALINRPAMR TVISRGRIVA RAGQFAGQQ