Gene Veis_3649 details

Gene Information

Locus tag: Veis_3649
Symbol:
ID: 4692375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4035155
End bp: 4036588
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 71%
IMG OID: 639851404
Product: N-formimino-L-glutamate deiminase
Protein accession: YP_998383
Protein GI: 121610576
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02022] formiminoglutamate deiminase
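
The start, end, gene length, and protein length values above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. Neither convention is stated in the record itself. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the gene length
    includes one stop codon that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4035155, 4036588)
print(length)                     # 1434
print(protein_length_aa(length))  # 477
```

Under these assumptions the computed values agree with the listed gene length (1434 bp) and protein length (477 aa).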


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0243931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCTG AGCAGACCGC GCAGCGCAGC CTGTTTGCGG CACAGGCCCT GCTGCCCGGC 
GGCTGGGCGC GCAACGTGCT CGTGCAATGG GATGCGGCCG GGCGCATCAC CGGCGTGGAC
ACCGATGCCA GGGCGCCGGC CGGCCGGCCC GTGGCCGCAG GGCCGCTGCT GCCGGGCCTG
CCGAACCTGC ACTCGCATGC CTTCCAGCGC GCGTTCGCCG GCCTGGCCGA ATACCGCGCC
GAACGCCAGG ACAGCTTCTG GAGTTGGCGC CAACTGATGT ACCGCTTTGC AGCGCACATC
ACACCCGGGC AGATGCAAGC CATCGCCACC TGGCTCTACG TGGAGATGCT GGAGGCCGGC
TACACCCGGG TGTGCGAGTT CCACTACCTG CACCACGACC ACACTGGCCA GCCCTATGCC
GACGACGCCC GGATGTCGCT GGCGCTGCTG CACGCCGCGC GCACGGCCGG CATCGGCATC
ACACTGCTGC CGGCGCTGTA CCAAAGCAGC GGATTTGGCG CCCGGCCGCC GCACGCGCAG
CAAGCGCGCT TCATCCGCAG CACCGCCAGC ATGCTCTCGT TATTGGAGCG CCTGAGGCCC
ATCGCACAAG CGCAGGGCGC TGTGCTGGGC CTGGCTTTGC ATTCGCTGCG CGCGGTGCCG
CCGGACAGCC TGCAGGCCGC CGTGCAGGGC ATCACGGCGC TGGACCCCCA GGCCCCGATC
CACATCCACA TCGCCGAGCA GCAGCAAGAA GTCGACGACT GCATCGCCTG GAGCGGACAG
CGCCCGGTGC AATGGCTGCT CGATCACGCC CCGGTGGACG CACGCTGGTG CCTGGTGCAC
GCCACCCGGA TGACGCCCGA CGAACATGCC GCCGCCGCGC GCACCGGCGC CGTGGTCGGC
CTGTGCCCCA GCACCGAGGC CAACCTGGGC GACGGCATCT TCGACCTGCC GCTGTGGTTG
CAGCATGGCG GCCGCTGGGG CCTGGGCTCG GACAGCCATA TCTGCGTGAA TGCGGCCGAA
GAACTACTGC TGCTCGAATA CGGCCAGCGC CTGTCGCGCC GCCAGCGCAA CGTGCTGGCC
CATGCCACGC AGCCCGAAGT AGCCACCGCG ATGAGCTTGC AGGCCGTGCA GGGCGGCGCA
CAGGCCGCCG GGCACGGCAT CGGTGCGGGC CTGGCAGGCA TCGCCGTCGG CCGGCAGGCC
GACCTGGTGG TGCTCGACGC GCAGCATCTG GCGCTGCGCG GCCTGCCCGC GCACAGCATG
CTCTCGGCCC ATGTATTCGG CAGCCAGCGC AGTTCAGCCC TGGACAGCCT GTGGGTGGCC
GGCGTGCGCC GCGTCACCCA AGGCCGGCAC GCGCTGCACG AGGCGGCGGC CCAGGACTTC
ATCGCCGCCC GCAGCGCCAT CATTGCGGCG CAACGCGCCG GAGCGATCCG CTAA
 
Protein sequence
MAPEQTAQRS LFAAQALLPG GWARNVLVQW DAAGRITGVD TDARAPAGRP VAAGPLLPGL 
PNLHSHAFQR AFAGLAEYRA ERQDSFWSWR QLMYRFAAHI TPGQMQAIAT WLYVEMLEAG
YTRVCEFHYL HHDHTGQPYA DDARMSLALL HAARTAGIGI TLLPALYQSS GFGARPPHAQ
QARFIRSTAS MLSLLERLRP IAQAQGAVLG LALHSLRAVP PDSLQAAVQG ITALDPQAPI
HIHIAEQQQE VDDCIAWSGQ RPVQWLLDHA PVDARWCLVH ATRMTPDEHA AAARTGAVVG
LCPSTEANLG DGIFDLPLWL QHGGRWGLGS DSHICVNAAE ELLLLEYGQR LSRRQRNVLA
HATQPEVATA MSLQAVQGGA QAAGHGIGAG LAGIAVGRQA DLVVLDAQHL ALRGLPAHSM
LSAHVFGSQR SSALDSLWVA GVRRVTQGRH ALHEAAAQDF IAARSAIIAA QRAGAIR
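
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that might be done, along with a GC-fraction calculation of the kind that could be compared against the GC content listed in the record. The helper names `join_blocks` and `gc_fraction` are hypothetical. The example call uses only the first line of the gene sequence, so its result is not the whole-gene GC content.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGGCGCCTG AGCAGACCGC GCAGCGCAGC CTGTTTGCGG CACAGGCCCT GCTGCCCGGC"
seq = join_blocks(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC fraction of this 60 bp window only
```

To check the whole gene, the same two calls would be applied to all lines of the gene sequence pasted together as one string.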