Gene Vapar_5141 details


Gene Information

Locus tag: Vapar_5141
Symbol:
ID: 7971512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5454763
End bp: 5455962
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 644795735
Product: Amidohydrolase 3
Protein accession: YP_002947009
Protein GI: 239818099
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
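
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; neither convention is stated on this page. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5454763
end_bp = 5455962

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1200
print(protein_length)  # 399
```

Under these assumptions the computed values agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.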


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCG AGTCGCTGCG CATTCCGTCG CGGCTGCGCG GCTTTGCCGC GGGCGATGCC 
CAGGTCTTCG ACGTCACGCT GGCCGGCGAC AGGGTGCAGG CCGTCGCGCC GAGCGCATCG
CAGTCGCAGG CGCGCGGCAC CTTGCTGAGC GCGCTCGTCG AGGCGCATGC GCACATCGAC
AAGAACTACA CCGTGCAGGA AGTCGGCGCG GCCGAGGGCA ACCTGTTCGC GGCCATCGAC
CGCATGGCGA AGCACCGCGC GGGCTGGAGC GGCCAAACGC TGCGCCCGCG CATGGAGCGC
GCGCTGCACG AGGCCTGGCA GTCGGGCACG CGCGCGCTGC GCACCCACAT CGACTGGGTG
GAGGGCGAGC CGCCCGCCGC GCTGGCGGTG TTCGAGGCGC TGCGCGAAGA GTGGCGCGGC
CGCATCGAGC TGCAGTTCGT CTCTCTCACG CCGCTCGACC TGTTCGCGGA CCTCGCGGCC
GGCGAGCGCA TTGCGCGCGA GGTGAAGCGC GCGGGCGGCG TGCTGGGCGC CTTCGTGTAT
CGCAACGAAG GCCTGGTGCA CAAGCTGGGC CGCGTGTTCG ACCTCGCGCA GGACCACGGC
CTGGGCCTCG ACTTCCATGT CGACGAAGGG CTCGACGCCG ATGCGAGCGG CCTGCGCAGC
ATTGCGCAGC TGATGCGTGC GCGCGACTTC CGGCGCGGCG TGGTCTGCGG CCACTGCTGC
TCGCTGGCGA TGCAGGACGA TGCCGTTGCC AACGAAACGC TGGCGCTGTG CGCGGGCGCC
GGCATCCACA TCGTCGCGCT GCCGACCACC AACCTCTACC TGCAGGGCGC CTGGGACCGC
ACGCCCGTGC CGCGCGGCAT CACGCGCATC CACGAGGCGG CGGCACGGGG CTTGCGTGCG
AGCCTGGCCA CGGACAACGT GCAGGACGCC TTCTATCCCT ATGGCAGCTA CGACCTTCTC
GAAACCTTCG GCCTCGGTGT GCAGATGGCG CACCTCGCGC CCGCGGAAGA ATGGCTCGAC
GCGATCACCG TCAGCCCCGC GAAGGCGCTC GGCCTGGCAT GGGACGGCCG CATTGCGCCG
GGCTGCCCCG CGGACCTGGT GCTGCTCGCG GCCACCGGCG AGCATGAGCT GGTCGGCCCG
CGCGGGCGCC GACGCACCGT GATCCGTGGC GGTCAAGAAA TTCTGGAGCA GACACGATGA
 
Protein sequence
MKLESLRIPS RLRGFAAGDA QVFDVTLAGD RVQAVAPSAS QSQARGTLLS ALVEAHAHID 
KNYTVQEVGA AEGNLFAAID RMAKHRAGWS GQTLRPRMER ALHEAWQSGT RALRTHIDWV
EGEPPAALAV FEALREEWRG RIELQFVSLT PLDLFADLAA GERIAREVKR AGGVLGAFVY
RNEGLVHKLG RVFDLAQDHG LGLDFHVDEG LDADASGLRS IAQLMRARDF RRGVVCGHCC
SLAMQDDAVA NETLALCAGA GIHIVALPTT NLYLQGAWDR TPVPRGITRI HEAAARGLRA
SLATDNVQDA FYPYGSYDLL ETFGLGVQMA HLAPAEEWLD AITVSPAKAL GLAWDGRIAP
GCPADLVLLA ATGEHELVGP RGRRRTVIRG GQEILEQTR
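
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, not in amino acid assignments). The function names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = clean(
    "ATGAAACTCG AGTCGCTGCG CATTCCGTCG CGGCTGCGCG GCTTTGCCGC GGGCGATGCC"
)
peptide = translate(first_line)
print(peptide)  # MKLESLRIPSRLRGFAAGDA
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.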