Gene Vapar_2654 details


Gene Information

Locus tag: Vapar_2654
Symbol:
ID: 7972416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 2797993
End bp: 2799273
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 644793242
Product: cytosine deaminase
Protein accession: YP_002944545
Protein GI: 239815635
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
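
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both assumptions reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2797993, 2799273)
print(length, "bp")                  # compare with the Gene Length field
print(protein_length(length), "aa")  # compare with the Protein Length field
```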


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.689557
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGACC TCCTCGTCAT CAACGCCACC CTCCCCGACG GCCGCTCGGG CATGTCGATT 
GCCGTGCAGG GCGGCCGCAT TGCCGAAGTC ACCGCCGGGC TCGACGCGCC GGCGCTCGAC
AAGCTCGATG CCCAGGGCCT GCTGGTTGCG CCGCATTTCG TCGATCCGCA CTTCCACATG
GACGCCACGC TGAGCTATGG CCTTCCGCGC ATCAACCAGA GCGGCACCTT GCTCGAAGGC
ATTGCGCTGT GGGGCGAACT CAAGCCGCTG CTCACGGCCG ACGCACTCAT CGAGCGTGCG
CTGGCGTATT GCGACTGGGC GGTGGCCAAG GGCCTGCTGG CCGTGCGCTC GCATGTGGAC
ACGAGCGACC CGAGCCTGCT CGCGGTCGAT GCGCTGCTCG AGGTCAAGCG CAAGGTCGCG
CCGTACCTCG ACCTGCAGCT CGTGGCCTTT CCGCAGGATG GCGTGCTGCG CTCGGCAGGC
GGCGTCGACA ACCTGAAGCG CGCGCTCGAC AAGGGCGTGG ACGTGGTCGG CGGCATTCCG
CATTTCGAGC GCACCATGGC CGATGGCGCC GCCAGCGTGA AGCTGCTGAT GGAGCTGGCG
GCCGAGCGCG GCAAGCTGGT CGACATGCAC TGCGACGAGT CGGACGATCC GCTCTCGCGC
CACATCGAAA CCCTGTCGGC CGAGACCTGG CGCCTCGGCA TGCAGGGCCG CGTGACCGGC
TCGCACTGCA CCTCGATGCA TTCGATGGAC AACTACTACG CGAGCAAGCT GCTGCCGCTG
ATCGCGCAGA GCGGCGTCAG CGTGGTTTCC AACCCGCTCA TCAACATCAC GCTGCAGGGC
CGCCACGACA GCTATCCCAA GCGCCGCGGC ATGACGCGCG TGCCCGAGCT GATGGCCGCC
GGCGTGAACG TCGCCTTCGG CCACGACTGC GTGATGGACC CGTGGTACGG CATGGGATCG
GGCGACATGC TCGAAGTAGC GCACATGGGC CTGCACGTGG CGCAGATGAC CAGCCAGGCC
GGCATCCGCC AGTGCTTCGA CGCGGTGACC ACGAACGCCG CGCGCGTGAT GCACCTCGAA
GGCTACGGCC TCGAGGCGGG CTGCGACGCG AGCTTCGTGC TGCTGCAGGC GCGCGACCCG
GTCGAAGCGA TCCGGCTGCG CGCGACGCGG CTCAAGGTGT TCCGCAAGGG GCGCCTGCTG
GCGGAAACGC CGGCCGCGAC GGCCGCGCTG CACCTGCCGG GGCGCGCGCA GTCCACCAGC
TGGATGAGCC GCAAGTCCTA G
 
Protein sequence
MLDLLVINAT LPDGRSGMSI AVQGGRIAEV TAGLDAPALD KLDAQGLLVA PHFVDPHFHM 
DATLSYGLPR INQSGTLLEG IALWGELKPL LTADALIERA LAYCDWAVAK GLLAVRSHVD
TSDPSLLAVD ALLEVKRKVA PYLDLQLVAF PQDGVLRSAG GVDNLKRALD KGVDVVGGIP
HFERTMADGA ASVKLLMELA AERGKLVDMH CDESDDPLSR HIETLSAETW RLGMQGRVTG
SHCTSMHSMD NYYASKLLPL IAQSGVSVVS NPLINITLQG RHDSYPKRRG MTRVPELMAA
GVNVAFGHDC VMDPWYGMGS GDMLEVAHMG LHVAQMTSQA GIRQCFDAVT TNAARVMHLE
GYGLEAGCDA SFVLLQARDP VEAIRLRATR LKVFRKGRLL AETPAATAAL HLPGRAQSTS
WMSRKS
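
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence and computing its GC fraction. The sketch below is a minimal, standard-library-only illustration, not the database's own method. It assumes the sequence is given on the coding strand, as displayed, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence shown above.
print(translate("ATGCTCGACC TCCTCGTC"))      # start of the protein
print(translate("TGGATGAGCC GCAAGTCCTA G"))  # end of the protein, stop codon dropped
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of the record.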