Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2654 |
Symbol | |
ID | 7972416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 2797993 |
End bp | 2799273 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644793242 |
Product | cytosine deaminase |
Protein accession | YP_002944545 |
Protein GI | 239815635 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.689557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACC TCCTCGTCAT CAACGCCACC CTCCCCGACG GCCGCTCGGG CATGTCGATT GCCGTGCAGG GCGGCCGCAT TGCCGAAGTC ACCGCCGGGC TCGACGCGCC GGCGCTCGAC AAGCTCGATG CCCAGGGCCT GCTGGTTGCG CCGCATTTCG TCGATCCGCA CTTCCACATG GACGCCACGC TGAGCTATGG CCTTCCGCGC ATCAACCAGA GCGGCACCTT GCTCGAAGGC ATTGCGCTGT GGGGCGAACT CAAGCCGCTG CTCACGGCCG ACGCACTCAT CGAGCGTGCG CTGGCGTATT GCGACTGGGC GGTGGCCAAG GGCCTGCTGG CCGTGCGCTC GCATGTGGAC ACGAGCGACC CGAGCCTGCT CGCGGTCGAT GCGCTGCTCG AGGTCAAGCG CAAGGTCGCG CCGTACCTCG ACCTGCAGCT CGTGGCCTTT CCGCAGGATG GCGTGCTGCG CTCGGCAGGC GGCGTCGACA ACCTGAAGCG CGCGCTCGAC AAGGGCGTGG ACGTGGTCGG CGGCATTCCG CATTTCGAGC GCACCATGGC CGATGGCGCC GCCAGCGTGA AGCTGCTGAT GGAGCTGGCG GCCGAGCGCG GCAAGCTGGT CGACATGCAC TGCGACGAGT CGGACGATCC GCTCTCGCGC CACATCGAAA CCCTGTCGGC CGAGACCTGG CGCCTCGGCA TGCAGGGCCG CGTGACCGGC TCGCACTGCA CCTCGATGCA TTCGATGGAC AACTACTACG CGAGCAAGCT GCTGCCGCTG ATCGCGCAGA GCGGCGTCAG CGTGGTTTCC AACCCGCTCA TCAACATCAC GCTGCAGGGC CGCCACGACA GCTATCCCAA GCGCCGCGGC ATGACGCGCG TGCCCGAGCT GATGGCCGCC GGCGTGAACG TCGCCTTCGG CCACGACTGC GTGATGGACC CGTGGTACGG CATGGGATCG GGCGACATGC TCGAAGTAGC GCACATGGGC CTGCACGTGG CGCAGATGAC CAGCCAGGCC GGCATCCGCC AGTGCTTCGA CGCGGTGACC ACGAACGCCG CGCGCGTGAT GCACCTCGAA GGCTACGGCC TCGAGGCGGG CTGCGACGCG AGCTTCGTGC TGCTGCAGGC GCGCGACCCG GTCGAAGCGA TCCGGCTGCG CGCGACGCGG CTCAAGGTGT TCCGCAAGGG GCGCCTGCTG GCGGAAACGC CGGCCGCGAC GGCCGCGCTG CACCTGCCGG GGCGCGCGCA GTCCACCAGC TGGATGAGCC GCAAGTCCTA G
|
Protein sequence | MLDLLVINAT LPDGRSGMSI AVQGGRIAEV TAGLDAPALD KLDAQGLLVA PHFVDPHFHM DATLSYGLPR INQSGTLLEG IALWGELKPL LTADALIERA LAYCDWAVAK GLLAVRSHVD TSDPSLLAVD ALLEVKRKVA PYLDLQLVAF PQDGVLRSAG GVDNLKRALD KGVDVVGGIP HFERTMADGA ASVKLLMELA AERGKLVDMH CDESDDPLSR HIETLSAETW RLGMQGRVTG SHCTSMHSMD NYYASKLLPL IAQSGVSVVS NPLINITLQG RHDSYPKRRG MTRVPELMAA GVNVAFGHDC VMDPWYGMGS GDMLEVAHMG LHVAQMTSQA GIRQCFDAVT TNAARVMHLE GYGLEAGCDA SFVLLQARDP VEAIRLRATR LKVFRKGRLL AETPAATAAL HLPGRAQSTS WMSRKS
|
| |