Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0568 |
Symbol | |
ID | 3774806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 549371 |
End bp | 550687 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637798976 |
Product | cytosine deaminase-like protein |
Protein accession | YP_399587 |
Protein GI | 81299379 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.322505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00702757 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGATCCTGT CTCTTGTCTT GCCTGAGAGC GATCGCTACT GTCTGCAGCA AGCGCGGGTT CCACAAGCAG TCCTCACTGA CTGGAGCAGC CTCGGTCAAC CCGATTGGGA AGGTAACTTT CTGATCGATC TTGAGATCGA CAGGGGCAAG ATTTGTGCGA TCGCCCCCAC CCAGTCCAGC GCTGGCAACG TCGCCAGTCT GGATTTACAG GGTCGGCAAC TCTGGCCCGG ATTTGTCGAT ATCCACACCC ACATCGATAA AGGACATATC TGGCCGCGCA GTCCCAATCC TGACGGCAGT TTTGATGGCG CTTTGACGAC GGCGATCGCG GATTCTCAAA CCCACTGGAA TTCCGATGAG TTGCTGGTGC GGATGGAATT TAGCCTGCGC TGTGCCTACG CCCACGGCAC GGTCGCGCTA CGCACCCATC TCGATTCTGC AGGTGATTTG GCAGCCCAGG GCTTCACTGT TTTTCAGGAA TTACGGGAAC GCTGGCGCGA TCGCCTGACG CTCCAAGCCG CTTCGCTAGT CTCCCTCGAT CACTATCAGG GGGTAGCAGG AGAGCGGCTC GCCGATTTGG TAGCGGCAGC AGGCGGTCTA CTCGGTGGAG TAACTTTTCC GTCGCCAGCC TTGGATGAGC AACTCGATCG CCTGCTGGAT CTTGCCCGCG ATCGCCAGCT CGATCTGGAT TTGCATGTGG ATGAAAGCCT CAATCCAAGC GATCGCACTC TGCTGCAGGT AGCTGCTGCC GTCCAACGCA ACGGCTTCAC CGGTAAAGTG CTCTGCGGCC ACTGTTGTAG CTTGTCCGTA CAACCCGAGG CAGACTTACC GATTCAGTTA CAGGCAGTGC AAGCGGCGGG ACTGGGGATT GTCTCACTAC CGCTCTGCAA TTCCTACCTA CAGGATCGGC AGGCCGGTCG GACGCCACGA TTGCGCGGTA TTGCCCCAGT TCAAGAAATC CAAGCGGCAG GGATTCCCAC CTTCCTCTCC AGCGACAACA GCCGCGATCC CTTCTATGCT TACGGCGATT TGGACATGGT CGAAGTCTTT CGAGAGTCGG TGCGAATTGG GCAGCTCGAT CATCCTTGGT CACCTTGGCC TGCTGCCGTC ACCCGTACGC CGGCGGATTG GATAGGACTG CCAGATCAAG GCCGGATTGC GATCGGGGCG CGGGCTGATT TCGTGATTTT TAACGCCCGT TCTTTCACAG AACTGCTGGC TCGCCCCCAA AGCGATCGCC TGATTGTGCG CAATGGCCGC GCGATCGCGC CGGAACTGCC TGACTATGCT GAATTAGATG CAATCTTGGC GCAGTAG
|
Protein sequence | MILSLVLPES DRYCLQQARV PQAVLTDWSS LGQPDWEGNF LIDLEIDRGK ICAIAPTQSS AGNVASLDLQ GRQLWPGFVD IHTHIDKGHI WPRSPNPDGS FDGALTTAIA DSQTHWNSDE LLVRMEFSLR CAYAHGTVAL RTHLDSAGDL AAQGFTVFQE LRERWRDRLT LQAASLVSLD HYQGVAGERL ADLVAAAGGL LGGVTFPSPA LDEQLDRLLD LARDRQLDLD LHVDESLNPS DRTLLQVAAA VQRNGFTGKV LCGHCCSLSV QPEADLPIQL QAVQAAGLGI VSLPLCNSYL QDRQAGRTPR LRGIAPVQEI QAAGIPTFLS SDNSRDPFYA YGDLDMVEVF RESVRIGQLD HPWSPWPAAV TRTPADWIGL PDQGRIAIGA RADFVIFNAR SFTELLARPQ SDRLIVRNGR AIAPELPDYA ELDAILAQ
|
| |