Gene Synpcc7942_0568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0568 
Symbol 
ID3774806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp549371 
End bp550687 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content58% 
IMG OID637798976 
Productcytosine deaminase-like protein 
Protein accessionYP_399587 
Protein GI81299379 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.322505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00702757 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGATCCTGT CTCTTGTCTT GCCTGAGAGC GATCGCTACT GTCTGCAGCA AGCGCGGGTT 
CCACAAGCAG TCCTCACTGA CTGGAGCAGC CTCGGTCAAC CCGATTGGGA AGGTAACTTT
CTGATCGATC TTGAGATCGA CAGGGGCAAG ATTTGTGCGA TCGCCCCCAC CCAGTCCAGC
GCTGGCAACG TCGCCAGTCT GGATTTACAG GGTCGGCAAC TCTGGCCCGG ATTTGTCGAT
ATCCACACCC ACATCGATAA AGGACATATC TGGCCGCGCA GTCCCAATCC TGACGGCAGT
TTTGATGGCG CTTTGACGAC GGCGATCGCG GATTCTCAAA CCCACTGGAA TTCCGATGAG
TTGCTGGTGC GGATGGAATT TAGCCTGCGC TGTGCCTACG CCCACGGCAC GGTCGCGCTA
CGCACCCATC TCGATTCTGC AGGTGATTTG GCAGCCCAGG GCTTCACTGT TTTTCAGGAA
TTACGGGAAC GCTGGCGCGA TCGCCTGACG CTCCAAGCCG CTTCGCTAGT CTCCCTCGAT
CACTATCAGG GGGTAGCAGG AGAGCGGCTC GCCGATTTGG TAGCGGCAGC AGGCGGTCTA
CTCGGTGGAG TAACTTTTCC GTCGCCAGCC TTGGATGAGC AACTCGATCG CCTGCTGGAT
CTTGCCCGCG ATCGCCAGCT CGATCTGGAT TTGCATGTGG ATGAAAGCCT CAATCCAAGC
GATCGCACTC TGCTGCAGGT AGCTGCTGCC GTCCAACGCA ACGGCTTCAC CGGTAAAGTG
CTCTGCGGCC ACTGTTGTAG CTTGTCCGTA CAACCCGAGG CAGACTTACC GATTCAGTTA
CAGGCAGTGC AAGCGGCGGG ACTGGGGATT GTCTCACTAC CGCTCTGCAA TTCCTACCTA
CAGGATCGGC AGGCCGGTCG GACGCCACGA TTGCGCGGTA TTGCCCCAGT TCAAGAAATC
CAAGCGGCAG GGATTCCCAC CTTCCTCTCC AGCGACAACA GCCGCGATCC CTTCTATGCT
TACGGCGATT TGGACATGGT CGAAGTCTTT CGAGAGTCGG TGCGAATTGG GCAGCTCGAT
CATCCTTGGT CACCTTGGCC TGCTGCCGTC ACCCGTACGC CGGCGGATTG GATAGGACTG
CCAGATCAAG GCCGGATTGC GATCGGGGCG CGGGCTGATT TCGTGATTTT TAACGCCCGT
TCTTTCACAG AACTGCTGGC TCGCCCCCAA AGCGATCGCC TGATTGTGCG CAATGGCCGC
GCGATCGCGC CGGAACTGCC TGACTATGCT GAATTAGATG CAATCTTGGC GCAGTAG
 
Protein sequence
MILSLVLPES DRYCLQQARV PQAVLTDWSS LGQPDWEGNF LIDLEIDRGK ICAIAPTQSS 
AGNVASLDLQ GRQLWPGFVD IHTHIDKGHI WPRSPNPDGS FDGALTTAIA DSQTHWNSDE
LLVRMEFSLR CAYAHGTVAL RTHLDSAGDL AAQGFTVFQE LRERWRDRLT LQAASLVSLD
HYQGVAGERL ADLVAAAGGL LGGVTFPSPA LDEQLDRLLD LARDRQLDLD LHVDESLNPS
DRTLLQVAAA VQRNGFTGKV LCGHCCSLSV QPEADLPIQL QAVQAAGLGI VSLPLCNSYL
QDRQAGRTPR LRGIAPVQEI QAAGIPTFLS SDNSRDPFYA YGDLDMVEVF RESVRIGQLD
HPWSPWPAAV TRTPADWIGL PDQGRIAIGA RADFVIFNAR SFTELLARPQ SDRLIVRNGR
AIAPELPDYA ELDAILAQ