Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_2031 |
Symbol | |
ID | 3737707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | + |
Start bp | 1847745 |
End bp | 1850756 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637776617 |
Product | glycosyltransferase |
Protein accession | YP_382326 |
Protein GI | 78213547 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTGGAT GCTCCTACGA TCAGGGCGTG GAGCCTGCCA GCGGCCAAGA CAACAGCACG CTTGATGCTC AGGCCTGGCA GCGCTGGCGG TCTGGTCGCG CCAGCGCCGA AGAAATCGAG CGTTGGCAGC AACAGGTCCA GCAGCAGCTC CCCTTGCTCC CGCAACAGCT GCTGGACCCG TCGCTGCTGC CCATCGCACT GTTGAGCAAC CCGAGCCAGT GGTCTCCGGA AGACAGCGGG CTGGATCCAA TCGCTCTTCT GGCCTGCCAC CAGGACCTCA CTGACCCCAA ACAATTGCTG AGCAGCGGCC GCCTTGGCGA GATCGCCCTG GGACAGCGCA GCCTGTTCAC GGATCTGCCG CCGGTACACC TCCAGGCCTA CCGGGATGGG TTGAGACCAG CGGCATGTGA CGACTCCCTC AGCGCTCTAG AGCACCTGGG GGGCATCGGC CGTCAACGCT TCCAGCAGGG ACAGCCCTTG GGCATGGATC TCGATCGCTA CCAGCCCCCC CTACCGGACC CAGCACCAAG CGGGTCCATC CCCTCAGCGG CCACGGTGTT GGTGGTTCTC CACCCCACCC AAGAAGAAGC AGAGGCTGAG TCGGCGCGCA GCTCACCACC AGAAGGATGG AGTCAGATCC GGCACGCCTC CCTCGAGGAT CCCTCTGGAT GGATCGGAGC TCCGTTCCCA GACGGTGAGA CCCTCGTCAG CTTCTGCCAC GCCAGTGATC AGCTGGATCC CCAGGCAGCC CTGCGCATGG CCCACTGCGC AGCGCAACAC CCTGAAGCGG TGTTGCTCAC CAGCGATGAA ACCCTGCGCT GGAGTGAAGA CCCAGGCATT CCTGCGGGGA ACCGCCAGAA CAGAACAACC ATCACACCGT TTCGTCTGCT CTGCCGGGGA TGCATCGGCG GCCTGGTGAC CCTCCGCTGG TCAACGCTGC AACAACTGAA CCTGCCGGCA TCCAGCGGTT CGCTGCATGC CCTGCTGCTG GATCTGGCGC TGCAGGTTTG CCGCCGTGGA GATGCCGTGG CCCACTGCCC GGAGGTTTTG CTTCAGCGAT GCATCCGGGC GAACCCGACC GTGCCCGACG TTGCATCCCC GGCCGACCGC CACTGTTGGA CCCCTGAGCT CAGCGATGAG ATCCTGGCGA TCACCCAGCG ACACAGCCCA GGCTTTCTGG AGCTGGGTGG GGAGCTCACT TCCTCTCAAT CGCTCAACGC CTGCCATCAG TTAAGGCTCC GCACCGATCC CAGTGTTCTG GTGTCGGTTC TGATCCCGTT TCGCGACCGG GTGGATCTCA CCCAGAGCTG CGTCGCCTCC CTGCGCCGCT GTGCCGGAGC TGTGGCCTAC GAGCTGATCC TGATCGACAA CGGCAGTGAG GAGGCCGCCA CCCAGGACTG GCTGGATGAA CAGGCGCAGC TCGATGACGT GTGCGTGGTG CGAGTGGACG AACCGTTCAA CTATTCGCGG CTCAACAACA TCGGCAGACG CCATGCCCGT GGCAGCCATT TGTTGCTGCT GAACAACGAC ATCGAATTCC GATCCGCCGA GGTGCTCCAG GCCCTGCTGG ACCCATTCGC CTACCGCGGC ACCACTGCCG TGGGAGCGAA GTTGCACTAC CCCGACGGGA GTATCCAGCA TCAGGGGGTC GCCCTGGTGA AGGGGGAACG GCGCTGTGTG GTGGAACCTG GCAAACACCT GCACAGCGCT CCGGTGCTGG CCACCCTCAC ACCCCTGCTC CTGCAGGAGG AGTTCACCGC AGCAACCGGA GCCTGCCTGA TGCTCCGCAG CAGTGATTTC GATGGCATTC AAGGCTTCGA TGAAAATCTC GCCGTGGTGT TCAACGATGT GGATCTCTGC CTGCGGCTCC GCGCGCAGGG CGGATCAATA GTCGTGACCC CCTATCTGGA GATCGTTCAT CACGAGTCGA TCAGCCGCGG CAAGGATCGG GAAGGTGCTG CACTGGCCCG CCATCAACGG GAATCCGGTC AGCTAAGGGC CAAGCACGCC GGGCTGTTTG CCGCAGGCGA TCCCCTCAGC AGTCAGCGGA TTCATCCCCA CAGCAATCGC TACCAACCCC GGGAACCCGC GCCGCGCTCA AAAGGTCCCG TGGCCAATGC CGTGCTCATG CACTGGAGGG ATCCGAACTT CCAACCCAAC CGTCAACGAC CGATCGTTGT GCTCGCGCAT TTTTCAGCCG ACAACCGGTT TCGGGATGAC CTCTTCCCTC TAATTGACGA GTACCAGCGC TTCGCCGACG TGATTGTTGT GTCGTCAGCC AGCGGGGTGC GCTGGCACCC GAGAACCCTG CATCGGCTGC GCCAACGCTG TGCCGCGATC GTTATCCGGC GCAACCAGGG GTATGACTTC GGCAGCTGGA AGGCAGCACT CAACCTTCAC CGGCAGGACA TTGATCAAGC AGCGTTCCTG GTGCTCACCA ATGACAGCTT CTGGGGACCG ATCGCCCCCC TCGACGATCT GTTCCAACGC CTGCAGGCCA GCAGAGCGGA TGTGATTGGG CTGACGGACG ATTTGATGTA CGAACCGCAT CTGTCGTCCG CCTTCACGGC TTACAAACCC AAGGCCTTGC AGAGCCAAGC CTTCAACAAC TTCTGGAATT CTCTACAGAT CTGGCCCCGC AAACGGGACC TGGTCAAACA ATGCGAAGTG GGCCTACCGG TGCAGCTGCG GGCAGCGGGG GTGAAGCTCG AGAGCCTCTA TACCCACAAC GCCAATGGCA ATGTTCTCCA CTACGACTGG AAGCACTTGA TTGAGCAAAG CGGCTTCCCT TTCCTCAAGG TGAGCCTGCT GCGGGACAAC CCCACAAAGC AACCCGTCGA CACCTGGCCC GAGGTGATCG GACAACGCAA CCCTCAGCTA GCGGCCAGCA TCGAACGCCA ACTGCAGTCG AGGACAGGGT TACGACGACT GCTGGAACGG CTGCGCCATC GACTGAATGG AAGTGGTCGC AATGGGTCTC GTGCTGTAAT AGCGCCGACT TCACACCGGT GA
|
Protein sequence | MTGCSYDQGV EPASGQDNST LDAQAWQRWR SGRASAEEIE RWQQQVQQQL PLLPQQLLDP SLLPIALLSN PSQWSPEDSG LDPIALLACH QDLTDPKQLL SSGRLGEIAL GQRSLFTDLP PVHLQAYRDG LRPAACDDSL SALEHLGGIG RQRFQQGQPL GMDLDRYQPP LPDPAPSGSI PSAATVLVVL HPTQEEAEAE SARSSPPEGW SQIRHASLED PSGWIGAPFP DGETLVSFCH ASDQLDPQAA LRMAHCAAQH PEAVLLTSDE TLRWSEDPGI PAGNRQNRTT ITPFRLLCRG CIGGLVTLRW STLQQLNLPA SSGSLHALLL DLALQVCRRG DAVAHCPEVL LQRCIRANPT VPDVASPADR HCWTPELSDE ILAITQRHSP GFLELGGELT SSQSLNACHQ LRLRTDPSVL VSVLIPFRDR VDLTQSCVAS LRRCAGAVAY ELILIDNGSE EAATQDWLDE QAQLDDVCVV RVDEPFNYSR LNNIGRRHAR GSHLLLLNND IEFRSAEVLQ ALLDPFAYRG TTAVGAKLHY PDGSIQHQGV ALVKGERRCV VEPGKHLHSA PVLATLTPLL LQEEFTAATG ACLMLRSSDF DGIQGFDENL AVVFNDVDLC LRLRAQGGSI VVTPYLEIVH HESISRGKDR EGAALARHQR ESGQLRAKHA GLFAAGDPLS SQRIHPHSNR YQPREPAPRS KGPVANAVLM HWRDPNFQPN RQRPIVVLAH FSADNRFRDD LFPLIDEYQR FADVIVVSSA SGVRWHPRTL HRLRQRCAAI VIRRNQGYDF GSWKAALNLH RQDIDQAAFL VLTNDSFWGP IAPLDDLFQR LQASRADVIG LTDDLMYEPH LSSAFTAYKP KALQSQAFNN FWNSLQIWPR KRDLVKQCEV GLPVQLRAAG VKLESLYTHN ANGNVLHYDW KHLIEQSGFP FLKVSLLRDN PTKQPVDTWP EVIGQRNPQL AASIERQLQS RTGLRRLLER LRHRLNGSGR NGSRAVIAPT SHR
|
| |