Gene Information |
Locus tag | Syncc9902_0638 |
Symbol | |
ID | 3743460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 649315 |
End bp | 652233 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637770810 |
Product | glycosyltransferase |
Protein accession | YP_376650 |
Protein GI | 78184215 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3754] Lipopolysaccharide biosynthesis protein |
TIGRFAM ID | |
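The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(649315, 652233)
print(length)                  # 2919
print(protein_length(length))  # 972
```

Both values match the Gene Length and Protein Length rows of this record.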
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.770569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCAA GCCAAAGTCT TTCAGATCGA TCGCTCACGC AGGCTCTACG ACAAGTCGTT GAGCGGGGAT TTGGCTTGGA AGACTTACAG CCAGCCTTGG CGGAGCGAAA TCCAGGCCTA CCAGTGGCCT GCCTCGATCC TCGCCTGATC CCGATTCCTT TACTGGTGAG CCCCCAGAGC TGGGCACCAC AAGACAGCGG CATCGATGAA AAAGCCCTGC TTCAAGCCCA TCCAGATCTC ATCGGCGCCA CAACGCTTGT GAGCTCTGGG CGACTCACCC AGATCGTCCA TGGAGAGGCA TGCCTGTATG CAGACCTACC TCCAATCCAC TTACAGGCCT ACCTCGATGG TCTCCGTCCA GCCGCGAAAA AAGAACTTCA ACGCCAAAAA GCCTGCGCCC TTGAGCACCT AGCCACCCTC GGCTGGAAAC GGTTCCGCCA TCGGCGTGCT CTTGGGAAAG ATCTCGACCG CTATCAACCG GAACCTGATG CAGCCATACA AACCGCTCAC TCACCAAACA ATCCATTGCC ACTGGTGTTG GTGGTTTTAC AAACGGAGCC TCGACCAGCC AACAACGCTG CCAGCACTCT TGACAGCAAC ACCACTGACG GAAACACCGT TGGCTGGAAT CACATCATCC ATGGACATCT CGACGATCTC AACTCGGTGT GGTCGGCACT GCCTTCCTCC AGCGAGGCTC TGGTGAGCTT TTGCCATTGC GATGACCGCC TGGCACCACA AGCACCACTA CTGCTGAGTC GAGCCGCAGA AGCTTCACAC AAGGCTGAGT TGTTCAGCTC TGATGAAACA TTGCAGTGGA GTGCCGATCC GACACAGCCC CCTGGCAACC GTCAAAACCA CGTATCCGCC CTGCCTTGGC GCCTACTCAC CCGAGGGTGT ATCGGTGGGC TAGTGACGCT TCGACTCAGT CGATTGCGCC AATTAGACCT CCCAAATCGA CGCCTATGTC TTCACAATCT GATCCTTGAT CTCAGCCTTC AAGTCAGTGC ACAACGCCAC ACATGTGAGC ATTTAAGCCA GGCTCTACTG AGCCGAAAAA TCACCACGAA TCCCGCCGTT CCAGAAGTGG CATCCCCCAA AGATCGTCTG GTTTTCAGTG CCGCACAATG TCAAGAATCT CTGAGTATTT GTCGAGAACG CGGAGCTCAT TTTCTGACGA CTGGCGGGGT AATTACGGCC CATCCCAAAT GGTCTGGATG CCATCAAGTG CATTGGACTC CCCCAGAGAA CACATTTGTT TCAATTCTGA TTCCCTTTCG CGATAGCGCA GCACTCACCA AAGTTTGCGT GGAGTCGATT CGCCGTTGTG CTGGAGATAT TCCCTTTGAA CTGATCTTGA TCGACAACGG CAGCACGAGT ACAGACACTT TGACTTGGCT CAATCATCAG CGCGTTCTGA GCGATGTCAC AGTGCTGCGT TTGGATTGCG ACTTTAATTA CGCACAGATT CATAATTTTG CACGTCCTTA TTGCCAGGGC ACACATCTCT TACTTCTAAA TAACGATATT GAAGTCCGCT CGGAAAATAT TCTTCAAAAA CTCCTCGATC CTTTTGCAGT GAGCCACACA GTGGCCGTTG GTGCTCGACT GCGTTACCCC AACAACAGCA TCCAACATCA TGGGGTCGTG CTCATCAAAG GGGAACGACG CTGTGTGTTG GAGCCGGGCA AACACCTGTC TGAACTCAAC ACCATTGATA TGTTCACTCC CTTGGGAGTG CAAGAAGAGT GGATTGCAGC CACTGCTGCC TGTTTGATGC TTAGAACCGC TGATTTTGAT 
CAAATTGGTG GGTTCGATGA ACAACTCGCA GTGGTCTTCA ACGATGTAGA TCTATGCCTG CGATTGCGAC AACTGGAAGG CTCTGTTGTT GTCACGCCGT TTGTTGACAT CATTCACCAT GAATCAGTGA GTCGCGGCAA AGATCAATCA GGAGAGGCCC TAGCCCGACA TCAACGCGAG TCGGGTTATT TGCGACATAA ACACGCCGGA CTCTTTGAAA CTGGAGATCC ACTATTTAGT CACAACATCC ATCCCCACAG CAACCGCTTC CAACCCAAAG ATCCTGCACC CCAGTCGTCT GGCCGTGTGA AGCCTCAGCA GATCAGCCAC TGGAAGCAAC GGGGCTGGAA ACCAAATAAC AAAAAGCCCT TGATTGTGAT GGCTCATTTT GATCCAACCA ACCGTTTACG CCCAGATCTG CTCAAACTTT TGGAGAGTTA CGCACAATTT GGAGATGTCG TTTTGGTCTC GGCTTCGCCA GGCCTCCGAT GGCACTGGAA CACCTTGCGA AAGCTCAAAA GATGTTGCCG CGCCATTTTG ATTCGCCGCA ACGAAGGATA TGACTTTGGC AGCTGGATGG CAGCCCTGCA TTCGCTTAAA AGAGATATTG ACAACATCGA TCAACTTATT CTCACAAATG ACAGTTTTTG GGGGCCAATC ACACCCCTCG ACGATTTATT TCAACGACTA AAAGCAAGCA GTGCAGACGT CATTGGACTC ACCGACGACC AAATGTATGC ACCCCATCTT CAATCTGCTT TTCTTGCATT TCGAAAACCA GTAATTCAAA GTGATGCATT TCAAAGCTTT TGGAATCAAC TGGAATCATG GCCTCGCAAA CGTGATTTGA TCAAAAAATA CGAAGTTGGT CTACCAGTTT GCCTACAAAA AGCCGGGTTT AAAACAGAAA GTCTTTACAC AAAACATGCC AATGGAAATA TTTTGCATAC AGCCTGGAAA GAGTTAATTG AAGCAAAAAA CTTTCCCTTT CTGAAGGTTT CACTGCTTCG GGATAATCCG ATGCATCAAG AGATTGATGC CTGGGAGCAG GTGATCTCGG AACGGAATCC CGTCTTGGCC GCTCAAATCC GATCACAACT CCAACAGCCA AAATCCTGA
|
Protein sequence | MVSSQSLSDR SLTQALRQVV ERGFGLEDLQ PALAERNPGL PVACLDPRLI PIPLLVSPQS WAPQDSGIDE KALLQAHPDL IGATTLVSSG RLTQIVHGEA CLYADLPPIH LQAYLDGLRP AAKKELQRQK ACALEHLATL GWKRFRHRRA LGKDLDRYQP EPDAAIQTAH SPNNPLPLVL VVLQTEPRPA NNAASTLDSN TTDGNTVGWN HIIHGHLDDL NSVWSALPSS SEALVSFCHC DDRLAPQAPL LLSRAAEASH KAELFSSDET LQWSADPTQP PGNRQNHVSA LPWRLLTRGC IGGLVTLRLS RLRQLDLPNR RLCLHNLILD LSLQVSAQRH TCEHLSQALL SRKITTNPAV PEVASPKDRL VFSAAQCQES LSICRERGAH FLTTGGVITA HPKWSGCHQV HWTPPENTFV SILIPFRDSA ALTKVCVESI RRCAGDIPFE LILIDNGSTS TDTLTWLNHQ RVLSDVTVLR LDCDFNYAQI HNFARPYCQG THLLLLNNDI EVRSENILQK LLDPFAVSHT VAVGARLRYP NNSIQHHGVV LIKGERRCVL EPGKHLSELN TIDMFTPLGV QEEWIAATAA CLMLRTADFD QIGGFDEQLA VVFNDVDLCL RLRQLEGSVV VTPFVDIIHH ESVSRGKDQS GEALARHQRE SGYLRHKHAG LFETGDPLFS HNIHPHSNRF QPKDPAPQSS GRVKPQQISH WKQRGWKPNN KKPLIVMAHF DPTNRLRPDL LKLLESYAQF GDVVLVSASP GLRWHWNTLR KLKRCCRAIL IRRNEGYDFG SWMAALHSLK RDIDNIDQLI LTNDSFWGPI TPLDDLFQRL KASSADVIGL TDDQMYAPHL QSAFLAFRKP VIQSDAFQSF WNQLESWPRK RDLIKKYEVG LPVCLQKAGF KTESLYTKHA NGNILHTAWK ELIEAKNFPF LKVSLLRDNP MHQEIDAWEQ VISERNPVLA AQIRSQLQQP KS
|
| |
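The GC content row in this record can in principle be recomputed from the gene sequence. The sketch below assumes GC content is defined as the percentage of G and C bases over all bases, and that the spaces grouping the sequence into blocks of ten are purely cosmetic; `gc_content` is a hypothetical helper name. The example call uses only the first 20 bases of the gene, so it does not reproduce the 51% reported for the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence shown above (9 of 20 bases are G or C).
print(gc_content("ATGGTGTCAA GCCAAAGTCT"))  # 45.0
```

Passing the complete gene sequence to the same function would give the whole-gene figure to compare against the record.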