Gene Syncc9605_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2031 
Symbol 
ID3737707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1847745 
End bp1850756 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content62% 
IMG OID637776617 
Productglycosyltransferase 
Protein accessionYP_382326 
Protein GI78213547 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTGGAT GCTCCTACGA TCAGGGCGTG GAGCCTGCCA GCGGCCAAGA CAACAGCACG 
CTTGATGCTC AGGCCTGGCA GCGCTGGCGG TCTGGTCGCG CCAGCGCCGA AGAAATCGAG
CGTTGGCAGC AACAGGTCCA GCAGCAGCTC CCCTTGCTCC CGCAACAGCT GCTGGACCCG
TCGCTGCTGC CCATCGCACT GTTGAGCAAC CCGAGCCAGT GGTCTCCGGA AGACAGCGGG
CTGGATCCAA TCGCTCTTCT GGCCTGCCAC CAGGACCTCA CTGACCCCAA ACAATTGCTG
AGCAGCGGCC GCCTTGGCGA GATCGCCCTG GGACAGCGCA GCCTGTTCAC GGATCTGCCG
CCGGTACACC TCCAGGCCTA CCGGGATGGG TTGAGACCAG CGGCATGTGA CGACTCCCTC
AGCGCTCTAG AGCACCTGGG GGGCATCGGC CGTCAACGCT TCCAGCAGGG ACAGCCCTTG
GGCATGGATC TCGATCGCTA CCAGCCCCCC CTACCGGACC CAGCACCAAG CGGGTCCATC
CCCTCAGCGG CCACGGTGTT GGTGGTTCTC CACCCCACCC AAGAAGAAGC AGAGGCTGAG
TCGGCGCGCA GCTCACCACC AGAAGGATGG AGTCAGATCC GGCACGCCTC CCTCGAGGAT
CCCTCTGGAT GGATCGGAGC TCCGTTCCCA GACGGTGAGA CCCTCGTCAG CTTCTGCCAC
GCCAGTGATC AGCTGGATCC CCAGGCAGCC CTGCGCATGG CCCACTGCGC AGCGCAACAC
CCTGAAGCGG TGTTGCTCAC CAGCGATGAA ACCCTGCGCT GGAGTGAAGA CCCAGGCATT
CCTGCGGGGA ACCGCCAGAA CAGAACAACC ATCACACCGT TTCGTCTGCT CTGCCGGGGA
TGCATCGGCG GCCTGGTGAC CCTCCGCTGG TCAACGCTGC AACAACTGAA CCTGCCGGCA
TCCAGCGGTT CGCTGCATGC CCTGCTGCTG GATCTGGCGC TGCAGGTTTG CCGCCGTGGA
GATGCCGTGG CCCACTGCCC GGAGGTTTTG CTTCAGCGAT GCATCCGGGC GAACCCGACC
GTGCCCGACG TTGCATCCCC GGCCGACCGC CACTGTTGGA CCCCTGAGCT CAGCGATGAG
ATCCTGGCGA TCACCCAGCG ACACAGCCCA GGCTTTCTGG AGCTGGGTGG GGAGCTCACT
TCCTCTCAAT CGCTCAACGC CTGCCATCAG TTAAGGCTCC GCACCGATCC CAGTGTTCTG
GTGTCGGTTC TGATCCCGTT TCGCGACCGG GTGGATCTCA CCCAGAGCTG CGTCGCCTCC
CTGCGCCGCT GTGCCGGAGC TGTGGCCTAC GAGCTGATCC TGATCGACAA CGGCAGTGAG
GAGGCCGCCA CCCAGGACTG GCTGGATGAA CAGGCGCAGC TCGATGACGT GTGCGTGGTG
CGAGTGGACG AACCGTTCAA CTATTCGCGG CTCAACAACA TCGGCAGACG CCATGCCCGT
GGCAGCCATT TGTTGCTGCT GAACAACGAC ATCGAATTCC GATCCGCCGA GGTGCTCCAG
GCCCTGCTGG ACCCATTCGC CTACCGCGGC ACCACTGCCG TGGGAGCGAA GTTGCACTAC
CCCGACGGGA GTATCCAGCA TCAGGGGGTC GCCCTGGTGA AGGGGGAACG GCGCTGTGTG
GTGGAACCTG GCAAACACCT GCACAGCGCT CCGGTGCTGG CCACCCTCAC ACCCCTGCTC
CTGCAGGAGG AGTTCACCGC AGCAACCGGA GCCTGCCTGA TGCTCCGCAG CAGTGATTTC
GATGGCATTC AAGGCTTCGA TGAAAATCTC GCCGTGGTGT TCAACGATGT GGATCTCTGC
CTGCGGCTCC GCGCGCAGGG CGGATCAATA GTCGTGACCC CCTATCTGGA GATCGTTCAT
CACGAGTCGA TCAGCCGCGG CAAGGATCGG GAAGGTGCTG CACTGGCCCG CCATCAACGG
GAATCCGGTC AGCTAAGGGC CAAGCACGCC GGGCTGTTTG CCGCAGGCGA TCCCCTCAGC
AGTCAGCGGA TTCATCCCCA CAGCAATCGC TACCAACCCC GGGAACCCGC GCCGCGCTCA
AAAGGTCCCG TGGCCAATGC CGTGCTCATG CACTGGAGGG ATCCGAACTT CCAACCCAAC
CGTCAACGAC CGATCGTTGT GCTCGCGCAT TTTTCAGCCG ACAACCGGTT TCGGGATGAC
CTCTTCCCTC TAATTGACGA GTACCAGCGC TTCGCCGACG TGATTGTTGT GTCGTCAGCC
AGCGGGGTGC GCTGGCACCC GAGAACCCTG CATCGGCTGC GCCAACGCTG TGCCGCGATC
GTTATCCGGC GCAACCAGGG GTATGACTTC GGCAGCTGGA AGGCAGCACT CAACCTTCAC
CGGCAGGACA TTGATCAAGC AGCGTTCCTG GTGCTCACCA ATGACAGCTT CTGGGGACCG
ATCGCCCCCC TCGACGATCT GTTCCAACGC CTGCAGGCCA GCAGAGCGGA TGTGATTGGG
CTGACGGACG ATTTGATGTA CGAACCGCAT CTGTCGTCCG CCTTCACGGC TTACAAACCC
AAGGCCTTGC AGAGCCAAGC CTTCAACAAC TTCTGGAATT CTCTACAGAT CTGGCCCCGC
AAACGGGACC TGGTCAAACA ATGCGAAGTG GGCCTACCGG TGCAGCTGCG GGCAGCGGGG
GTGAAGCTCG AGAGCCTCTA TACCCACAAC GCCAATGGCA ATGTTCTCCA CTACGACTGG
AAGCACTTGA TTGAGCAAAG CGGCTTCCCT TTCCTCAAGG TGAGCCTGCT GCGGGACAAC
CCCACAAAGC AACCCGTCGA CACCTGGCCC GAGGTGATCG GACAACGCAA CCCTCAGCTA
GCGGCCAGCA TCGAACGCCA ACTGCAGTCG AGGACAGGGT TACGACGACT GCTGGAACGG
CTGCGCCATC GACTGAATGG AAGTGGTCGC AATGGGTCTC GTGCTGTAAT AGCGCCGACT
TCACACCGGT GA
 
Protein sequence
MTGCSYDQGV EPASGQDNST LDAQAWQRWR SGRASAEEIE RWQQQVQQQL PLLPQQLLDP 
SLLPIALLSN PSQWSPEDSG LDPIALLACH QDLTDPKQLL SSGRLGEIAL GQRSLFTDLP
PVHLQAYRDG LRPAACDDSL SALEHLGGIG RQRFQQGQPL GMDLDRYQPP LPDPAPSGSI
PSAATVLVVL HPTQEEAEAE SARSSPPEGW SQIRHASLED PSGWIGAPFP DGETLVSFCH
ASDQLDPQAA LRMAHCAAQH PEAVLLTSDE TLRWSEDPGI PAGNRQNRTT ITPFRLLCRG
CIGGLVTLRW STLQQLNLPA SSGSLHALLL DLALQVCRRG DAVAHCPEVL LQRCIRANPT
VPDVASPADR HCWTPELSDE ILAITQRHSP GFLELGGELT SSQSLNACHQ LRLRTDPSVL
VSVLIPFRDR VDLTQSCVAS LRRCAGAVAY ELILIDNGSE EAATQDWLDE QAQLDDVCVV
RVDEPFNYSR LNNIGRRHAR GSHLLLLNND IEFRSAEVLQ ALLDPFAYRG TTAVGAKLHY
PDGSIQHQGV ALVKGERRCV VEPGKHLHSA PVLATLTPLL LQEEFTAATG ACLMLRSSDF
DGIQGFDENL AVVFNDVDLC LRLRAQGGSI VVTPYLEIVH HESISRGKDR EGAALARHQR
ESGQLRAKHA GLFAAGDPLS SQRIHPHSNR YQPREPAPRS KGPVANAVLM HWRDPNFQPN
RQRPIVVLAH FSADNRFRDD LFPLIDEYQR FADVIVVSSA SGVRWHPRTL HRLRQRCAAI
VIRRNQGYDF GSWKAALNLH RQDIDQAAFL VLTNDSFWGP IAPLDDLFQR LQASRADVIG
LTDDLMYEPH LSSAFTAYKP KALQSQAFNN FWNSLQIWPR KRDLVKQCEV GLPVQLRAAG
VKLESLYTHN ANGNVLHYDW KHLIEQSGFP FLKVSLLRDN PTKQPVDTWP EVIGQRNPQL
AASIERQLQS RTGLRRLLER LRHRLNGSGR NGSRAVIAPT SHR