Gene Syncc9902_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1847 
Symbol 
ID3742140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1779445 
End bp1780902 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content53% 
IMG OID637772042 
Productputative neutral invertase-like protein 
Protein accessionYP_377848 
Protein GI78185413 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.570169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGACGC GATTTACAGA AGAGAGTCAG CGATTTCGTC CCAGTTCCAA AGAAGACCAA 
GTGGTCCAAA AGGCCCAAGA GCACTTCGAA CGCACCCTCA TATCGATTCA GGGACAACTC
GCCGGAAGCG TTGCAGCTCT TGAAAGTAGT TACGCCGATT CGGAGCTGAA CTACGGCGAA
ATCTTTGTCC GAGACAACGT CCCGGTGATG ATTTATTTGC TGGTACAGGG ACGCTTCGCG
ATCGTGAAGC AATTTCTGAA GGTTTGCCTC GACCTCCAGA GCACGAGTGT CCAAACCCGT
GGGGTTTTTC CGACAAGTTT CGTTGAAGAA GAGGGGAATC TGGTTGCTGA TTACGGCCAG
CGCTCTATTG GGCGGATCAC CTCTGTGGAT CCAAGCCTGT GGTGGCCGAT CCTTTGTTGG
ATTTACGTCA AAAGAAGTGG CGACACTGAT TTTGGGCGGA GCCCAGAAGT GCAGCGCGGA
ATCCAACTCC TTCTTGATCT GGTACTGCAC CCCAGCTTTG AAGGAACACC TGTGCTGTTC
GTACCGGACT GCGCCTTCAT GATCGATCGT CCGATGGACG TTTGGGGCGC ACCACTGGAA
GTGGAAGTGT TGTTGTACGG CGCACTACGA AGCTGCGTTG AACTCATGGA GCTTTGCCAA
CGCCACGACA CCAGCGCACT CCTGGCAGAG CGTCTCCGCC TAAGTCGCAA ATGGACCCAT
GACCTGCGGC AATTTCTGTT GAAGCACTAC TGGGTCACCA GCAAAACCAT GCAGGTTCTC
CGGCGTCGTC CCACGGAGCA ATACGGAGAC AACCAGCACC AAAACGAATT CAATGTTCAA
CCTCAGGTGA TCCCTGATTG GCTTCAGGAC TGGCTTCAAG ATCGAGGTGG ATACCTCATC
GGCAACATTC GAACTGGCAG GCCGGACTTC CGTTTCTACA GCCTGGGCAA TTCGCTCGCC
TCGATGTTTG GACTGCTCAC GGCACCACAA CAACGGGCCT TATTTCGGCT GGTGCATCAC
AACCGTGATC ACCTCATGGC ACAAATGCCA ATGCGAATCT GCCATCCCCC AATGGCAGGG
GTGGAGTGGG AAAACAAAAC GGGGTCCGAC CCTAAAAACT GGCCTTGGAG TTATCACAAC
GGTGGGCATT GGCCCAGCCT GCTTTGGTTT TTTGGATCAT CAATCCTTCT CCATGAACGG
CTGCATCCCA ATGCTGACGT GTTGTTAATG AGTGAAATGA CCACACTCCT CGACGAGTGC
TACTGGAGCC ATCTCAACCA ACTCCCGCGG CAACAGTGGG CTGAATATTT CGATGGGCCA
ACGGGAACAT GGGTGGGACA ACAATCAAGG ACATTTCAAA CCTGGACCAT CGTGGGGTTC
CTCTTAACCC ACCATTTCCT CCGAGTGAAT CCCGATGACG TCTTAATGCT GAATCTGGAT
GCTGGCCTCG GCCGCTAA
 
Protein sequence
MGTRFTEESQ RFRPSSKEDQ VVQKAQEHFE RTLISIQGQL AGSVAALESS YADSELNYGE 
IFVRDNVPVM IYLLVQGRFA IVKQFLKVCL DLQSTSVQTR GVFPTSFVEE EGNLVADYGQ
RSIGRITSVD PSLWWPILCW IYVKRSGDTD FGRSPEVQRG IQLLLDLVLH PSFEGTPVLF
VPDCAFMIDR PMDVWGAPLE VEVLLYGALR SCVELMELCQ RHDTSALLAE RLRLSRKWTH
DLRQFLLKHY WVTSKTMQVL RRRPTEQYGD NQHQNEFNVQ PQVIPDWLQD WLQDRGGYLI
GNIRTGRPDF RFYSLGNSLA SMFGLLTAPQ QRALFRLVHH NRDHLMAQMP MRICHPPMAG
VEWENKTGSD PKNWPWSYHN GGHWPSLLWF FGSSILLHER LHPNADVLLM SEMTTLLDEC
YWSHLNQLPR QQWAEYFDGP TGTWVGQQSR TFQTWTIVGF LLTHHFLRVN PDDVLMLNLD
AGLGR