Gene Syncc9902_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2034 
Symbol 
ID3742994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1941817 
End bp1943634 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content42% 
IMG OID637772231 
ProductTPR repeat-containing protein 
Protein accessionYP_378035 
Protein GI78185601 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.651462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCAA CCAACCAACC ACTTCAAGCT CTCGAAGCTT CCATACAATT AGCCCGCCAA 
AACTATAAAG AAGCGAAAAT CGACTCAGCC CTCCAACACA TCAAAGAAGG GCTACAGATT
GATGAGCACA ATGCCCAATT ACTCGAGATT GCTGCCAGCA CTCATCTTCG TTTCAATAAT
AACAATGAAG CCATTAAATA TGCTCAAGAA TTAATCACAC ACCATCCAAG AAATCCGCAT
GGATACAGCA GATACGCACA AGCCCTGCTG CAACAGAAAC AACCTTTTAA TGCCAATCAA
ATGGCACTTA GCGGATTACA AGAAGCTCCA CACAATCAAC AGCTCCTGGC ATTAACTAGC
GAATCTTATC AAGCGCTAGA GCGCTGGGAT CAATCACTAG AACTGGCAAA TCAACTTATT
GAATTCCATC CAAGCCTGCA CATGGGATAC CTCAAGGCAG CACAAAACCT TCTAAAACTC
AACAAACCCA ACGAGGCTTC TTCTATCGCA GAACGAGGCT TAAAAGCAAA GCCGAACCAC
CCTCATCTTC ATAGCATTGC CAGCGAAGCC TACCGAGCCA GACACCTACA CAATCACTCA
CTCAATCACG CAAAAGCCTT AATTGAAAAT CAACCAGACT CCATCGAAGG GCATAAACGA
GCAGCCCAAG ACTTACTGAA ACTAGGGCTA AGGAATGAGG CAATTTTAAT CATCGAGCAA
CTCATTCAAC GAAACAAGTC CAAGAAAGCC GCGGCAATGG CCAGCAAGCT CTTCAAAGTA
GCCGGACAAA CATCAAGAAG CCTCGTTCTT ACAACACAAC TCGCCAAAGC AAACGATGCA
ACAGATGATG ATCAACGGCA ATGGATTAGC AATTTATTCT CATGTAGACA GATTGATCTT
GCACTATCTA AAATTGAATG CAACAACCCA ATCGATGCAG AAATACAGAG CAATACTCTT
CGAGCTCTCC TCAGCCAACC ACTCAATACA TTTAAGAAAC TATCAAAGCA TGAGCGGAAC
TTAATCAGAG AATACGATAT CTATTATCAC CTATCTCTTC CTAACTTTAA TCCAACTCTG
GAAGATTTAG AACAACGGCT CAAATCCACC AACAAGATCA TCCTACTTGT GGTGCATGTT
GGGAAATGCG CAGGCGAATC AATCATCACC GCACTAGAAG AAACCTTCAC CTCAAACGAA
GTGGAAGTCA TCGAATATCA CACTTTTGAT AGCAATATGC TGATCAGAGA AAGTCTTCCA
TTGCTCCACA AGCATTCCGA TCGAATCCAC ATTGTGACTT GCACTCGCAA TCCTGTGGAT
CGCTGGATAT CTGCTTTCAA CTGGGACTAC CACACATTTT TCTTATCAAG CCAATTCTAT
TGTCCAGACC ACATCATTCA ATTACATCGC CAATACTTCT CCGCCCTAGA ATTAACCAAT
GGCTTGATGC GGAAGGAAAT AGAGGCCCAT GAATTGGCCA CATTCAAACA CCTTGCCTAT
GGACACATGG CGAAAGGAAT CTCCTGGTAC CTACCAGAAG AAATTATCGA CAATCTTCCT
AAGCAAGCAA TATCGACAAT CAATGTTGAA ACAATCCAAA ATGACTTCAA TCAATGCATT
CATACAATCA CAACAACTTT CAAACAAATA GGTAAACGGG AACCAACACC CATCCCAAAA
ACCAAGCAAA ATTATCAACA CTGGTACAAA TCAGGGGCAT TTAGTGCCGC CAGACAATTC
AGCGATGCAC AAAGACAGTT CCTCGAACAA TTCCTAAATG AAGATTACAA AGTAAACAAT
AAATTGAATA AGATTTAG
 
Protein sequence
MSSTNQPLQA LEASIQLARQ NYKEAKIDSA LQHIKEGLQI DEHNAQLLEI AASTHLRFNN 
NNEAIKYAQE LITHHPRNPH GYSRYAQALL QQKQPFNANQ MALSGLQEAP HNQQLLALTS
ESYQALERWD QSLELANQLI EFHPSLHMGY LKAAQNLLKL NKPNEASSIA ERGLKAKPNH
PHLHSIASEA YRARHLHNHS LNHAKALIEN QPDSIEGHKR AAQDLLKLGL RNEAILIIEQ
LIQRNKSKKA AAMASKLFKV AGQTSRSLVL TTQLAKANDA TDDDQRQWIS NLFSCRQIDL
ALSKIECNNP IDAEIQSNTL RALLSQPLNT FKKLSKHERN LIREYDIYYH LSLPNFNPTL
EDLEQRLKST NKIILLVVHV GKCAGESIIT ALEETFTSNE VEVIEYHTFD SNMLIRESLP
LLHKHSDRIH IVTCTRNPVD RWISAFNWDY HTFFLSSQFY CPDHIIQLHR QYFSALELTN
GLMRKEIEAH ELATFKHLAY GHMAKGISWY LPEEIIDNLP KQAISTINVE TIQNDFNQCI
HTITTTFKQI GKREPTPIPK TKQNYQHWYK SGAFSAARQF SDAQRQFLEQ FLNEDYKVNN
KLNKI