Gene Syncc9902_1678 details

Gene Information

Locus tag: Syncc9902_1678
Symbol:
ID: 3743917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 1624646
End bp: 1626001
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 54%
IMG OID: 637771870
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_377680
Protein GI: 78185245
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
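
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1624646
end_bp = 1626001
gene_length_bp = 1356
protein_length_aa = 451


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Each flag is True when the record's fields agree with each other.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under those assumptions, both checks agree with the listed values: the coordinate span is 1356 bp, and 1356 / 3 = 452 codons, one of which is the stop codon, giving 451 aa.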


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCCTG CTGCCACCAA ATCCGCGAAA CCGGACATCG TTCTATTGGC GAACGCCGAA 
GGACAGGTGA AGAATGTGGT GCCGACTGCC AAGGGCGCAA CAAAAAAAGC TCCGGCACGG
AAACGGAAGG CAAGCTCAAG CGCTGAGGAT TTGAATGCGG CAGCAGACGA ACTTCTTGCC
AAGGCAGACG GCAGCACCAA GACATCAGAA GGGAAAACAA AAAAAGCAGC AACAAAGGCG
ACGTCAGTGA AAAGCGCGGC CAAGAAAACA ACAGCCAAGA AAGCAGCTAC ACCAAAAGCC
AAAACTGCGG CACCAGAACT CACAGCTGAG GAAAAAGCCA AAGCGGCCAG TGCAGAAAAA
GAAGCCAAAG CAAAGGCTTT GGCCAGCATC AAAATCGGGC CGAAGGGGGT CTACACCGAA
GATTCGATTC GTGTGTACCT GCAGGAAATC GGACGCATTC GCCTCTTGCG ACCTGACGAA
GAGATCGAAC TGGCCCGCAA AATTGCCGAC CTTCTTTATC TAGAAGAGCT GGCCGCACAA
TTTGAAAGCG ATAACGGTCG TGAGCCCGAC AAAAAGGAGT GGGCAGCACT CGTTGAGATG
CCGCTCATTC GCTTCCGCCG GCGGTTGATG CTGGGCCGAC GAGCCAAGGA AAAAATGGTG
CAGTCCAACC TGCGCCTTGT GGTGTCGATT GCCAAGAAAT ACATGAATCG AGGCCTGAGT
TTCCAGGACC TGATTCAAGA GGGCAGCCTT GGCCTGATCC GCGCCGCTGA AAAATTCGAT
CACGAAAAGG GCTACAAGTT CTCCACCTAC GCCACCTGGT GGATTCGTCA GGCGATCACA
CGGGCGATCG CCGACCAAAG CCGGACGATT CGCCTACCTG TTCACCTTTA CGAAACGATC
TCGAGGATCA AGAAGACCAC CAAGGTTCTG TCCCAAGAAT TCGGTCGGAA GCCAACAGAA
GAGGAAATTG CGGAATCGAT GGAAATGACC ATCGAGAAGC TGCGGTTCAT CGCCAAGAGC
GCTCAATTGC CGATCTCCCT TGAAACCCCG ATCGGGAAAG AAGAAGATTC CCGCTTGGGC
GACTTCATCG AAGCCGACAT CGAAAACCCC GAACAGGACG TTGCCAAAAA TCTTCTGCGT
GAGGATCTTG AAGGCGTGCT GGCTACCCTC AGCCCCCGCG AGCGCGATGT ACTGCGCTTG
CGCTACGGCC TGGATGACGG TCGCATGAAA ACCCTCGAAG AGATTGGCCA GATTTTTGAC
GTAACCCGAG AGCGGATTCG TCAAATTGAA GCGAAGGCCC TTCGCAAATT GCGGCATCCC
AACCGCAATG GTGTTCTCAA GGAATACATC AAATAA
 
Protein sequence
MTPAATKSAK PDIVLLANAE GQVKNVVPTA KGATKKAPAR KRKASSSAED LNAAADELLA 
KADGSTKTSE GKTKKAATKA TSVKSAAKKT TAKKAATPKA KTAAPELTAE EKAKAASAEK
EAKAKALASI KIGPKGVYTE DSIRVYLQEI GRIRLLRPDE EIELARKIAD LLYLEELAAQ
FESDNGREPD KKEWAALVEM PLIRFRRRLM LGRRAKEKMV QSNLRLVVSI AKKYMNRGLS
FQDLIQEGSL GLIRAAEKFD HEKGYKFSTY ATWWIRQAIT RAIADQSRTI RLPVHLYETI
SRIKKTTKVL SQEFGRKPTE EEIAESMEMT IEKLRFIAKS AQLPISLETP IGKEEDSRLG
DFIEADIENP EQDVAKNLLR EDLEGVLATL SPRERDVLRL RYGLDDGRMK TLEEIGQIFD
VTRERIRQIE AKALRKLRHP NRNGVLKEYI K