Gene Syncc9902_1061 details


Gene Information

Locus tag: Syncc9902_1061
Symbol:
ID: 3742484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 1022496
End bp: 1024424
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 53%
IMG OID: 637771237
Product: peptidase M41, FtsH
Protein accession: YP_377069
Protein GI: 78184634
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
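
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1022496, 1024424

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1929
print(protein_length(length_bp))  # 642
```

Both results match the Gene Length (1929 bp) and Protein Length (642 aa) fields.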


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.498037
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAGC GCTGGCGCCT CCTTGCCCTA TGGCTTTTGC CGATTGGCGT TGTGCTGCTG 
ATCGGCTGGC AGGTCCTTAG CAACGGCGGG ATGAATGGCA TGAGCCAAAA CAACGGTGGT
GGCAACAGCC CCACGGTTGC TCCGCGTAAC GCCGCTGTCG CGAGGATGAG CTACGGCCGT
TTCCTGGATT ACGTGGAAGC AGGTCGCATC ACGGCTGTTG ATATCTACGA CGGTGGTCGT
AATGCCGTAG TTGAAGCTGT CGATCCAGAT TTAGATAACC GCGTCCAGCG TTTACGCGTC
GATCTCCCAG GTTTAGCCCC TGAGTTGATC AACACCTTGA AGGAAGAAGG CATCAGCTTT
GATGTGCACC CCCCCAAGAG CACTCCGCCA GCGCTGGGTC TTTTGGGCAA TTTGTTGTTC
CCATTGCTGC TGATCGGATC CCTGATCTTT TTGGCTCGCC GCAATAGCAA CATGCCCGGC
GGCCCTGGTC AGGCCATGCA ATTTGGCAAG AGCAAGGCCA AGTTCATGAT GGAAGCCGAG
ACCGGCGTGA TGTTCGACGA CGTTGCTGGC GTCACAGAAG CCAAGCAAGA ACTCGAAGAG
GTTGTCACGT TCCTGAAGCA GCCTGAACGC TTCACATCTG TCGGAGCTCA AATCCCTCGG
GGACTCCTAC TTGTCGGACC TCCAGGTACT GGCAAAACGT TGTTAGCCAA AGCCATTGCC
GGAGAAGCTG GAGTTCCGTT CTTTGCCCTG TCGGGTTCTG AGTTTGTTGA AATGTTTGTT
GGCGTTGGTG CCAGCCGTGT GCGCGATCTG TTCAAAAAGG CCAAAGAAAA CAGTCCTTGC
TTGATTTTTA TTGATGAGAT TGATGCCGTT GGTCGTCAAC GCGGCGCTGG TGTCGGCGGT
GGCAACGACG AGAGAGAACA AACTCTCAAC CAACTCCTCA CAGAAATGGA TGGTTTCGAA
GGAAATAATG GAATCATCAT TATCGCGGCA ACAAACCGTC CTGATGTTCT TGATTCGGCG
TTATTGCGTC CAGGTCGATT TGATCGTCAA GTCACTGTTG ATGCACCAGA TATCAAAGGT
CGCCTTGCCA TTCTTGCTGT CCACTCTAAA AATAAGAAAC TCGATGGTGA ACTATCACTT
GAAAGTATTG CTCGACGTAC TCCTGGTTTC ACGGGTGCAG ATCTCGCCAA CTTGATGAAC
GAGGCGGCAA TCTTGACGGC ACGCCGCAGA AAGGAATCAA TTGGTCTGAG TGAAATTGAT
GATGCTGTTG ACCGCATCAT CGCTGGCATG GAAGGTCGTC CTCTCACCGA TGGACGCAGC
AAGCGGCTAA TTGCTTATCA CGAAGTGGGT CATGCACTTA TCGGAACTCT GGTCAAAGCC
CATGACCCTG TCCAAAAAGT CACCCTTGTG CCCCGCGGCC AAGCCCAAGG CCTGACCTGG
TTCTCTCCGG ACGAAGAGCA AACCCTAGTG ACCCGCGCCC AGCTCAAGGC ACGCATCATG
GGTGCCCTGG GTGGTCGTGC TGCCGAAGAC GTGGTGTTTG GTTCCCAGGA AATCACCACC
GGTGCCGGTA GCGACATCCA ACAGGTCGCC TCGATGGCCC GCAATATGGT GACGCGGCTT
GGCATGAGTG ATCTTGGACC TGTTGCCCTT GAGGGCGGTG GTCAGGAAGT CTTCCTTGGT
AGAGATTTGA TGTCTCGCAG TGAAATTTCC GAATCGATTT CTCAACAGGT GGATACACAA
GTTCGCAGCA TGGTTAAGCG CTGTTATGAA GAAACTGTTG CCCTCGTTGC AGCCAACCGA
GAAGCAATGG ACCAATTGGT TGAAATATTG ATTGAGAAAG AGACAATGGA TGGCGACGAA
TTCAAATCAA TCGTTGCTGA ATTCACTTCA GTCCCTGAAA AGGACCGCAC AGTGCCAATT
CTGAACTGA
 
Protein sequence
MNQRWRLLAL WLLPIGVVLL IGWQVLSNGG MNGMSQNNGG GNSPTVAPRN AAVARMSYGR 
FLDYVEAGRI TAVDIYDGGR NAVVEAVDPD LDNRVQRLRV DLPGLAPELI NTLKEEGISF
DVHPPKSTPP ALGLLGNLLF PLLLIGSLIF LARRNSNMPG GPGQAMQFGK SKAKFMMEAE
TGVMFDDVAG VTEAKQELEE VVTFLKQPER FTSVGAQIPR GLLLVGPPGT GKTLLAKAIA
GEAGVPFFAL SGSEFVEMFV GVGASRVRDL FKKAKENSPC LIFIDEIDAV GRQRGAGVGG
GNDEREQTLN QLLTEMDGFE GNNGIIIIAA TNRPDVLDSA LLRPGRFDRQ VTVDAPDIKG
RLAILAVHSK NKKLDGELSL ESIARRTPGF TGADLANLMN EAAILTARRR KESIGLSEID
DAVDRIIAGM EGRPLTDGRS KRLIAYHEVG HALIGTLVKA HDPVQKVTLV PRGQAQGLTW
FSPDEEQTLV TRAQLKARIM GALGGRAAED VVFGSQEITT GAGSDIQQVA SMARNMVTRL
GMSDLGPVAL EGGGQEVFLG RDLMSRSEIS ESISQQVDTQ VRSMVKRCYE ETVALVAANR
EAMDQLVEIL IEKETMDGDE FKSIVAEFTS VPEKDRTVPI LN
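
Both sequences are printed in space-separated blocks of 10 characters, so the blocks have to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be joined, translated, and compared with the protein sequence. It is not the database's own tooling, and the helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares for elongation codons. Only the first line of each sequence is used here to keep the example short.

```python
# Build the codon table from the conventional TCAG-ordered amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unrecognised codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_line = "ATGAATCAGC GCTGGCGCCT CCTTGCCCTA TGGCTTTTGC CGATTGGCGT TGTGCTGCTG"
protein_blocks = "MNQRWRLLAL WLLPIGVVLL"

dna = join_blocks(gene_line)
print(len(dna))                                     # 60
print(translate(dna))                               # MNQRWRLLALWLLPIGVVLL
print(translate(dna) == join_blocks(protein_blocks))  # True
```

Passing the full gene sequence to `join_blocks` and then to `gc_fraction` gives a value that can be compared with the GC content field in the record.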