Gene A9601_02471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02471 
Symbol 
ID4716931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp229796 
End bp231649 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content38% 
IMG OID640077946 
Productcell division protein FtsH2 
Protein accessionYP_001008642 
Protein GI123967784 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACAAAC GTTGGAGAAA CGTAGGACTT TACGTTCTAG CTGTTATTAC TGTAATTTTC 
ATTGGTACCT CAGTTTTTGA TAAACCTAAT ACTGAAAGTT CTACAAAGAC CTTGAGATAT
AGTGATTTTA TAGAGGCAGT TCAAGATAAA GAAATCAGTA GAGTCCTAAT ATCTCCAGAT
AATGCCACAG CTCAAGTTGT TGAAAATGAT GGGAGCAGGT CTGAGGTCAA TTTAGCCCCT
GACAAAGATT TATTAAAAAT ACTGACTGAG AATAATGTAG ATATAGCTGT AACTCCTACA
AAATTAGCCA ATCCATGGCA ACAAGCTATA AGTAGCTTAA TTTTTCCAGT ACTTTTGATC
GGAGGCCTAT TTTTTCTTTT CAGAAGATCC CAAAGCGGTA ATGCTGGAGG TGGTAACCCT
GCCATGAGTT TTGGTAAAAG CAAAGCTAGA TTGCAAATGG AACCATCTAC ACAAGTAACC
TTTTCAGATG TTGCAGGTGT TGAAGGGGCA AAATTAGAAC TTACAGAAGT TGTAGATTTT
CTTAAGAGCC CAGATAGATT TACTGCAGTA GGAGCAAAAA TTCCAAAAGG AGTTCTTCTT
GTTGGCCCTC CTGGGACAGG AAAAACATTA TTAGCAAAAG CAGTAGCTGG AGAAGCAGGT
GTACCTTTTT TCTCAATATC TGGTTCAGAA TTTGTAGAGA TGTTTGTAGG AGTTGGAGCT
AGCAGAGTTA GAGATCTTTT TGAACAAGCT AAAAAGAATG CTCCTTGTAT TGTTTTTATT
GACGAAATAG ATGCAGTTGG AAGACAAAGG GGTGCTGGTA TGGGCGGAGG AAATGATGAA
AGAGAGCAAA CATTAAATCA ACTCCTAACT GAAATGGATG GTTTCGAAGG TAATTCAGGA
ATAATAATAG TTGCTGCCAC CAACAGACCA GATGTCTTAG ATTCAGCTTT AATGCGTCCT
GGAAGATTCG ATAGACAGGT AACAGTAGAT AGACCAGATT ATGCTGGAAG ATTGCAGATA
TTAAATGTTC ATGCGAAAGA TAAAACTCTT TCAAAAGACG TAGATTTAGA TAAAGTTGCT
AGAAGAACAC CAGGATTTAC TGGTGCAGAT TTAGCTAACC TCTTAAATGA AGCAGCAATA
TTAGCAGCTA GAAAAGATTT AGATAAAGTA AGTAACGATG AAGTCGGTGA TGCCATTGAA
AGAGTTATGG CTGGCCCAGA AAAGAAAGAT AGAGTCATCA GTGATAAGAA AAAAGAATTA
GTTGCTTATC ACGAAGCTGG TCATGCACTC GTTGGAGCAT TAATGCCTGA TTATGATCCA
GTAGCAAAAG TTTCAATTAT TCCAAGAGGT CAAGCTGGAG GTCTAACCTT CTTTACTCCA
AGTGAAGAAA GAATGGAATC TGGTCTTTAC TCACGTTCTT ACCTTCAAAA TCAAATGGCT
GTAGCTCTTG GTGGAAGAGT TGCTGAAGAA ATTGTTTATG GAGAAGAAGA AGTAACAACT
GGAGCTTCAA ATGATTTACA ACAAGTTGCT AATGTAGCAA GACAAATGAT CACTAAATTC
GGCATGAGTG ACAAAATAGG TCCTGTCGCT CTAGGTCAAT CTCAAGGTGG AATGTTTCTA
GGAAGAGATA TGAGCTCTAC AAGAGATTTC TCTGAAGACA CGGCCGCAAC AATTGATGTA
GAGGTTTCAG AACTTGTTGA TGTTGCCTAT AAGAGAGCTA CAAAAGTTTT ATCAGATAAC
AGAACAGTTC TAGACGAAAT GGCTCAAATG CTAATTGAAA GAGAAACTAT AGATACTGAA
GATATCCAAG ATTTGCTTAA CCGCTCAGAA GTAAAAGTCG CAAACTATAT TTAA
 
Protein sequence
MNKRWRNVGL YVLAVITVIF IGTSVFDKPN TESSTKTLRY SDFIEAVQDK EISRVLISPD 
NATAQVVEND GSRSEVNLAP DKDLLKILTE NNVDIAVTPT KLANPWQQAI SSLIFPVLLI
GGLFFLFRRS QSGNAGGGNP AMSFGKSKAR LQMEPSTQVT FSDVAGVEGA KLELTEVVDF
LKSPDRFTAV GAKIPKGVLL VGPPGTGKTL LAKAVAGEAG VPFFSISGSE FVEMFVGVGA
SRVRDLFEQA KKNAPCIVFI DEIDAVGRQR GAGMGGGNDE REQTLNQLLT EMDGFEGNSG
IIIVAATNRP DVLDSALMRP GRFDRQVTVD RPDYAGRLQI LNVHAKDKTL SKDVDLDKVA
RRTPGFTGAD LANLLNEAAI LAARKDLDKV SNDEVGDAIE RVMAGPEKKD RVISDKKKEL
VAYHEAGHAL VGALMPDYDP VAKVSIIPRG QAGGLTFFTP SEERMESGLY SRSYLQNQMA
VALGGRVAEE IVYGEEEVTT GASNDLQQVA NVARQMITKF GMSDKIGPVA LGQSQGGMFL
GRDMSSTRDF SEDTAATIDV EVSELVDVAY KRATKVLSDN RTVLDEMAQM LIERETIDTE
DIQDLLNRSE VKVANYI