Gene A9601_10151 details

Gene Information

Locus tag: A9601_10151
Symbol:
ID: 4717726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 876698
End bp: 878452
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 32%
IMG OID: 640078730
Product: cell division protein FtsH4
Protein accession: YP_001009406
Protein GI: 123968548
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
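
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(876698, 878452)
print(length)                     # 1755
print(protein_length_aa(length))  # 584
```

Both results agree with the Gene Length and Protein Length fields listed in the record.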


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.745337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTAGAT CAAAATTCTC ATATACAGAT TCTAAATCAA GTTATTCCGA TCTTTTAGAA 
GATATAGAGA CGGGGAAAAT AGAATCAATA TTTTTCTATC CAAGGCAGAG AGAAATTGAT
GTTCTGTATA TAAATGGCGA CAAATTTAAA ATACCTATCC TTTACAACGA TCAATTAATC
CTTGAAAAGG CTACCGAAAA TAAGGTAGAT CTAACTATTA ACAATAGTAG AAAAGAAGCC
TCAGCTGCTA ATTCATTTGC TTCAATAAGT CTTTTCCTGA TTTTCATATT AGCTATAGTC
TTAATCTTGA GGAGTACATC AAAATTGGCT TCCAGAGCTT TTGGTTTTAC CAAAAATCAA
GCTAAATTTT TAACTATTGA TGATGTAGAA ACGAGATTCG ATGATGTAGC TGGTGTCCCT
GAAGCCGCTG AGGAATTAAA AGAGGTAATA ACATTTTTGA AAGAACCAAA GAAATTTGAA
AATCTTGGAG CAAAAGTTCC TAAGGGAGTT CTTCTAATAG GCCCACCAGG AACTGGTAAA
ACATTATTAG CTAAAGCAAT TGCTGGTGAA TCAGGAGTGC CTTTTCTCTC AATATCGGCA
TCAGAGTTTG TAGAACTTTT TGTTGGTGTT GGAGCAAGCC GAGTTCGAGA TTTGTTCTCT
AAAGCTAAGG AAAAATCTCC TTGTATAATT TTCATTGATG AAATTGATTC CATTGGTAGG
CAAAGAGGGT CTGGGATCGG AGGAGGAAAC GATGAAAGAG AACAAACCCT TAATCAGCTT
CTAACTGAAT TAGATGGTTT TGCTGATAAT TCTGGGATTA TCGTTTTAGC AGCAACAAAT
AGACCAGATA TTTTGGATTC AGCATTATTA AGACCAGGTA GATTTGATAG GAAAATTGAA
GTAATGCTTC CAGATTTAGA TGGAAGAAAA AAAATTCTTT CAGTTCACTC ACTTTCCAAA
CCACTTTCAA ACGAAGTTGA CTTAGGATAC TGGGCTTCTA GAACAGTTGG ATTTTCGGGA
GCAGATCTTG CAAACTTGAT GAACGAGAGT GCTATTCACT GTGCAAGAGA CGAATCTAAA
TTAATCAGTG ATCTTCATAT AGAAAATGCG CTTGATAAAA TTACTATTGG ACTGAGAAGC
TCATTAATAA CTTCTCCTAA TATGAAAAAA ATTATTGCTT ATAATGAAGT CGGTAGAGCA
ATTGTATCTG CTGTGAGAAA TGGAATTGAA TCAGTTGATA AAATTACGAT TTTACCTAGA
TCTGGATCTA TAGGAGGATA TACAAAAATA TGCCCTGACG AAGATGTAAT TTCAAGCGGA
TTGATTTCAA AAAAATTGTT ATTTTCAAAA ATTGAAATTG CTCTAGCTGG AAGAGCAGCA
GAAACGATTG TTTTTGGTGA AGGTGAAATT ACACAATGTT CTGTAAATGA TATCTCTTAT
GCGACAAATA TCGTAAGGGA AATGGTTACA AAATATGGAT TTTCAATTAT TGGTCCAATT
TCAATGGATT CTGATAATAA TGAAATGTAT TTAGGAGATG GATTATTTAG AAGAAAGCCT
CTGATAGCAG AAAATACCAG TTCTAGAATA GATAATGAAA TCATAAATAT TTCTAAAATT
TCATTAAATA ATTCAATAAA AATATTGAAA AAAAATAGAG TCTTACTAGA TAAATTAGTT
GACATACTTT TAAATCAAGA AACTATAGAT AAAGAAGTTT TTAAATCAAC AACTTCTAAA
TTGTTGAAAG TTTGA
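
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal Python sketch, not part of the original record, for recomputing a GC percentage from such a block-formatted sequence. It assumes that whitespace is purely layout and that only the letters A, C, G and T occur. The function name `gc_percent` is hypothetical.

```python
def gc_percent(blocked_sequence: str) -> float:
    """GC content (in percent) of a sequence printed in whitespace-separated blocks."""
    # Remove spaces and line breaks, then normalise case.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100 * gc / len(bases)


# First two blocks of the gene sequence above: 6 of 20 bases are G or C.
print(gc_percent("GTGTTTAGAT CAAAATTCTC"))  # 30.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field in the Gene Information section.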
 
Protein sequence
MFRSKFSYTD SKSSYSDLLE DIETGKIESI FFYPRQREID VLYINGDKFK IPILYNDQLI 
LEKATENKVD LTINNSRKEA SAANSFASIS LFLIFILAIV LILRSTSKLA SRAFGFTKNQ
AKFLTIDDVE TRFDDVAGVP EAAEELKEVI TFLKEPKKFE NLGAKVPKGV LLIGPPGTGK
TLLAKAIAGE SGVPFLSISA SEFVELFVGV GASRVRDLFS KAKEKSPCII FIDEIDSIGR
QRGSGIGGGN DEREQTLNQL LTELDGFADN SGIIVLAATN RPDILDSALL RPGRFDRKIE
VMLPDLDGRK KILSVHSLSK PLSNEVDLGY WASRTVGFSG ADLANLMNES AIHCARDESK
LISDLHIENA LDKITIGLRS SLITSPNMKK IIAYNEVGRA IVSAVRNGIE SVDKITILPR
SGSIGGYTKI CPDEDVISSG LISKKLLFSK IEIALAGRAA ETIVFGEGEI TQCSVNDISY
ATNIVREMVT KYGFSIIGPI SMDSDNNEMY LGDGLFRRKP LIAENTSSRI DNEIINISKI
SLNNSIKILK KNRVLLDKLV DILLNQETID KEVFKSTTSK LLKV