Gene NATL1_21261 details

Gene Information

Locus tag: NATL1_21261
Symbol: clpX
ID: 4780921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1782853
End bp: 1784208
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 38%
IMG OID: 640085423
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_001015946
Protein GI: 124026831
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
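
The coordinate and length fields in the table above are consistent with one another, and the check can be sketched in a few lines of Python. This is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1782853
END_BP = 1784208


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 1356 451
```

Under those assumptions the result matches the listed Gene Length (1356 bp) and Protein Length (451 aa).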


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAT TTGAGGCCCA TCTTAAATGC TCCTTTTGTG GCAAAGCCCA AGATCAAGTG 
AGGAAACTCA TTGCTGGTCC AGGAGTTTAC ATATGCGATG AATGTATTGA CTTATGCAAC
GAAATTTTAG ATGAAGAACT GATTGATAAT CCAACCCATC AAAGAAATGG TCATGAACAA
AGCCGCAAAG CTAAAGCTGC CACAACGACA GCTAAACCAG CTCCAACCTT GGCATCCATC
CCCAAGCCAA TTGAAATTAA AAAGTTTTTA GATGCTCAAG TAGTTGGGCA AGAACCAGCA
AAAAAAATCT TATCGGTAGC TGTCTACAAT CACTACAAGA GACTTGCTTG GAAAGGTGAC
GGGTCTGGCG AGACTGATTT AACGGCAACA AAATTACAAA AATCTAATAT TTTATTAATT
GGACCAACTG GATGCGGAAA GACATTACTT GCCCAAACAT TAGCTGAAAT GCTTGATGTT
CCATTTGCTG TTGCTGATGC AACAACTCTT ACTGAAGCTG GATATGTTGG AGAGGATGTA
GAAAACATTC TTTTACGTCT CCTACAGAAA GCAGATATGG ACGTGGATTT AGCACAAAGA
GGAATTATTT ACATAGATGA AATTGACAAA ATTGCTAGAA AAAGTGAAAA CCCATCCATT
ACTAGAGATG TCTCTGGAGA AGGGGTGCAA CAAGCTCTTC TAAAAATGCT CGAGGGTACT
GTTGCCAATG TTCCTCCTCA GGGAGGAAGA AAGCATCCTT ACGGTGACTC TATTCAAATT
GATACGAGCC AAATACTCTT TATATGTGGT GGAGCATTTG TTGGCTTAGA TGACGTAGTG
GAAAAAAGAC TTGGAAAAAA TTCAATTGGT TTCATACAAA ATGAAAACAG GACAAGAACA
AAATCTAATA GAGATCGTGT TGGTGCAGAT CTAATTAATG ACCTTGAGCC CGACGACCTA
GTGAAATATG GTCTGATTCC CGAATTTATT GGAAGAATGC CTGTCAGTGC AATATTAGAA
CCACTAAATG CTAAAGCACT TGAGTCCATT CTCACGGAAC CAAGAGATGC TTTAGTAAAA
CAATTTAGAA CTTTATTAAG TATGGATAAT GTAGAACTTT CATTTGACGA GGATGCTGTT
GAGGCAATTG CGCAAGAAGC TTATAAAAGA AAAACCGGAG CTAGAGCGTT GCGAGGAATT
GTTGAAGAAA TCATGCTAGA TCTAATGTAT AGTCTTCCAT CTCAAACTAA AATTAAAAAT
TTCAACGTAA CAAAGAAAAT GGTTGATGAA AGTACTGGTG GTAAAGTTGT TCCTCTCTTA
TCAAATGAAA AAAGAATCGT TAAGGAATCT GCCTAG
 
Protein sequence
MAKFEAHLKC SFCGKAQDQV RKLIAGPGVY ICDECIDLCN EILDEELIDN PTHQRNGHEQ 
SRKAKAATTT AKPAPTLASI PKPIEIKKFL DAQVVGQEPA KKILSVAVYN HYKRLAWKGD
GSGETDLTAT KLQKSNILLI GPTGCGKTLL AQTLAEMLDV PFAVADATTL TEAGYVGEDV
ENILLRLLQK ADMDVDLAQR GIIYIDEIDK IARKSENPSI TRDVSGEGVQ QALLKMLEGT
VANVPPQGGR KHPYGDSIQI DTSQILFICG GAFVGLDDVV EKRLGKNSIG FIQNENRTRT
KSNRDRVGAD LINDLEPDDL VKYGLIPEFI GRMPVSAILE PLNAKALESI LTEPRDALVK
QFRTLLSMDN VELSFDEDAV EAIAQEAYKR KTGARALRGI VEEIMLDLMY SLPSQTKIKN
FNVTKKMVDE STGGKVVPLL SNEKRIVKES A
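
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before they can be analysed. The following Python sketch shows one way to do that and to translate the cleaned gene sequence. It is a minimal illustration, not the database's own tooling: the helper names clean_sequence, translate and gc_fraction are hypothetical, and the codon table is built by hand from the amino acid assignments of NCBI translation table 11 (alternative start codons are not handled).

```python
# Amino acid assignments for NCBI translation table 11, with codons
# ordered by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the listing.
first_bases = clean_sequence("ATGGCAAAAT TTGAGGCCCA TCTTAAATGC")
print(translate(first_bases))  # MAKFEAHLKC
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the whole gene sequence through clean_sequence and gc_fraction should give a value that can be compared with the GC content listed in the Gene Information table.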