Gene P9211_12591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_12591 
Symbol 
ID5731220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1132966 
End bp1134513 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content47% 
IMG OID641285628 
Producthypothetical protein 
Protein accessionYP_001551144 
Protein GI159903800 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.375416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00229564 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTAT TCCAGCAATT GCTGGTTTTC CCTGCTGCTT TGGGATTGGT TGCGCCTTTG 
GCTGCAAATG CAGCTGAAGT CAATATGACT GATGTTTCTA AGTATGCGGC GAAAACAGCT
AAAAGCATCA AAGCTCCTTC TAGTGCTCAA TTCTCAGACA TCGTTCCTGG GGACTGGGCC
TATACATCTC TTAAAAACCT AAGCGCTAGC TACGGTTGTG TAGATAATGC CTACACTCAA
AACCTTAATT CAGGTCTTGC TTTAACTCGT TATGAAGCTG CTGCATTAGT AAATGCATGC
CTTGATAACG GTCTGGTTGC AAGTGGTGAA GGTCTTTCTT CTGATGCTTC TCTTCTTGCC
GATGAGTTTG GCGTTGAGAT GGCAATTCTC AAAGGCCGTG TTGATGGACT TGAGTACAAG
CTTAATGAGC TTAGTGCTGG TCAGTTCTCA TCAACTACTA AGTTGGACGG AACGGTAGCT
TTCGTTGTAG GTGCTGTTGA CTATGAAAAC AGTGCTGATA CAGCAGTTGA TCATGGCGAC
AAGCTTACTG GTACTTACAG CTACAAGCTT GACTTGAACA CCAGCTTCAA TGGTAATGAC
CGCTTGCATG CAAGCATCAT GACCGGAAAC ATGGATGGAA ATAATCCATG GGGCGATAAA
GATGGTGGTA CTTACCTAGC TGTCGCTAAC GACAACGAGC AAGTTCTTGA GATAGACAAG
CTCTGGTATG AGTGGACCAA GGATGACCTT AAATTCTGGG CCGGTCCAAA GATCGAGAGC
AACCAGATGT TGGCATCTTC TCCATCTATC TATAAGCCTG TTCAGAAGCA ATTTGCCTTC
GGCGGTAATA CTGCTGCTTA TGCTTCAAGC ACAACAACAG GTTTTGGTGT CGCTTGGACA
CAGCCAACTG AGGCTGATAG AAAGTGGACA GTTAGTGCTA ACTACGCCTC TATTGGTGGT
GACGATGCTA CCAAGGGTAT CCTTACCGAT GAGCAAACTA AGTTCCTGAC TCAGGTTACT
TACGGTGGTC AGAGATGGCA GATCGCTGCT GCTGTTGCTC GCCATGGTTG CGCAGGCCAG
GATGCAAACA GCTCTTGTCA CGCATGGTCT GACCTCTATG CAACTGCTGC AGGCGACAAT
GCAACTGGAG AAGGCGAGAT GGCCTATTCA TTGCGCTATT ACTGGAAGCC AGTAGAAACA
GGTGCAATGC CTTCTATTCA GCTCGGTATG GATTACCGTG AGCTAGATGA TGCAGCTGAC
ACAGAAGTTC AGAGTACTGC TGCTTGGATG GCTGGTCTTA CTTGGGATGA TGCTTGGATC
GATGGCAACA GAGCTGGAAT CGCTTTTGGT TCTCGTGAGC ATGCTACCGA TTATGCAGGT
TCCGGTGACG ACGAAGCTGA TGACAACCTA GTTTGGGAAG CCTATTACGA TTACCAGTTG
ACTGATGGAA TCACCATCAC TCCAGCTCTA TTCGGTGGCT CTCATGTCTA TGACGGTTCT
GACGATGACA TCTTTGGTGC TCTAGTTCAG ACTGTATTTA AGTTCTGA
 
Protein sequence
MKLFQQLLVF PAALGLVAPL AANAAEVNMT DVSKYAAKTA KSIKAPSSAQ FSDIVPGDWA 
YTSLKNLSAS YGCVDNAYTQ NLNSGLALTR YEAAALVNAC LDNGLVASGE GLSSDASLLA
DEFGVEMAIL KGRVDGLEYK LNELSAGQFS STTKLDGTVA FVVGAVDYEN SADTAVDHGD
KLTGTYSYKL DLNTSFNGND RLHASIMTGN MDGNNPWGDK DGGTYLAVAN DNEQVLEIDK
LWYEWTKDDL KFWAGPKIES NQMLASSPSI YKPVQKQFAF GGNTAAYASS TTTGFGVAWT
QPTEADRKWT VSANYASIGG DDATKGILTD EQTKFLTQVT YGGQRWQIAA AVARHGCAGQ
DANSSCHAWS DLYATAAGDN ATGEGEMAYS LRYYWKPVET GAMPSIQLGM DYRELDDAAD
TEVQSTAAWM AGLTWDDAWI DGNRAGIAFG SREHATDYAG SGDDEADDNL VWEAYYDYQL
TDGITITPAL FGGSHVYDGS DDDIFGALVQ TVFKF