Gene P9303_21931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21931 
Symbol 
ID4777826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1950855 
End bp1952018 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content39% 
IMG OID640087708 
Producthypothetical protein 
Protein accessionYP_001018193 
Protein GI124023886 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAC CACTTAGTCT AATAATTACA GCGTCTGCTG TTGTTATTAG TCCATTGTCT 
GCTGTAGCTG AGATTGATCC TGAACTTCAT AAGCTGTGTA TAGATGCCAA AGATTACAAA
GGATGTATCG AGGCAAGAAC AGAACCATCG CCTGAGATTG AGTCAAATGA AAATGAAGTT
GAAGTATCGG CACCTTCAAC CTACAACTAT GAAAAAGATT CAGTAAGGCA GCTAAAGATT
AGAGGTAAGT ATGGAAGGTA CTTAACCTTT ATAGGTCGAA CACCAAACAC CTATAGCGGG
ACTAGCGGTT CATATAGTCC AGGTAGTGGT GGGACCTTAA ATTGTTCTAC TTACGGTTCC
TCTACTTATG CTACAACTAA TTGCTACCGC ACTGGTTATG TAGCACCTTC TTATACACCA
GCCAGACCAG GTGGTACACA ACATCAAAGG TTCAGGTATG AACTTGATTG CCAAGATCAA
ACATATAATA TTAAAGGTGA CTTAAAGTCA GCCGGAGGGT TTAAGAAGGG TTGGATGCAT
GTGAGTAATG ATCCTGTCGC TAGTGCTGTA GCTAGAAAAT ACTGTCCTGT TATTGATACG
TTGGCTGTTG CTGGATATGT AAACAAGGGG GATGTTTTTG AGTCAGGATC CATACTATGG
AAGGATCGTT GGGGTCCAGA ACCAAAAGCA TCTATTAGCG AAGAAAAATA TTATTTATTC
AAAACGTATG TAAAAAATAA ACAATACAAG GAGGCGTTAA AACTATCAAA TAAATTGGTC
ATTGATTTTC CTGATGATCC ACGCTCATGG ACTCATCTAG GCGTTGCATA TTTTATTTTA
AAGGATTATT CTGCTGCAAA GGAACAATTA AACAAGGCAA TATTTATCAA CCCGCTGTTT
GAAGATGCCT ACTATAATCG AGGCCTAGTT TATTCAGCCT TAGGTTTATA CGATCAAGCA
ATCCGTGATT ACACTAAAGC TATTCGTATG TACCCAGATA GAATGCACTT TTGGGTGAAT
AGGTCTACTG CTTATTGGAG AAAAGGAGAC AAGCAGAAAT CTTGTAGTGA TTCTCGTAAG
TTAATTCAAT TAGGACTCCA GAATCCAGAG TGGCAAAAAT GGTGGCAAAA GTTTGGCAAA
AAAGAATGCA AGAAATACAA GTAA
 
Protein sequence
MIKPLSLIIT ASAVVISPLS AVAEIDPELH KLCIDAKDYK GCIEARTEPS PEIESNENEV 
EVSAPSTYNY EKDSVRQLKI RGKYGRYLTF IGRTPNTYSG TSGSYSPGSG GTLNCSTYGS
STYATTNCYR TGYVAPSYTP ARPGGTQHQR FRYELDCQDQ TYNIKGDLKS AGGFKKGWMH
VSNDPVASAV ARKYCPVIDT LAVAGYVNKG DVFESGSILW KDRWGPEPKA SISEEKYYLF
KTYVKNKQYK EALKLSNKLV IDFPDDPRSW THLGVAYFIL KDYSAAKEQL NKAIFINPLF
EDAYYNRGLV YSALGLYDQA IRDYTKAIRM YPDRMHFWVN RSTAYWRKGD KQKSCSDSRK
LIQLGLQNPE WQKWWQKFGK KECKKYK