Gene P9515_16221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_16221 
Symbol 
ID4720496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1417410 
End bp1418387 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content31% 
IMG OID640081314 
Producthypothetical protein 
Protein accessionYP_001011936 
Protein GI123966855 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.698801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCAA GTGTTTACAT TCTAATTTTA CTAATAATAA TAATATTAGT AGGTTCCGCA 
TGTTGTTCAG GAGTAGAAGC AGCTTTTTTA GCTGTCAACT CCATAAGGAT TTTAGAATTA
GCATCAAAAC AAAAACCTAA AAGTTCTGCT AATCAACTTC TTAAACTTAG AAAACATCTT
GGAAGAACTT TAACTGTAAT TACCATAACT AATAATGGTT TTAATATAAT AGGTAGTCTC
CTTTTAGGTG TTTATGGAGC CTTAATAATT AAAAGTAGTT TTGGTTTAAC ATTATTTTCG
ATCGCTTTTT ATGTATTAGT AGTATTACTT GGCGAAGTAC TTCCTAAAGC AATAGGTACA
AGATTTTCAT TGCAAATAGC TATATTATCT GTTCCAATTT TGAGAATGTT AAATAGTTTA
ATGAGGCCTT TCTTAATATT AATAGAGCAT TTATTTCCTG TGATCACTGC AGAGAACGAA
ATATCAACGG ATGAAGAAGA AATTAGGCAA ATGGCAAAAA TTGGTTCACA GAAGGGGTTC
ATAGAGGCTG ATGAAGCCGC AATGATACTT AAAGTTTTTC AATTAAATGA TTTGAAAGCT
AAAGATTTAA TGATCCCAAG AGTATCTGCA CCTAGTCTTG ATGGTTCCTC AAATCTTGAT
GAAATTTCAA AACTAATAAT TTCAAATAAT TCTCCATGGT GGATTGTCTT GGGTGATAAA
GTTGATAAAA TACAAGGAAT AGCCAAACGT GAAAACTTAC TAACTAGTCT TATCAAAGGA
GAAAATAAAA AATTACTATC AGATTTATGT GATCCTGTGG ACTACATACC AGAAATGATA
AAAGTAGACA AGTTATTAAC AAAATTTGAC AAGGATAACA AAGGAGTGAA AGTAGTAGTA
GATGAGTTTG GAGGATTCGT GGGAATAATT GGATCAGAAG CTGTATTATC TGTATTGGCA
GGATGGTGGC AAGAATAA
 
Protein sequence
MEPSVYILIL LIIIILVGSA CCSGVEAAFL AVNSIRILEL ASKQKPKSSA NQLLKLRKHL 
GRTLTVITIT NNGFNIIGSL LLGVYGALII KSSFGLTLFS IAFYVLVVLL GEVLPKAIGT
RFSLQIAILS VPILRMLNSL MRPFLILIEH LFPVITAENE ISTDEEEIRQ MAKIGSQKGF
IEADEAAMIL KVFQLNDLKA KDLMIPRVSA PSLDGSSNLD EISKLIISNN SPWWIVLGDK
VDKIQGIAKR ENLLTSLIKG ENKKLLSDLC DPVDYIPEMI KVDKLLTKFD KDNKGVKVVV
DEFGGFVGII GSEAVLSVLA GWWQE