Gene A9601_03141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_03141 
Symbol 
ID4717001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp291140 
End bp292525 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content33% 
IMG OID640078016 
Producthypothetical protein 
Protein accessionYP_001008709 
Protein GI123967851 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.782504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAGC GTAAAAAAAG GGTAAGAAGG TACTATAAAA ACTTTAAAAA GTCTAATTTT 
GCCTTGTTTA ACAAAATCTT AAGAATTTTG AGTTGGCTTT TGCCAGGATT GGTAATAAAA
AGATGGATGC TTACATCTGC GGTAGGATTT TTGACTACAT TATTAGGCTT GGTAATTTGG
ACAAATTTAA GACCGCTCTA TTGGCTTATT GAAATCTTTT TTTCAGTAAT GACAGGTTTA
ACTAGTATTT TGCCTGTTTC ATTTATGGGA CCATTGATTT TTGTTATTGG AATATTATTA
ATAGGGATTG GACAAAATAG AAGTATTAAT TCTATTCAAA AAGCGCTTGT TCCAGAAAAA
GATACATTTT TAGTTGATGC ATTAAGAGTT AAAAGTAAAT TAAACAGAGG CCCAAATATT
GTTGCAATTG GGGGAGGTAC AGGCTTATCT ACTTTATTGA AAGGCTTAAA AAACTATAGT
AGTAATATTA CAGCAATCGT AACTGTATCC GATGATGGTG GAAGTAGTGG AATTCTCAGA
AAACAATTAG GTGTGCAACC TCCTGGAGAT ATTAGAAATT GTTTGGCAGC CTTATCTAAC
GAAGAACCAA CTTTAACTAG ATTATTTCAG TACAGATTTT CAGAGGGAAC TGGTCTGGAG
GGCCATAGTT TTGGAAATCT ATTCTTGTCA GCTTTAACAA CAATTACAGG CAATTTAGAA
AAAGCAGTTC AAGCCTCTAG TAAGGTTTTG GCGGTACAAG GTCAAGTTTT ACCGGCGACA
AATATTGATG TTATGTTATG GGCCGAATTA GAAGATGGTG AAAAAATTTT TGGTGAAAGC
AAGATCAGTA AATCTAAAAA ATTAATTTCG AGGATTGGTT ACCTACCTGA AAACCCTTCA
GCTCTTCCAA GTGCTCTTGA ATCTATAAAA GAAGCTGATT TAATTATTCT TGGCCCAGGT
AGTCTTTACA CTTCTTTATT GCCTAATCTG TTAGTACCAG AGATAGTAGA TGCTTTATTG
CAAAGTAATG CTCCCAAAAT CTATATAAGT AATTTGATGA CTCAGCCTGG AGAAACAGAT
GGACTTGATG TCTATCAACA TATCAAAGCA ATAGAAAAAC AATTATTAAA TTTTGGAGTT
AATACTCGAA TTTTTGACTC AATATTATCT CAGACTCAAT TTGAAAAGTC TCCATTAGTA
GATTATTACG AAAGTAGAGG GGCAGAGCCT GTCCAATGTA ATAAAGAAAA ACTATTATCT
GAGGGTTATT ATGTTTTGCA AGCACCACTA TATGCAAAAA GAATAACTCC AACACTAAGA
CATGATCCAA GGAGACTAGC AAGAGCAGTT ATGTTTATTT ACCGCAAATT AAAAAAAATA
AACTAA
 
Protein sequence
MYKRKKRVRR YYKNFKKSNF ALFNKILRIL SWLLPGLVIK RWMLTSAVGF LTTLLGLVIW 
TNLRPLYWLI EIFFSVMTGL TSILPVSFMG PLIFVIGILL IGIGQNRSIN SIQKALVPEK
DTFLVDALRV KSKLNRGPNI VAIGGGTGLS TLLKGLKNYS SNITAIVTVS DDGGSSGILR
KQLGVQPPGD IRNCLAALSN EEPTLTRLFQ YRFSEGTGLE GHSFGNLFLS ALTTITGNLE
KAVQASSKVL AVQGQVLPAT NIDVMLWAEL EDGEKIFGES KISKSKKLIS RIGYLPENPS
ALPSALESIK EADLIILGPG SLYTSLLPNL LVPEIVDALL QSNAPKIYIS NLMTQPGETD
GLDVYQHIKA IEKQLLNFGV NTRIFDSILS QTQFEKSPLV DYYESRGAEP VQCNKEKLLS
EGYYVLQAPL YAKRITPTLR HDPRRLARAV MFIYRKLKKI N