Gene P9301_13141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_13141 
Symbol 
ID4912828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1100559 
End bp1101647 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content36% 
IMG OID640160903 
Producthypothetical protein 
Protein accessionYP_001091538 
Protein GI126696652 
COG category[R] General function prediction only 
COG ID[COG2516] Biotin synthase-related enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA TAGGTAAAAT TATTACTGAT TTGCAAGTTA ATGGGATAAG TTCAACTCCC 
AAAAAGGGCA ACAGAGGTAG GAAAGGAGGA GCTGGTCCAT CGGATCATAG AGCTTTAACA
ATTGAAGGAA AAACTGTAAT GGTTCCCGTT TATAATCACT TATCTAAGAA GTCTAATTAT
CAACTTTCAG AAGAGTCTGA CGGACAATTT ATTCTCCAAA ATAGTGAAGA TTCAAAAATC
AAGGAATTAT CTACAACAAA AGAACCAAAT TTTTATTCTT TAAAAACAAA AGATGGTATA
CCTTATAAAT CGATTGCTCT TCTTCATAGC AAGGATGTAT TAGCCACAAC TATTCTCCAA
AAATGTATTC GTTTCAGAAA TAGAGAAGAA TCTTGCCAGT TTTGTGCAAT AGAACAATCT
CTGAAAAATG AACAAACCAT CGTAAGAAAA ACTCCAGATC AAATTGCTGA AGTTGCAGAA
GCAGCTGTAA GACTTGATGG AATAAAGCAA TTAGTAATGA CAACAGGGAC CCCCAACACT
AGCGATAGGG GAGCGAGAAT AATGGCAGAA GCAGCTAAGG CAGTTAAGGC TAAAGTTGAT
ATTCCAATCC AAGGTCAATG CGAACCTCCT GATGATCCTA TTTGGTTTCA AAAAATGAAA
GACTCAGGTG TAGATAGTTT AGGCATGCAT TTAGAGGTTG TAGAGGAGGA GATAAGAAAA
AAAATTCTTC CCGGCAAATC TGAAATTCCT CTTGAAAGAT ACTATAAATC CTTTGAAGAA
AGTGTCGCAG TATTTGGAAG GGGAGAAGTT TCTACATATT TATTAGCAGG ATTAGGTGAT
AGCAAAGAAT CTCTAATAAA TTGCAGTAAA AAATTGATAT CTATAGGAGT TTATCCTTTT
ATAGTTCCAT TTGTGCCAAT AGCAGGAACT CCTCTAGAAC ATCATCCCTC CCCAAGCACT
GATTTCATGA TTGATATTTA TCAATCAGTC TCGCATTTAC TAAATGAAGG CAACATAAAA
TCCGATGAAA TGTCAGCTGG TTGTGCCAAA TGCGGTGCCT GCTCAGCTTT ATCCCTATTT
GAGAGTTAA
 
Protein sequence
MSEIGKIITD LQVNGISSTP KKGNRGRKGG AGPSDHRALT IEGKTVMVPV YNHLSKKSNY 
QLSEESDGQF ILQNSEDSKI KELSTTKEPN FYSLKTKDGI PYKSIALLHS KDVLATTILQ
KCIRFRNREE SCQFCAIEQS LKNEQTIVRK TPDQIAEVAE AAVRLDGIKQ LVMTTGTPNT
SDRGARIMAE AAKAVKAKVD IPIQGQCEPP DDPIWFQKMK DSGVDSLGMH LEVVEEEIRK
KILPGKSEIP LERYYKSFEE SVAVFGRGEV STYLLAGLGD SKESLINCSK KLISIGVYPF
IVPFVPIAGT PLEHHPSPST DFMIDIYQSV SHLLNEGNIK SDEMSAGCAK CGACSALSLF
ES