Gene P9303_20041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_20041 
Symbol 
ID4776473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1761137 
End bp1763878 
Gene Length2742 bp 
Protein Length913 aa 
Translation table11 
GC content59% 
IMG OID640087518 
Producthypothetical protein 
Protein accessionYP_001018011 
Protein GI124023704 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.431741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACT TCACTGATTG GTTTAAGCCC TACCTGCTTT CGGATCAAAA CGGACTGAAC 
ACGAGCGCGC TTCTCGATGA ATTTTCCGAC CTGGCCGGAA GCAGCCTCAA GGGACTTAGC
CGCAATGAAG AGCGCATCGT CGATGCCTGC CTCCATGCCG TGCTCTGGGG ATATCCCCTA
GCCGAGACCT ACCGCTACCG GCAGCTCGGC ACCAAAGTGC AGGCTAAGGA GAACATGCTG
TTCAAGCCCA GTTCCGTAGC GAGCTGGCTG AACAAGAATT CAGCCCCGGC CCCCAACGCC
TCGGTGCTCT ACGTCACAAG CTGGCTGAAT CTCAACAAAG GGGATCGCAT CCTCCAGACA
CCAGCCAACA CCGACGAGAA TTACTACATC TGGGCGATCC TCGATAGCTA CATCAACACC
GTCGGATCGA TCGGCCCGCG GACCCAATCC AAGTCCAACG CGACTCAGGA TTCCCCCAAC
TACTACCTGC TAGCCGGCCC CTCGAGCCCT TACTACAGCG GCAACGACTG GCTCACGACC
CTGAGAACAA TGCAGGGTAA CCGCACTGTC AGGATCATCA GGGTCGACAC GCCCTACGCC
TGGGTTACCG CGAGATTCGG CAGCGACACC CTCAGTGAAT CCGCACTAGC CAACACCCAC
GACTTCATCA ACGGTGCTGA AGACATTGCC GGCAGCGGCT TTCAGATCAC CTCCATCGAC
CACTTCCAGC GCACAGGCTC GGTGCCCTAT CAAGAACCGA TCAGCCAGAG CAGCACCAAT
CAAAAAGCTG AAAAAGCTCA AAAGAAGTGG GGATCGATTC CGAGCACCGC CAAGGGCTTC
TTCGATCAAC TGGGAACAGC CCTGCAAGAC AGCCCTGTGC CGGCCCAGAT CAAGCCTGGA
ACATTCACCA ACATCCCGGA CGAGGCCATC TGGCTGGGTA ATCAGAACAA GGTGCAGAAC
GCACTGGGCG GCGACCACTA TCTACCAACC AGCAGCTATC AACCGAGTTC CGCCCTTTCC
AACAGCCAAA CCAAAGCACT GAATAAGCGC TTCAGCACGA TCGGCCTAAA CCTGAGCAGG
GGTTTCACCA TGCCCAGCGA CTGGAACAGC CGCGACCGAG AAATCTTCGA GGAGTCCTAT
CTATTTAGCA ACAAGCTGCT GAGCAAGGCC ACAACAACGA TTGCCTCCGG TGCAAAAGCA
ACTAACTACT GGCACATCGG CAACTACAAC ATGGGCGTTT ATCCCAACAC CTGGCACAAC
TGGCTGGTGC GCTGTGGTGT GGCGATCGAT GGCGGGGCCG CCAACATTCC CAATGACGGC
GTCTATCCCA CCACCCAGCG GGACCATAAC GGCTACAAAC TGAGCTCCCG CTACAACTAC
TCAATTACCC TGCCGCCCCT GAGCGAAGAA CTAGGAGGCA CCACCTACGG CCCAGCCAAT
GGGTTCTGGT CGTTCACGAT CTATCAACCC AATGCTGGCA GCGCTTACCA GCCGTTCCTT
GTCGAGAACG CCATCAACAA CCTGGCCTAC ACCAGCATCG ATGCCAGGGC GACGCTCACC
GCCAATGGCT GGCTGCGCAC CGCCAAGCCC GACAATTGGA ACAACTCCAC CGCAAAAGGC
ACAGCCCTAC GCACCGGCGT CGACGGGAAC ATCGAGGGCC TTGACGCCGA AACCACCTAC
TACGTGCAGG CCACCCGCAG GGATCACAGC GATGAGAACA ACCTGCTGAT CAAACTTTCA
GCCAGCTATG AGCCCTCCTA CAACGTGGTC AAAGGAACCC CTGGCGTGCC TATCGGGGGC
CAAGGTTCGC CTGGGCCAGC GATCGACCTT TCGGCTACAG CAGAAGGCAG CAGCCTGTCG
TTCGGATGGA TTCAACCGGT GGCACAGCTG GGATCCCCGC AACAGGACAG GCTGGAGGTT
GATGAAGATG GCAAGATCGT GCTTGAGCTT CGCGCCGACC AGCCCAGCAG CGCGCGGACT
AACTGGCTGC CGACCCCAAA TAGCGGGTGG GGCCGCGCCG CCCACGACTT TCAAGTGATG
GCACGCTACT ACGAGCCCAC GGCGGACAAC CCCACAATCC TGGCGGCCGT GAAGCATCTC
GGTCGCGACG ATATCGACGG ATCCATACCC TATATCCCAC CTCCTGTGGA GCGCAAATCC
CTGCGCCGAC TCAGCATCTG GGAACAGTTG GATGACGCAG GACGAACCCT GCTCCAGCAG
CGCACCGGCA ACAACACGGT TGATCCTTTG AGCGGCACTG ATCGCTTCGA TGCAGACGCG
GTGGGCGCCA TACTCGATCT GCGCTGGGCG AATGGCGCCC TCGAAGGCAG CAACTGGGAC
ATCCGCTACG ACTACAGCCG CAATGCGGAC TACATCAACG AACTGTTCTT CTACCGCGTC
GACGATGTCA CCGGCCTGGT GAATGATCTT CGCCCAGGTG ATTCCGGCTA CAAGGCCGCC
GCCCTGGCCC GACGGGTGAA TGCCGATCAG CCGATCAACA ATGCAACGAA CAACAGCACC
TATTCAGGCA CGCTGCGACT CGAGGGCGGG GCGATCTACA TGCCGCTGGT GCGCACGGAT
GCCGGAGAAC TACTCCTGCC CAATGCCCGC AGCACCGGGA ACGTCTCGCT GTTCTCCCTC
GTAGGAAGCG ATGCCTTCGC CTTCGATGAT CAGCTCAGCA GCGGCGATCA ACACAACAAT
GACGGACTGT TCAGGGTGAC AGGACTCACC CCAGTGGCCT GA
 
Protein sequence
MQNFTDWFKP YLLSDQNGLN TSALLDEFSD LAGSSLKGLS RNEERIVDAC LHAVLWGYPL 
AETYRYRQLG TKVQAKENML FKPSSVASWL NKNSAPAPNA SVLYVTSWLN LNKGDRILQT
PANTDENYYI WAILDSYINT VGSIGPRTQS KSNATQDSPN YYLLAGPSSP YYSGNDWLTT
LRTMQGNRTV RIIRVDTPYA WVTARFGSDT LSESALANTH DFINGAEDIA GSGFQITSID
HFQRTGSVPY QEPISQSSTN QKAEKAQKKW GSIPSTAKGF FDQLGTALQD SPVPAQIKPG
TFTNIPDEAI WLGNQNKVQN ALGGDHYLPT SSYQPSSALS NSQTKALNKR FSTIGLNLSR
GFTMPSDWNS RDREIFEESY LFSNKLLSKA TTTIASGAKA TNYWHIGNYN MGVYPNTWHN
WLVRCGVAID GGAANIPNDG VYPTTQRDHN GYKLSSRYNY SITLPPLSEE LGGTTYGPAN
GFWSFTIYQP NAGSAYQPFL VENAINNLAY TSIDARATLT ANGWLRTAKP DNWNNSTAKG
TALRTGVDGN IEGLDAETTY YVQATRRDHS DENNLLIKLS ASYEPSYNVV KGTPGVPIGG
QGSPGPAIDL SATAEGSSLS FGWIQPVAQL GSPQQDRLEV DEDGKIVLEL RADQPSSART
NWLPTPNSGW GRAAHDFQVM ARYYEPTADN PTILAAVKHL GRDDIDGSIP YIPPPVERKS
LRRLSIWEQL DDAGRTLLQQ RTGNNTVDPL SGTDRFDADA VGAILDLRWA NGALEGSNWD
IRYDYSRNAD YINELFFYRV DDVTGLVNDL RPGDSGYKAA ALARRVNADQ PINNATNNST
YSGTLRLEGG AIYMPLVRTD AGELLLPNAR STGNVSLFSL VGSDAFAFDD QLSSGDQHNN
DGLFRVTGLT PVA