Gene P9303_20991 details

Gene Information

Locus tag: P9303_20991
Symbol:
ID: 4776836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1859890
End bp: 1861350
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 40%
IMG OID: 640087607
Product: hypothetical protein
Protein accession: YP_001018099
Protein GI: 124023792
COG category:
COG ID:
TIGRFAM ID:
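
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only illustrative and rests on two assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the record above.
start_bp = 1859890
end_bp = 1861350
gene_length_bp = 1461
protein_length_aa = 486

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as an amino acid.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_gene_length == gene_length_bp)
print(computed_protein_length, computed_protein_length == protein_length_aa)
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.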


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCAA TCCTTTATCA CCTTTTAAAT CCTCAAAATC TTGAGAGAAA GAGCATCTCT 
AACCGTGGTT ATGCTGAATA CTTAGAGTAC GAATCCTGGC TTGATAGTAA GACTTTACCA
TGGGCGTGGA ATTTGGCTGA TCATGAAGAT CACTTTCATA AAAATGGCCG TATCAGATTT
AATGTATCTG GATGGAATAA TGAGCAACTT GCCATACCCC CAGATTCAGA TGGTTTGATC
GATGAGTATA GAAATCTTGC TCGAGAAGCA TTTAAGCTCT TTGAGGAGAA CCTGGGCATA
GACTTTGTCG AAACTAATGA GGAAGATGCC GACATCTTCT TCATCGATAA TCATAGGGAT
GGTGGTTACT GGTCATATAC TCCATTAGAG GAAAGAAGTC TTTACTCCGA GAAGGCTTAT
CCGGAACACA GCTTTATCAA TATTACAGTA GATGACTCGT TATCGGCGAA ACTGCATGGT
GGGCTTTTTG CTACATTCAT TCATGAGATT GGCCATAGCT TGGGCCTTGG CCATGATGGC
AATTACAACT TTGATGACTC TAACCCTAAA GCTAGTAATT ATGAAAATGC GGGCCCATAC
CTGAACTCAT CGCAGCAATC GTCAATGATG TCGTATTTTA GAGTTCCATC TAAAGACGCA
TTGAGAAGCT GGGGAGTTGA AATGATTAAC CCTAATATTG CTAATGCTCA TTTTGAACAT
ACAAATACGC CAATGCCTAT TGACTGGTTG GCTTTAGACA ACATTTATAA ACAGCAAGGA
TACGGTATAT CCAATTCCTT CAATGGCGAT ACTCTCTATG GCGCGGATAC TTCTATTCCT
GCTGAAGTCA GCAACGTTTG GAACCAATTT GCTGATTTTA TTCATCACAA TGCATTTACC
ATCGTTGATG GCTCGGGTCA TGACATCATC GATGTGAGCT TTTCTGGATT TGATCAGACT
ATTGATCTTC GAGCCACTGA TCCTAATTCT GATTTTTTAT ACCCATCAGA CGTCAATGGA
CTTAAAGGTA ATCTATACAT AGCAGCCAAT ACAGAGATTG AAGAGGCAAT TACAGGCTCC
GGAAACGATC TTTTGATTGG CAACAAATTC AATAATATCC TAGATGGAGG ATCCGGTTCT
GACGAGCTTT GGGGATTTAA AGGGGCGAAC ACTTTAAAGG CTGGTGATTT TGATGATGTC
ACCGATAAGT TCTATATAAA AGCAAGTAAT ATGGTTGAAC AGTGCGATTT TCTTTTTCAG
GTTGACTCTT CTGATAGGAT TTATATAGAC ACAAGCGATG ATGGACAGAT CACCTATCAA
GATCATATTG AAGACCCCAA TGGTAGCAAT TATGTGGGTG TTGGCATCTT TGTAGATGCT
GTTTTAGAGG CTTTAGTCAT TAATTCTGGC TTGACTTCAG ATCAGGTCAA TGATATTACC
AAGGGCGGAG ACTTTTACTA G
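
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The following is a minimal sketch of how one might do that and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used as a short demonstration.
demo = clean_sequence("ATGATTTCAA TCCTTTATCA")
print(len(demo), round(gc_fraction(demo), 2))
```

Passing the full multi-line sequence text to `clean_sequence` would allow the result to be compared with the gene length and GC content listed in the record.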
 
Protein sequence
MISILYHLLN PQNLERKSIS NRGYAEYLEY ESWLDSKTLP WAWNLADHED HFHKNGRIRF 
NVSGWNNEQL AIPPDSDGLI DEYRNLAREA FKLFEENLGI DFVETNEEDA DIFFIDNHRD
GGYWSYTPLE ERSLYSEKAY PEHSFINITV DDSLSAKLHG GLFATFIHEI GHSLGLGHDG
NYNFDDSNPK ASNYENAGPY LNSSQQSSMM SYFRVPSKDA LRSWGVEMIN PNIANAHFEH
TNTPMPIDWL ALDNIYKQQG YGISNSFNGD TLYGADTSIP AEVSNVWNQF ADFIHHNAFT
IVDGSGHDII DVSFSGFDQT IDLRATDPNS DFLYPSDVNG LKGNLYIAAN TEIEEAITGS
GNDLLIGNKF NNILDGGSGS DELWGFKGAN TLKAGDFDDV TDKFYIKASN MVEQCDFLFQ
VDSSDRIYID TSDDGQITYQ DHIEDPNGSN YVGVGIFVDA VLEALVINSG LTSDQVNDIT
KGGDFY
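
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table rather than a library call. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard genetic code and differs only in which codons may act as start codons; the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated with bases in T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and its final twelve bases.
print(translate("ATGATTTCAA TCCTTTATCA CCTTTTAAAT"))
print(translate("GACTTTTACTAG"))
```

The two demonstration inputs reproduce the first ten residues (MISILYHLLN) and the last three residues (DFY) of the protein sequence listed above.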