Gene P9301_04471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_04471 
Symbol 
ID4911293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp388504 
End bp389994 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content28% 
IMG OID640160025 
Producthypothetical protein 
Protein accessionYP_001090671 
Protein GI126695785 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.77539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAG TATCAATTAT TTTCCCGAAT CAACTTTTTA GAGAAAGCCC AATCTTAAAA 
ATAAATTGTG AAGTTTTGAT TTTGGAAGAC TCATTATTTT TTGGAAATGA TAAATTTCAT
AAATTAATTA ATCATAAAAA TAAGTTGGTT TTTCATAGAG CATCTATGCT CGCTTATAAA
AATTATTTAG AAATATCTGG CTTTAAAGTT TTATATATCG AAAACAAGAA TAATGTTTCT
ACAGTTGATT ACTTATCGGA ATTTATTAAA AATAAATATC AGAAAATAAA TCTCATTGAC
CCTCATGATT TTTTAATATT GAAGAGGATT AATAATTTTG TCGAAAGTAA TAATTTAGAT
TTAAATATTT TACCTTCTCC TATGTTTATG AGCCGTGAAG ATTTAAAAGA TTTATTTGTA
TCAAATGCAA AAAAACCTCT TATGGGGAGA TTTTATGAGA ATCAAAGAAA GAGCCAAAAG
ATATTAGTTA ATCCTGATGA TACACCTGAA GGTGGTAAAT GGAGTTTCGA TGAAATGAAC
AGAAAAAAAT TACCAAAAAA AATAAATATA CCCGATACAC CTAAATTACA AAAAAATAAA
TTTGTAGTTA ATGCAGAAAG GTCATTAGCC AATTTTGATA TTGAGTTTAT TGGTGAAAGC
AATAACTTTT TATATCCAAC TAATTTTGAA GAGGCAGATG AATGGTTAAA TGATTTTTTT
AAACATAGAT TTTTTTTATT TGGAGATTAT GAGGATGCTA TTTCTAAGGA AAATTCTTTT
TTATGGCACA GTTTACTTTC TCCTCTTTTA AATAGCGGCT TATTAACACC AGATGTAGTA
GTAAATAAAG CATTACTTTT TGCAAAAAAT AATAATGTTC CTATCAACTC TTTAGAGGGT
TTTATTCGTC AAATTATTGG ATGGAGAGAA TTTATTTGCC TCGTCTATAA AAAGTACGGA
ACAAAGATGC GAAACAGTAA TTTTTGGAAT TTTGAAGAGA AGCCAATTCC AAAATCTTTT
TATCAAGGAA ATACAGGAAT TGAACCTGTA GACGTTGTTA TAAAAAATAT TATTAAATTT
GGTTATTGTC ATCATATTGA GCGGCTAATG ATTGTTGGCA ACTTTATGCT TTTATGTAGA
ATTCACCCCA ACCAAGTTTA TAAATGGTTT ATGGAAATGT TTATTGATTC GTATGATTGG
GTTATGGTCC CAAATGTTTA CGGAATGAGT CAGTTTAGTG ATGGTGGAAT CTTTTCAACA
AAGCCATATA TATCAAGCTC TAATTATGTA AAAAAAATGT CTAATTTTAA AAGCGGCCCA
TGGTGTGAAA TATGGGATGG CTTATTTTGG AAATTCATTA AAGATAATGA AAGCTTTTTT
AGAAAGCAAT ATCGTCTGGC AATGTTAACT AGAAATCTCG ATAAAATGTC AGAGGAAAAA
TTAAATAATC ACCTAAAAAC GGCCGATAAA TTTTTAAGAG ATATTCAATA A
 
Protein sequence
MKQVSIIFPN QLFRESPILK INCEVLILED SLFFGNDKFH KLINHKNKLV FHRASMLAYK 
NYLEISGFKV LYIENKNNVS TVDYLSEFIK NKYQKINLID PHDFLILKRI NNFVESNNLD
LNILPSPMFM SREDLKDLFV SNAKKPLMGR FYENQRKSQK ILVNPDDTPE GGKWSFDEMN
RKKLPKKINI PDTPKLQKNK FVVNAERSLA NFDIEFIGES NNFLYPTNFE EADEWLNDFF
KHRFFLFGDY EDAISKENSF LWHSLLSPLL NSGLLTPDVV VNKALLFAKN NNVPINSLEG
FIRQIIGWRE FICLVYKKYG TKMRNSNFWN FEEKPIPKSF YQGNTGIEPV DVVIKNIIKF
GYCHHIERLM IVGNFMLLCR IHPNQVYKWF MEMFIDSYDW VMVPNVYGMS QFSDGGIFST
KPYISSSNYV KKMSNFKSGP WCEIWDGLFW KFIKDNESFF RKQYRLAMLT RNLDKMSEEK
LNNHLKTADK FLRDIQ