Gene A9601_04781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_04781 
Symbol 
ID4717176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp414083 
End bp415573 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content27% 
IMG OID640078190 
Producthypothetical protein 
Protein accessionYP_001008873 
Protein GI123968015 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.289419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAG TATCAATTAT TTTCCCGAAT CAACTTTTTA GAGAAAGCCC AATCTTAAAA 
ATAAATTGTG AAGTTTTGAT TTTGGAAGAC TCATTATTTT TTGGAAATGA TAAATTTCAT
AAATTAATTA ACCATAAAAA TAAGTTAGTT TTTCATAGAG CATCTATGCT CGCTTATAAA
AATTATTTAG AAATATCTGG CTTTAAAGTT TTGTATATCG AAAACAAGAA TAATGTTTCT
ACAGTTGATT ACTTATCGGA ATTTATTAAA AATAAATATC AGAAAATAAA TCTCATTGAC
CCTCATGATT TTTTAATAAT GAAGAGGATT AATAATTTTG TTGAAAGTAA TAATTTAACT
TTAAATATTT TGCCTTCTCC TATGTTTATG AGCAGTGAAG ATTTAAAAAA TTTATTTGTA
TCAAATGCAA AAAAACCTCT TATGGGGAGA TTTTATGAGA ATCAAAGAAA GAGCCAAAAG
ATATTAGTTA ATTCTGATGA TACACCTGAA GGTGGTAAAT GGAGTTTTGA TGAAATGAAC
AGAAAAAAAT TACCAAAAAA AATAAATATA CCCGACACAC CAAAATTACA AAAAAATAAA
TTTGTAGTTA ATGCAGAAAG GTCATTAGCT AATTTTGATA TTGAGTTTAT TGGAGAAAGT
AATAACTTTT TATATCCAAC TAATTTTGAA GAGGCAGATG AATGGTTAAA TGATTTTTTT
AAACATAAAT TTTTCTTATT TGGAGATTAT GAGGATGCTA TTTCTAAAGA AAATTCTTTT
TTATGGCACA GTTTACTTTC TCCTCTTTTA AATAGCGGCT TATTAACCCC AGATGTAGTA
GTAAATAAAG CATTACTTTT TGCAAAAAAT AATAATGTTC CTATTAACTC TTTAGAGGGT
TTTATTCGTC AAATTATTGG ATGGAGAGAA TTTATTTGCC TCGTCTATAA AAAGTACGGA
ACAAAGATGC GAAATAGTAA TTTTTGGAAT TTTGAAGATA AGCCAATTCC AAAATCTTTT
TATCAAGGAA ATACAGGAAT TGAACCAGTA GACGTTGTTA TAAAAAATAT TATTAAATTT
GGTTATTGTC ATCATATTGA GCGGCTAATG ATTGTTGGCA ACTTTATGCT TCTATGTAGA
ATTCACCCCA ACCAAGTTTA TAAATGGTTT ATGGAAATGT TTATTGATTC CTATGATTGG
GTTATGGTCC CAAATGTTTA CGGAATGAGT CAGTTTAGTG ATGGTGGTAT CTTTTCAACA
AAGCCATATA TATCAAGCTC TAATTATGTA AAAAAAATGT CTAATTTTAA AAGTGGTCTA
TGGTGCGAAA TATGGGATGG CTTATTTTGG AAATTTATTA AAGATAATGA AAGCTTTTTT
AGAAAGCAAT ATCGTCTGGC AATGTTAACG AGAAATCTCG ATAAAATGTC AGCGGAAAAA
TTAAATAATC ACTTAAAAAA GGCCGATAAA TTTTTAAGAG ATATTCAATA A
 
Protein sequence
MKQVSIIFPN QLFRESPILK INCEVLILED SLFFGNDKFH KLINHKNKLV FHRASMLAYK 
NYLEISGFKV LYIENKNNVS TVDYLSEFIK NKYQKINLID PHDFLIMKRI NNFVESNNLT
LNILPSPMFM SSEDLKNLFV SNAKKPLMGR FYENQRKSQK ILVNSDDTPE GGKWSFDEMN
RKKLPKKINI PDTPKLQKNK FVVNAERSLA NFDIEFIGES NNFLYPTNFE EADEWLNDFF
KHKFFLFGDY EDAISKENSF LWHSLLSPLL NSGLLTPDVV VNKALLFAKN NNVPINSLEG
FIRQIIGWRE FICLVYKKYG TKMRNSNFWN FEDKPIPKSF YQGNTGIEPV DVVIKNIIKF
GYCHHIERLM IVGNFMLLCR IHPNQVYKWF MEMFIDSYDW VMVPNVYGMS QFSDGGIFST
KPYISSSNYV KKMSNFKSGL WCEIWDGLFW KFIKDNESFF RKQYRLAMLT RNLDKMSAEK
LNNHLKKADK FLRDIQ