Gene A9601_11361 details


Gene Information

Locus tag: A9601_11361
Symbol:
ID: 4717848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 956660
End bp: 957733
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 40%
IMG OID: 640078851
Product: WD-40 repeat-containing G-protein
Protein accession: YP_001009527
Protein GI: 123968669
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
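
The coordinate and length fields above are consistent with one another, which a short check can illustrate. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 956660
end_bp = 957733

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the two printed values match the Gene Length and Protein Length fields of the record.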


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAA TAGAACCATT TAGCCCAAGA GGCATGTTCC ATGAAGGATG GACTGCTGAA 
GTTAACGATT ACGCCATAGC ATGTGGCTGG GCTCTTAAAG GAAAACAATT TATTGTTGGC
GACGTAGCAG GAGGTCTTTT TTCGTTCGAA GGAGATACTG GAAAAATTAT TTGGAAAAAA
GAAAATACTC ACTCTGGTGG TCTACTAGCA ATGGCAATTC ATCCAGAGGG TGAAATTTTT
GCAACATCAG GTCAGGATGG AAATGTTCAA ATTTGTAATT GCCATGAAGG TAAAGTTATT
AAAACACTTG ATCTTGGCAA AGGTTGGGTA GAACACCTCA AGTGGTCTAA TGACGGTTTA
TTTCTAGCGA TAGCTTCCTC AAAAAAAGTA TATGTTTTTA ATGAAATTGG TGAAGAGAAA
TGGATCTCAG AAGACCATCC AAGCACAGTA AGTGCAATAA CGTGGTCAAA TAAAAATGAG
CTAGCAACTG CTTGCTATGG AAGAGTGACA TTCTTTGACA TAGTTAATAA TAAAACGAAT
CAAAAACTCG AATGGCAAGG ATCATTGGTA TCTATGGAAT TAAGCCCTGA TGGAGATATA
GTCGCCTGTG GGAGTCAAGA CAATTCAGTT CATTTTTGGA GAAGATCAAC AGGAATGGAT
GCTGAAATGA CTGGATACCC TGGAAAACCA AGTCACCTTT CTTTTGATGA CAGCGGAAAA
TTATTAGCGA CTAGCGGCAG TGAAAGAATT ACAGTATGGA GCTTTATAGG AAATGGTCCA
GAAGGGACTA TGCCAGGAGA GCTATGTCAC CATACAGAAC CTATTTCGAG CTTAGCCTTT
TCAAATAAAG GTATGCTTGT AGCCTCAGGA TCTAGGGATG GTTCTGTTGT CGCAAGTTTC
CTAAAAAATG ACGGCAATGG AGACCCAGTT GGGGCTGCAT TTGCCGGAGA TTTGGTAGGG
GCAATATCAT GGAGACCTGA TGATTGTGCA CTTGCAGCAG TTAATGCAAA AGGTGTTGTA
AATGTATGGA AATTTAAAGT TCGTACTAAC CTTTTTTCAA AGGGATTTAA GTAA
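
The GC content field of the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, and how the database rounds its reported value is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used only as a short example.
first_row = "ATGCCTGAAA TAGAACCATT TAGCCCAAGA GGCATGTTCC ATGAAGGATG GACTGCTGAA"
print(f"{gc_content(first_row):.1f}% GC in the first row")
```

Passing the full gene sequence, with or without the spaces and line breaks shown above, gives the value to compare against the GC content field.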
 
Protein sequence
MPEIEPFSPR GMFHEGWTAE VNDYAIACGW ALKGKQFIVG DVAGGLFSFE GDTGKIIWKK 
ENTHSGGLLA MAIHPEGEIF ATSGQDGNVQ ICNCHEGKVI KTLDLGKGWV EHLKWSNDGL
FLAIASSKKV YVFNEIGEEK WISEDHPSTV SAITWSNKNE LATACYGRVT FFDIVNNKTN
QKLEWQGSLV SMELSPDGDI VACGSQDNSV HFWRRSTGMD AEMTGYPGKP SHLSFDDSGK
LLATSGSERI TVWSFIGNGP EGTMPGELCH HTEPISSLAF SNKGMLVASG SRDGSVVASF
LKNDGNGDPV GAAFAGDLVG AISWRPDDCA LAAVNAKGVV NVWKFKVRTN LFSKGFK
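
The protein sequence can be derived from the gene sequence by codon translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11's alternative start codons are not modelled here. `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a codon with characters other than A, C, G, T
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTGAAA TAGAACCATT TAGCCCAAGA"))  # MPEIEPFSPR
```

The printed residues correspond to the first group of the protein sequence above. Translating the full gene sequence the same way gives the complete protein to compare against the record.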