Gene P9211_09661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_09661 
Symbol 
ID5731079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp858720 
End bp860375 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content35% 
IMG OID641285333 
Producthypothetical protein 
Protein accessionYP_001550851 
Protein GI159903507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.308392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00593947 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAACT ATATAGAAAA GAAATTGACC GAAGGTGGAT ACCAACAAAT CTTGTTGTTA 
GCTCCATCCT TGCTTGGGGA GTCTTTAGCA GCTCAATTGC AATCTGCCAA TAAATCCAAT
GAGATTATAT TGCGACAAGA AAATTTAACT AAAGCTCCAG CTCTTGTTAT ATGGGCAATT
GATAATGTAG TAATTCCATC GACGATTAGA TTTGAGCTAA GGACGCTATC CGAAAGATGG
GCACCATCGC CAATACTTCT TCTACTGCCA AGTAAAACAT CAATACCTCC TAATGAAATT
TTGAATTTTG AAAGTGATGG AATACTCCAA GATCCAGATA TAAAGACATT AGTTGACTCT
ATTTCAACAA TTCTAGAAGG GGGGAGGGTA TTTCAATTAA AACAAGCTCA GAATCCAATA
AATAGCACTA GGAAAAGATC AATTGGTATA GGTCAGTATC TTTTGAAACA AGGGATAGAC
CAAATTGATA TGAAAATTGC TCAATTGGAA CCAATTCTTA GCCCTCTACC CATAAACCCC
TTTCTGCGTA TAGCCATAAA CGGTAGGAAG AGAGAACTAA ATAGCGCAAA AAACATATTA
ATTTGGGTAT GGGGACCAAT TCATAACATG CCAATAAGTG ATATTTCAGC TAAATTAACT
TCAGATATTG ATTATATATA CGATAATTTT GTAGCAGATA TAGTACTCCC CCAAAGGGAT
TCGAAGGCTG TAATAGAACT CATAATAACT CGCCTGAGAA AATCAGTAAG TGATCCTTTA
TCAAACTCTA CAGGGACTAT ATTTGCCTTA CAGGCAATTA CTGAATGCAA ACAAAAAACA
CTTTTACTGG AATTGATTAC TCAACTAGAA AAGTTACTAT TGCGGTTAAT TTCCCTGGAT
AAGAATGAGT CTAAAATAAT AGATACCTGG AATTCATTTC AACTTAACCT TCGCAAAGAG
GCTATCCGCT CAATAGCAGA ACCTTATACA ACAATAGAAT ATGAAGGAAA CTCTGTACTA
TTAAGAGATC GTCTAGAGAA ACTAACTGAA TTAGACGAGA TTGATGAGGA TATGCCTAGT
CCTAAAAATA TTGTTCAAAC CCTCATTTTA AATGAATCCT TAAAAGTTGA TGACCAATAC
CTTCCCTACG ATCACCCAAA GTCAGTTATA AGAACGGAAA TGATCTTAAC TAATTGGCTT
ATAAGAACAG CTGAAATTAT TAGTTCAGAG CTTCTTAATC AAGCATCAAT TTGGCCAGAC
CTTAGACAAT ATTTGCTAAC TTCAAATCTT ATTTCTACAA GAGAACTTGA ACGTCTTCGC
AATCAATTAA ATTCGCAATC TAGAATACAA AGTCTATTTA CTCGTCCTAT TCATTTATAC
GAAAGTAAAA GACTTCTCTA CCGTATCAAC CAAAGCTCTA TTGAATCTTA TATATTAACA
GAGTTACGAG ATAAAGAATT AAGGGAACTG GGTTGGCTCC AAAAACAAGT TACGTTATTA
GTAGAAGCAA GAGATGCATT GGCTCCGCAG ATACAATCCC TGGTAAAATA TATAGGTAAT
TTCATGGTGA TACTACTAAC TAACGTACTT GGTCGTGCCA TTGGTTTAGT TGGCAAAGGA
ATAGCTCAAG GGATGGGTAG ATCTCTATCC AGATAA
 
Protein sequence
MENYIEKKLT EGGYQQILLL APSLLGESLA AQLQSANKSN EIILRQENLT KAPALVIWAI 
DNVVIPSTIR FELRTLSERW APSPILLLLP SKTSIPPNEI LNFESDGILQ DPDIKTLVDS
ISTILEGGRV FQLKQAQNPI NSTRKRSIGI GQYLLKQGID QIDMKIAQLE PILSPLPINP
FLRIAINGRK RELNSAKNIL IWVWGPIHNM PISDISAKLT SDIDYIYDNF VADIVLPQRD
SKAVIELIIT RLRKSVSDPL SNSTGTIFAL QAITECKQKT LLLELITQLE KLLLRLISLD
KNESKIIDTW NSFQLNLRKE AIRSIAEPYT TIEYEGNSVL LRDRLEKLTE LDEIDEDMPS
PKNIVQTLIL NESLKVDDQY LPYDHPKSVI RTEMILTNWL IRTAEIISSE LLNQASIWPD
LRQYLLTSNL ISTRELERLR NQLNSQSRIQ SLFTRPIHLY ESKRLLYRIN QSSIESYILT
ELRDKELREL GWLQKQVTLL VEARDALAPQ IQSLVKYIGN FMVILLTNVL GRAIGLVGKG
IAQGMGRSLS R