Gene A9601_11421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_11421 
Symbol 
ID4717855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp959398 
End bp960840 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content33% 
IMG OID640078857 
Producthypothetical protein 
Protein accessionYP_001009533 
Protein GI123968675 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA TGTTTTCAAG TTATCAGCCT AAAAATAGTT TTGATGAATA CTTTAAGGAC 
AATGTAAACT CTGCTAGAGA AATATTGATT CCACTTCTTT CATCTTTAGA TAATATGGGA
CTTGAAGAAT TAAACAGGAA TCACTCTGCC GCAAAAAAAT TATTACTAAG ACATGGTGCA
ACTTTTAGAT TAAACGATAC TGGTTTAAAA GGTACTGAGA GAATATTACC TTTTGATCCA
CTTCCCAGAA TAATTAGTAA AGATGATTGG GTAACGTTAG AAAAAGGCCT AAAACAAAGG
CTTGAGGCAA TAGATTTATT CCTAGATGAT ATTTATAATT CTCAAAAAAT AATAAATGAT
GGAATAATTC CAAGAGAATT AATAGAGAGT TCAGAAGGTT GGAGACCTCA GATGATAGGT
TTCAAACCTC CACTAAATAA ATGGTGTCAA ATTTCGGGAC TTGATTTAAT AAGGGATAGA
AAAGGAGATT GGCATGTTTT AGAAGATAAT TTAAGGTGCC CTTCTGGGGT TGCTTATTTT
TTAGAAAATA GATTAGTTAT GAAAAATATT TTTCCTAATC TTTTCTCAGG AAGAATAGTA
AAACCAATTG ATGAATATCC ATCATATCTT TTAAAAACGC TTCAAGAACT TGCTGTTTGG
ACTGACACTC CCAAGATAGT TCTACTAACT CCAGGAATTT TTAATAGTGC TTATTTTGAA
CATAGTTATC TAGCGCAAGA AATGGGCATC CAACTAGTTC AAGGTCATGA CTTAGTTTGT
AATGATGATT ATGTATATTT AAAAACTACC TCTGGATTAA AAAGAGTAGA TGTCATTTAC
AGGCGAATTG ATGATGATTT CTTAGATCCT CTTAATTTCA GAAAAGATTC CTGCCTTGGT
GTCAGCGGAT TACTTGATGT TTTTAAGGCA GGTCATGTTG CTTTAGCAAA TGCACCTGGT
ACTGGAATAG CAGATGACAA AATGATTTAT TCATTTGTTC CAAAAATGAT TAAATATTAT
CTTGATGAAG AAATTATTAT TAAAAATGTA GAAACGTATA TTTGTCATTA CCAAAAGGAT
CGAGAATATG TTCTAGAAAA TTTATCAAAA CTTGTTGTTA AGTCTGTAGC AGAAGCCGGT
GGTTATGGAA TGTTAATTGG ACCTCACTCA ACAACCAGTG AAATAGAAGA ATTCGCTAAT
AAAATTAAAA ATAATCCTAG AAATTTCATA GCACAACCAA CGTTAGAATT ATCTACTGTG
CCATCGTTAT GTGATGGAGA ACTATATCCA TGTCATGTTG ATTTAAGGCC ATACATCTTA
AGAGGAAAAG ATTCATGGGT TAGCCCAGGC GGGCTAACGA GGGTAGCATT AAAAAAAGGA
TCATTAGTCG TCAATTCTTC TCAAGGTGGA GGATGCAAAG ATACATGGGT TGTAGGTAAA
TAA
 
Protein sequence
MKYMFSSYQP KNSFDEYFKD NVNSAREILI PLLSSLDNMG LEELNRNHSA AKKLLLRHGA 
TFRLNDTGLK GTERILPFDP LPRIISKDDW VTLEKGLKQR LEAIDLFLDD IYNSQKIIND
GIIPRELIES SEGWRPQMIG FKPPLNKWCQ ISGLDLIRDR KGDWHVLEDN LRCPSGVAYF
LENRLVMKNI FPNLFSGRIV KPIDEYPSYL LKTLQELAVW TDTPKIVLLT PGIFNSAYFE
HSYLAQEMGI QLVQGHDLVC NDDYVYLKTT SGLKRVDVIY RRIDDDFLDP LNFRKDSCLG
VSGLLDVFKA GHVALANAPG TGIADDKMIY SFVPKMIKYY LDEEIIIKNV ETYICHYQKD
REYVLENLSK LVVKSVAEAG GYGMLIGPHS TTSEIEEFAN KIKNNPRNFI AQPTLELSTV
PSLCDGELYP CHVDLRPYIL RGKDSWVSPG GLTRVALKKG SLVVNSSQGG GCKDTWVVGK