Gene A9601_07871 details

Gene Information

Locus tag: A9601_07871
Symbol:
ID: 4717493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 684081
End bp: 685400
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 28%
IMG OID: 640078501
Product: hypothetical protein
Protein accession: YP_001009180
Protein GI: 123968322
COG category: [S] Function unknown
COG ID: [COG4487] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
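
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_lengths` is hypothetical and used only for illustration.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) from coordinates.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above.
gene_length, protein_length = check_lengths(684081, 685400)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the reported Gene Length (1320 bp) and Protein Length (439 aa).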


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0459178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATA TTAAATGTCC TTCATGCGGC AAAACTTTCC GGATTGATCC CAGCAGCTTT 
GAAGAAATAC TTCTTCAGAT AAAAGACGAG GAGTTTAACA AACAAATAAA AGAAAGACTT
ACTCTAGCTG AAGAAGATAA TAAAAAAGCT TTGGAAATTT TAAAACGAGA GTTAAAAATA
CAGTTAATAG AGCAAAATCG TATTAAAGAG TCCGAAATCC AAACTCTTGA ATCTAAATTA
AAAATAGCTG AAGAAAAGAA AACAAATGCT CTTAATGATT TAAAAAATCA AGCAACAAAT
AAAATTAATT CACTGAATAA TGAATTAATC AAGTTAAAGG ATGAAATTAA AAATCAGTCT
TTAATTTCAG AATTATCCTT AAAAAATAAA GTTAGTGAAG CTGTTAATAA TTTAGAAAAA
GAAAACTCAT CATTAACAAA TTCCATTGAA AAGATGAGGC TTGAACATTC AATTAATGAA
AAATTAATTG AAGAAAAGTT TAAAAGCAAA ATTAGTGAAA GGGACTTGAC TATTCAGGAG
TTAAGAGAAA TGAAATCCAG ATTATCTACA AAGATGATAG GAGAAACATT AGAAATCCAT
TGCGAAACCC AATTTAATCT GAATCGTGCC TCTGCGTTTA AAAACTCATA TTTCGAAAAG
GATAATGATG CCACTTCTGG AAGTAAAGGG GACTATATAT TTAGAGAATT TGATGAAAAT
AAAACTGAAG TTGTATCAAT AATGTTCGAG ATGAAGAATG AAAGTTTAAA TGGAACTAAT
AAAAGAAAAA ACGAAGATTT TTTAAAAGAA TTAGATAAAG ATAGAAGGCA AAAATCTTGT
GAATATGCAG TACTCGTTTC TCTATTAGAA CCAGATAGTG AACTATATAA TGCTGGCATA
GTAGATGTTT CTCATAGATT CCCAAAAATG TTTGTCATAA GACCTCAATT TTTCTTACCC
ATTATTTCTC TGTTAAGAAA TGCATCTATG GAAACCTTAA AATACAAATC ACAAATTGAT
TTAATGAAAC GTGAGAATTT TGATATAACT AATTTTGAAA GTACTCTTGA GCAATTCAAA
AATGCAGTTG GTAAAAATGT TTCTCTTGCC CAAGATAGAT TTAATGATGC AATTTCAGAA
ATTGATAAAT CAATAACTCA TTTACAAAAA ACTAAGGAGG CTTTAGTTCT CTCAAAAAAA
CATCTTTTAT CTGCTGACAG CAAATCTCAA GATTTGACAG TAAAAAAATT AACTAGAAAT
AACCCAACCA TGAAGAAAAA GTTTAATGAT TTAAATAATT TCGAAGATGA AGTAGCCTAA
 
Protein sequence
MKDIKCPSCG KTFRIDPSSF EEILLQIKDE EFNKQIKERL TLAEEDNKKA LEILKRELKI 
QLIEQNRIKE SEIQTLESKL KIAEEKKTNA LNDLKNQATN KINSLNNELI KLKDEIKNQS
LISELSLKNK VSEAVNNLEK ENSSLTNSIE KMRLEHSINE KLIEEKFKSK ISERDLTIQE
LREMKSRLST KMIGETLEIH CETQFNLNRA SAFKNSYFEK DNDATSGSKG DYIFREFDEN
KTEVVSIMFE MKNESLNGTN KRKNEDFLKE LDKDRRQKSC EYAVLVSLLE PDSELYNAGI
VDVSHRFPKM FVIRPQFFLP IISLLRNASM ETLKYKSQID LMKRENFDIT NFESTLEQFK
NAVGKNVSLA QDRFNDAISE IDKSITHLQK TKEALVLSKK HLLSADSKSQ DLTVKKLTRN
NPTMKKKFND LNNFEDEVA
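
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the example uses only the first 60 bases of the gene sequence shown above.

```python
# Codon table built from the usual TCAG ordering; the amino acid string
# is the one shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAAGATA TTAAATGTCC TTCATGCGGC AAAACTTTCC GGATTGATCC CAGCAGCTTT"
print(translate(first_line))  # MKDIKCPSCGKTFRIDPSSF
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the complete gene sequence to `gc_fraction` gives a value to compare against the GC content field of the record.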