Gene A9601_01211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_01211 
Symbol 
ID4716804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp119093 
End bp120247 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content28% 
IMG OID640077819 
Producthypothetical protein 
Protein accessionYP_001008516 
Protein GI123967658 
COG category[S] Function unknown 
COG ID[COG3146] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAAA AAATACATAA AGTTGAAGTC AAATTATCAA TTAAAGAAAT CTCCAAGGAG 
ATATGGAATG AATTAGCAAA TGAAATCAAT AATCCATTTT ATGAATGGAC TTGGCTTAAA
AACCTTGAAA TATCAAAAAG TGTTTCAAGA GAAACTGGTT GGCAGCCTCT ATATTTTGTT
GCTTATAAGA ATGAAGAAAT ATTAGGTATC GCTCCACTTT TCTTAAAAAG TCATAGCTAT
GGAGAATTCA TTTTTGATCA ATCATTTGCA CGATTGGCTC AAGAGCTTAA TTTAAATTAT
TACCCTAAAT TAATTGGAAT GAGCCCTTAT AGTCCTGTAA ATGGATATCA ATTTCTTTAT
AAAAAAAATA AAGATAAGAA AGAAATTACA AATTTACTTA TAAAAAATAT AGAAAGCTTT
GCGATTACAA ACAAAATTTT AAGTTGTAAT TTTTTATATA TTGATGAAAG CTGGGGCAAC
CATCTTAAAT CTTTGGGATA CTATGAATGG ATAAATTCCA GCAGTGAATG GAGAAGTAAT
GGAGAAAAAA CGTTTGATGA TTTTCTTTCT AGATTTAATT CTAATCAGAG AAAAAATATC
AAAAAAGAGA GGAAATCAAT TACTAAACAA GATATTAAAA TAAAAATTTT TAATAAAGAT
GATATCAACC AAGAAATCCT CAAAAAAATG CATAATTTTT ATGAACAGCA TTGCTCGAGG
TGGGGAGTTT GGGGAAGTAA ATATCTAACA TCTACATTTT TCGAAAAAAT TGTTGATAAT
AAAAAAAATC TTTTACTTTT TAGCGCATCA AAAAATAATT CAAATGATAT TTTTGCTATG
TCGATGTGCG TTAAAAATAA AAACAACTTA TGGGGTAGAT ATTGGGGTAG TGAAGAAGAC
ATATCTAATT TACATTTTGA ATTATGTTAC TACCAGCCAA TTGAATGGGC AATAAAAAAT
AGTATCTATT TTTTTGATCC TGGGGCAGGT GGTAAACATA AAAGGCGGAG GGGGTTTTTT
GCAAAAAGCA CCATTAGCTT GCATAAGTGG TTTGACAAAA ATATGGAAAA TATAATTTAT
CCTTGGCTAA ATGAAGTTAA TAAACAAACC GAGACCGAAA TTGAATTTGA GAATGATTCT
ATACCCTTTA AATAA
 
Protein sequence
MNQKIHKVEV KLSIKEISKE IWNELANEIN NPFYEWTWLK NLEISKSVSR ETGWQPLYFV 
AYKNEEILGI APLFLKSHSY GEFIFDQSFA RLAQELNLNY YPKLIGMSPY SPVNGYQFLY
KKNKDKKEIT NLLIKNIESF AITNKILSCN FLYIDESWGN HLKSLGYYEW INSSSEWRSN
GEKTFDDFLS RFNSNQRKNI KKERKSITKQ DIKIKIFNKD DINQEILKKM HNFYEQHCSR
WGVWGSKYLT STFFEKIVDN KKNLLLFSAS KNNSNDIFAM SMCVKNKNNL WGRYWGSEED
ISNLHFELCY YQPIEWAIKN SIYFFDPGAG GKHKRRRGFF AKSTISLHKW FDKNMENIIY
PWLNEVNKQT ETEIEFENDS IPFK