Gene A9601_00601 details

Gene Information

Locus tag: A9601_00601
Symbol:
ID: 4716742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 63742
End bp: 65064
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 35%
IMG OID: 640077757
Product: hypothetical protein
Protein accession: YP_001008455
Protein GI: 123967597
COG category:
COG ID:
TIGRFAM ID:
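
The coordinate and length fields in this record are related by simple arithmetic, and a short check can make that relationship explicit. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(63742, 65064)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.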


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TATTGGCATT TTCATTAATT AGTTCCTCGA TATTTCTAGG AATCAATACA 
CTAAATGCAG AAGAATATGA GGCCTTCGGA ATAGATTATT CAGGAGATGC ATCTATTGGA
AATAGAGTAT GGGGTGTTTT GGATGGTCAA AAAACACTAC TTAGTACAAA AGTGTTTGAT
AATAATGGTT GGACACCAGC AGAATCATAT ATAAACGCAA AAACTGGCGA GATAATGGTT
AGAGGAGCAG GGACTAAATT TCATGCTTAC AATTGGAAAA CAGATACTTG GCGAGATATC
TCAGATAATG GTAATTTTCA AAAGTATTTT GTAAAACCGA TGTCAGTTGG AACAACTGCC
GATAGTTCAA TACAAATTGG AGCAGATGCT AATGATATTG ATGTTGTTGA AGATGGTTTG
AATATTGATG GTGCTGCTGT TATTACTAAA AATACTGACG GATCAATTCA ACTTGGAGCA
GATGGTAATG ATATAGATGT TGTTGAAGAT GGTTTGAATA TTGATGGTGC TGCTGTTATT
ACTAAAAATA CTGACGGATC AATTCAACTT GGAGCAGATG GTAATGATAT AGATGTCGTA
GCAGATGGTT TGAACATTGA TGGAACTGCT GTTATTACAA AAAATGCTGA TGGCACAATC
CAAATCGGAA CAGATGAAAA CGATATTGAT ATAACTTCAG AGGGACTTGC AATTGATGGA
GAACCATTAA TTACCAAGAA AGCAAATGGA GAATTACATA TTGGTAAGAA CTCATGGATA
ACAAAAGAAG AAAATGGAAG ACAAAAAGTT TATGCGAAAG ATGCCAATGG AAATCCAATT
CCTATCGATT ACACAAATGG GACCAAGTTA CTTATTAATG GAAGAGATGT AGAACAGTCA
ATCAATAATG TTGGTGCTTT AAGTGCCGCC CTAACAGGAT TGCCCACAGT TCCTACAGAT
ACAACCCTTG CTTGCGGATT AGGAACTGGA ACTCATGGAG GTGATTTTGC TTTTTCTGGT
GGCTGTGCTT CTAAAGTTAA TGACAAATTA TCAATTAACT ATGCGGCGTC AATGACAATG
CCAGGTCAAG ATTATGCCGG TGATTTTGAA GATACTTTTT CCGCTAGAGC AGGATTTGTT
TGGAAATTAG GTAAGGCCAC AAAACCTATT CAAATTAGTA TGAATGAAAA AGAGAATTTC
GAAACAAAAA TCAAAACTCT AGAAGAAAAA AATAAACAAC TCTTAGCAAG GCTAGAAAGA
TTAGAAAAAG TCGCACTTGG AGATCTTAAA TCAAAAGATT TAGCAGTTTA TAAACTCAAA
TAA
 
Protein sequence
MKKLLAFSLI SSSIFLGINT LNAEEYEAFG IDYSGDASIG NRVWGVLDGQ KTLLSTKVFD 
NNGWTPAESY INAKTGEIMV RGAGTKFHAY NWKTDTWRDI SDNGNFQKYF VKPMSVGTTA
DSSIQIGADA NDIDVVEDGL NIDGAAVITK NTDGSIQLGA DGNDIDVVED GLNIDGAAVI
TKNTDGSIQL GADGNDIDVV ADGLNIDGTA VITKNADGTI QIGTDENDID ITSEGLAIDG
EPLITKKANG ELHIGKNSWI TKEENGRQKV YAKDANGNPI PIDYTNGTKL LINGRDVEQS
INNVGALSAA LTGLPTVPTD TTLACGLGTG THGGDFAFSG GCASKVNDKL SINYAASMTM
PGQDYAGDFE DTFSARAGFV WKLGKATKPI QISMNEKENF ETKIKTLEEK NKQLLARLER
LEKVALGDLK SKDLAVYKLK
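
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates with the standard codon assignments (which translation table 11 shares for sense and stop codons). The helper names are hypothetical, and alternative start codons permitted by table 11 are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAAAAAAT TATTGGCATT TTCATTAATT AGTTCCTCGA TATTTCTAGG AATCAATACA"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1), translate(dna))
```

Passing the full gene sequence through the same functions gives a way to compare the result against the Protein sequence and GC content fields of the record.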