Gene A9601_13451 details


Gene Information

Locus tag: A9601_13451
Symbol:
ID: 4718064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1120077
End bp: 1121381
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 28%
IMG OID: 640079064
Product: hypothetical protein
Protein accession: YP_001009736
Protein GI: 123968878
COG category: [R] General function prediction only
COG ID: [COG4310] Uncharacterized protein conserved in bacteria with an aminopeptidase-like domain
TIGRFAM ID:
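
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the terminating stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1120077
end_bp = 1121381

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1305 bp) and Protein Length (434 aa).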


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATA TTTACAATGA TTTATCTTTT TTATTCAATA ATAATAGAGG TATAGTTAGT 
GACTTAAATA ATGATTTAAA CAAAAGATTA TGTGAATTAA TTCCATTTAA AAAAATAAAA
TATAAATCTG GAGAAAAAAT TGACAATTGG AAAGTTCCTT TATCTTGGGA ATTAATTAGA
GTAGAAGTTA AAATTAATTC CTTATCAATA AATCAAAAAG ATATACCTTT GATTGTGCCT
TTTGGAACCG CATCATTTAG AGTTTCAGGA AATTATATTG ATCTTAAAAA ATTTATTTAT
ACTTTAGAGG ATAAACCTTT AGCTACTCCT TATAGAACAA ATTACTATTC GCCCAAAAAT
TATAAGATCT GTTTACCCTT TAAATATTTA TCCTGTCTAA ATGATGAAGA CCAAATATCA
ATAAATGTAG AGTCAAGAAC CAAGCCATCG AATTTAGAAG TTCTTGAAAT AACCTTAGAA
GGTAATTCAA AGCATGAAAT TCTTTTTACA ACTTATAACT GTCATCCTGG ATTAGGTAAT
GATAATTTTT CCGGTTTAAT TGGATTGTGC AAATTATACA GACAACTTTC AAATCTTAAT
AACCTTCACT TTACTTATCG TTTTGCCGTT TTCCCAGAGA CAATAGGTGC TATTTTTTAT
ATAAACTATC TTCAAAAAAA TGATGAATTA CAAAATATTC TTTTTAGCTC AGTTTTAACA
TGCCTAGGAG GTAAATTAAA AAATTATAGT TTCAAAGAAT CCCCAGTAAA ATCATCTTAT
TCCGAAGCGT ATAAAAACGA ATTAAAAAAA GAAATCCCTA ATATAAAAAT AATGCCATTT
ACTCCTGATG GCAGTGATGA GAGACAATTC TCTTCCCCAA ATGTTGGAAT TGCGAGTTCA
AGCCTATGCA GAAACAGATA CTACGAATAT GAAGAATATC ATACATCTCT TGATACACTT
GAATATATGG ATATTTGTGC AGTAAACGAA AGCACAAATT TTATATTTAA TGCAATTAAA
AGTCTTGATA AAAATCTCAG GATTCCAAAG TCTCATGCCA GATTTGGAGA ACCTTGCTTA
AGTGCTTATG ATTTGTTTTT ACATGATGGG GGCTCATATA CATCGAAAAA AACCAATTCT
AATATAAATC AAAAAAAAAT ATTATTTACA CTTTTATCAA TTATTGATGG TAAACTCTCT
TTCGAAGAGA TAGTTAAACT TACTATGAGT AAAATAGATT CCGAGAGATC AGATGTTGAG
AAAGTTCTTC AAAAAGTAAT AGATTTAAAT ATAGTTTATG ATTAA
 
Protein sequence
MKDIYNDLSF LFNNNRGIVS DLNNDLNKRL CELIPFKKIK YKSGEKIDNW KVPLSWELIR 
VEVKINSLSI NQKDIPLIVP FGTASFRVSG NYIDLKKFIY TLEDKPLATP YRTNYYSPKN
YKICLPFKYL SCLNDEDQIS INVESRTKPS NLEVLEITLE GNSKHEILFT TYNCHPGLGN
DNFSGLIGLC KLYRQLSNLN NLHFTYRFAV FPETIGAIFY INYLQKNDEL QNILFSSVLT
CLGGKLKNYS FKESPVKSSY SEAYKNELKK EIPNIKIMPF TPDGSDERQF SSPNVGIASS
SLCRNRYYEY EEYHTSLDTL EYMDICAVNE STNFIFNAIK SLDKNLRIPK SHARFGEPCL
SAYDLFLHDG GSYTSKKTNS NINQKKILFT LLSIIDGKLS FEEIVKLTMS KIDSERSDVE
KVLQKVIDLN IVYD
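
The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is one possible way to do this in plain Python, not the method used to generate this record. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which this sketch ignores) and that translation stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGAAAGATA TTTACAATGA TTTATCTTTT TTATTCAATA ATAATAGAGG TATAGTTAGT"
print(translate(first_line))
```

Translating the first line of the gene sequence gives MKDIYNDLSFLFNNNRGIVS, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.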