Gene A9601_13891 details


Gene Information

Locus tag: A9601_13891
Symbol:
ID: 4718110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1156152
End bp: 1157402
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 28%
IMG OID: 640079110
Product: hypothetical protein
Protein accession: YP_001009780
Protein GI: 123968922
COG category:
COG ID:
TIGRFAM ID:
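
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # total codons minus the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(1156152, 1157402)
print(gene_len, protein_len)  # 1251 416
```

Under these assumptions the result agrees with the listed Gene Length (1251 bp) and Protein Length (416 aa).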


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAG AAAAGCCTCT TTTTAAATTT TTTTATATTG GCATTTTTTT ATTACCTTCA 
GCTCCTAGCA TTGGATCTAT TTTTCTTTTT TTATGTCTCA TTTGCTCATT AATAAATAAC
TTTTTAGAAT TAATCAAAGA TAGATGGAAT ATTACTTTTT TTATATCTAT TTTTTTGTTT
CCAATAATTT GCTTAATACA GAGTAGTAGA TTTTTCTACA AATTTAATAA TTTTGATAAG
TCACTTACAT GGATTGGTTT AAATAATTGG ATTCCTTTAA TTTTATGTTT CATCGCCTTT
CAAAAATTTG TTAATAGCAA ATCAGATAGG GAAATTATTG GAAAGCTTTT AATAGCTGGT
AGTTTTCCCC TCATAATTTC AGGAATTGGT CAATATTGGT TTAATTGGTA TGGACCATTC
GAATTTTTAA ATGGATTTAT TATTTGGTTT CAAAGACCTA TGCAAACCGA AACTGGATTA
ACTAGTTTAT TTAGCAACCA AAATTATGCG GGATCTTGGT TTTGTATAGT TTGGCCATTT
TGTCTATCCT TTTTTATTCA ATCATTCAGA AATAATTTAC ATAGATTTAT ATCACTAGGA
TTTTTGATTT CCATTTCAAC ATGTCTGATA TTAACAACTT CCAGAAATGC ATGGGGAGGA
TTATTGTTAT TGATTACTTT ATTAAGAGGA GCCTCCCTTT TTTGGCCAAT ATTTATAGGT
ATAACTATTA CAATAATTAG TGTTTTTCTA CTAAATATTC TAATCCCACT AGATATACAA
ACGACTATAA GTAACCTATT TCCTTCTTGG ATTAATCAAG AATTCACTTC AACTCATTTT
CAATTTAGAG AGTCAAGGCC TGAAATATGG TGGGAAGCCA TAAAACTAAT ATTCAAAAAT
CCCCTATTAG GTTTGGGAGC TGGTGCATTT CCTATTATCT ATCAATCCTT AAAGAATGCT
TATGCAGGAC ATACACATAA TTTAGTATTC GAATTAGCTT TAAGTTATGG TATCCCAATC
ACATTAATAG TTTTTGTACC AATATTTCTA ATTTGTTTTT TTTCCTTTAA AGAAATTTAT
ATCAAGAAAA CAAATAATAT TGATATAAAT GAAAGAGCAT GGTTCGCTTC ATTTTTTACA
CTATTATGCA CTCAACAAGT TGATGTACAG TACTTTGATC TAAGAATAAG TATAATTTTC
TGGGTTTTAC TAGCAGGGCT TAAAACACGT ATAAGCCCCC AAATAATTTA A
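
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be computed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical. It is demonstrated only on the first row of the sequence, so the value it prints is for that row and not the whole-gene GC content listed in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block layout."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAAATAG AAAAGCCTCT TTTTAAATTT TTTTATATTG GCATTTTTTT ATTACCTTCA"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))  # 60 0.2
```

Passing the full 21-row sequence block to `clean_sequence` in the same way would give the complete 1251-base gene for comparison with the Gene Length and GC content fields.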
 
Protein sequence
MKIEKPLFKF FYIGIFLLPS APSIGSIFLF LCLICSLINN FLELIKDRWN ITFFISIFLF 
PIICLIQSSR FFYKFNNFDK SLTWIGLNNW IPLILCFIAF QKFVNSKSDR EIIGKLLIAG
SFPLIISGIG QYWFNWYGPF EFLNGFIIWF QRPMQTETGL TSLFSNQNYA GSWFCIVWPF
CLSFFIQSFR NNLHRFISLG FLISISTCLI LTTSRNAWGG LLLLITLLRG ASLFWPIFIG
ITITIISVFL LNILIPLDIQ TTISNLFPSW INQEFTSTHF QFRESRPEIW WEAIKLIFKN
PLLGLGAGAF PIIYQSLKNA YAGHTHNLVF ELALSYGIPI TLIVFVPIFL ICFFSFKEIY
IKKTNNIDIN ERAWFASFFT LLCTQQVDVQ YFDLRISIIF WVLLAGLKTR ISPQII