Gene A9601_14311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14311 
Symbol 
ID4718152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1203163 
End bp1206069 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content25% 
IMG OID640079152 
Producthypothetical protein 
Protein accessionYP_001009821 
Protein GI123968963 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR01846] type I secretion system ABC transporter, HlyB family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATGG ATAAAGATAG TTACTTTAAA AATTTAGATC TTGATAAGGA GATAATTAAA 
ATAATTGAAA AAGATATTTT TTTAGAGAAT TATTCAGTAG GTGAAGAAAT ATTCAATCCT
GAAATCACAA TTAATAAAGT TTCAATTATT TTATCTGGAA GTATTAGACA AATAAAAAGA
GATTCTACAA ATAATACCAA TATTTATAAA TATGTTAAAA ATGATTTTTT ATTTATTCCT
GAATTGATTT ATAAACTTAA AAATTCTTTT TACTATATAG CAGCAAATGA CTTGCAGTTA
ATATCTATTG AAAAGGAAAA ATTTTTAAAT TTATTAAAGG AAAATAATGA ATTTCGTAAA
TGGATAAATA ATCAAATCTT CAAAAATGAG AAAATTTCAA TTTTAAATAA ATTATTAAAG
GAGGAATTTA ATAACAATTT TGATAAAGAA CAACTTCTTA ATAATCTTTC AGAAAATATA
GACCTTGTTA ATGAGAAAAT TTTGAAAAAT ATTCAGGATA AAAAGATTGA TTCAAAAAAT
TTTGAAATTA TTTCAATTTC TAAATCAATT CACTTTGATT ACCTTGAAAA AATAAATTTC
GAGAAGATAT TAGGTTTAGA TTTTTCAGAA CTAGAAAGAT TAGTAATTAT TAATAATAAA
TTCAAGAAAT TTAAAATTCA TAAAAATAAA ATTCCCAAAG TAGATAAAAT TTCTCAAATG
GTTGAAGAAT CTATAGATGA ATCAAAAAAA GAATTACATT ATGCAGATAT AAATATTAAA
AAAAATGTTT GCAATAGGAA AGATAATGTT ATTGAATGTT TTAGGATTTT AAGTAAGTTA
ATTGATATTA ATTATCGAGT TGATCCAATA AGCAATTATT TAGATTTTCT TGATAAGAAT
AAAAAGAAAT ATACTTTTAG GAATTACGCA GAAATTGCTT ATGGATTAGG TTTAGAGGTA
TCTTGTGGAG AACTTAGCAT ATCGCAAGTA CTTAAGGTTA AAACGCCTTC ATTAATAATT
TATAAAAATG ATTTAGCTTT AGTAGTTAAT GCAGACAGAG AAAAACTGAC TCTAATTTAC
CCTGCAGATG GATTGATTAC TTTGTATAAA AATGATTTAG AGAAAATATA TGAGGGAAAT
ATTAACATCA TAAATATTTC AAAAAATCGT TTAACTCAAG AAAACAAATT CTCAATAAGT
TGGTTTATTC CAATTTTAAA AGAATATAAA AATACTTTAT TCCAAATTTT AATTTCAGGA
TTAGTTGTAC AAATATTTAT ATTGTCGAAT CCATTATTAA TTCAAGTAAT AATTGATAAG
GTTATATCGC AAAGAAGTCT AGATACACTA CAAGTATTAG GATTCGCACT ATTAGTAATT
ACAGTTATTG AAGCAGTATT ATCAAGTATA AAATCTTTTA TCCTTTCAGA AACTACTAAT
AGAATAGATC AAAAATTAGG AATTAAAATA ATTGATCATT TATTTAGATT ACCTCTTGAA
TATTATGACA AAAGATCTAT AGGAGAATTA TCTAATAGGG TAGGTGAACT TGAGAAAATT
AGGAACTTTT TGACTAGTCA AGGTATTAAT ACTTTTTTAG ATGCGTCATT TTCTTTATTT
TATATTTTCG TACTATTTTT ATATAGCGGT AAGCTTACAT TAATAGCTTT AAGTGTTATC
CCAATTCAGA TTTTAATTAC ATACTATGGA TCGCCACTTT TTAAAAAACA ATATCGGAAA
GCAGCTATTA ATAATGCAAA TACTCAAAGT TATTTAGTAG AAGTGCTATC TGGTATCCAA
ACGGTAAAAA CACAAAATGC AGAAACCTCA AGTCGCTGGA GATGGCAGAA TTACTATTCA
AAATTTATTA AGAGTACATA CCAAAAAACG ATTACAGCTG TTTCATTAAA TCAACTTACT
CAATCTCTGC AAAAAATTTC TCAATTAATA GTTTTATGGT ATGGAGCAAT AATGGTTTTA
AACGGTGAAT TTACTCTTGG TCAACTAATT GCATTTAGAA TCATTTCTGG ATATGTAACA
CAACCAATTT TAAGGTTGAG CACTATATGG CAACAGTACC AGGAAATAAA AATTAGTTTT
GAAAGATTGG GAGATATTGT TAATACTCCA AAAGAAAATG AATCAAAAGA TTTAGGAAAA
ATTCAACTGC CAAGTGTTGA GGGGAATATT TTATTTGATA ATGTATCATT TAAATTTATT
GGCGACTCCA AAACAACTCT GAATAAAATC AACTGTCAAA TTGATAAAAA TTCTTTTGTT
GGAATTGTTG GTAAAAGTGG AAGTGGTAAA AGTACATTTT GTAAATTAAT TTCTAGGCTT
TATGTACCTA ATGAGGGGTC TATTTTAATT GATAAATACG ATATCCAAAA GGTAGAAATA
AGTTCAATTA GAAGGCAATT AGGGATAGTT AGTCAAGACC CTTTACTTTT CGCTGGAACA
ATAAGAGATA ATATATGTTT TGGTGATGAA AGTTTTTCTG ATAAGGAGAT TGTAGAAGCA
TCAAAAATAT GTTGCGCCCA TGAATTTATT ATGGAACTTC CATTGGGATA CAATACAAAA
ATTTCTGAGA AAGGAAGTTC ATTAAGTGGG GGACAACGTC AGAGAATTGC ATTAGTAAGA
GCATTATTAA AAAAACCAAA AATAATTATC TTAGATGAAG CAACAAGTGC TTTAGATATA
GAAACTGAAC AACTATTTGT TAAAAATCTA TTAAATAAAT TTAAAAATTC AACAATAATA
ATTATTACGC ATAGATTATC TAACGTTATA AATGCAGATA AAATTCTTGT TTTTGAAAAA
GGTGACTTAT CTGAACAAGG AGATCATGAA TCACTTCTTA AAAACAAATC AGTATATTAT
TCACTCTTAA ATAATGAGGA AAAATAA
 
Protein sequence
MNMDKDSYFK NLDLDKEIIK IIEKDIFLEN YSVGEEIFNP EITINKVSII LSGSIRQIKR 
DSTNNTNIYK YVKNDFLFIP ELIYKLKNSF YYIAANDLQL ISIEKEKFLN LLKENNEFRK
WINNQIFKNE KISILNKLLK EEFNNNFDKE QLLNNLSENI DLVNEKILKN IQDKKIDSKN
FEIISISKSI HFDYLEKINF EKILGLDFSE LERLVIINNK FKKFKIHKNK IPKVDKISQM
VEESIDESKK ELHYADINIK KNVCNRKDNV IECFRILSKL IDINYRVDPI SNYLDFLDKN
KKKYTFRNYA EIAYGLGLEV SCGELSISQV LKVKTPSLII YKNDLALVVN ADREKLTLIY
PADGLITLYK NDLEKIYEGN INIINISKNR LTQENKFSIS WFIPILKEYK NTLFQILISG
LVVQIFILSN PLLIQVIIDK VISQRSLDTL QVLGFALLVI TVIEAVLSSI KSFILSETTN
RIDQKLGIKI IDHLFRLPLE YYDKRSIGEL SNRVGELEKI RNFLTSQGIN TFLDASFSLF
YIFVLFLYSG KLTLIALSVI PIQILITYYG SPLFKKQYRK AAINNANTQS YLVEVLSGIQ
TVKTQNAETS SRWRWQNYYS KFIKSTYQKT ITAVSLNQLT QSLQKISQLI VLWYGAIMVL
NGEFTLGQLI AFRIISGYVT QPILRLSTIW QQYQEIKISF ERLGDIVNTP KENESKDLGK
IQLPSVEGNI LFDNVSFKFI GDSKTTLNKI NCQIDKNSFV GIVGKSGSGK STFCKLISRL
YVPNEGSILI DKYDIQKVEI SSIRRQLGIV SQDPLLFAGT IRDNICFGDE SFSDKEIVEA
SKICCAHEFI MELPLGYNTK ISEKGSSLSG GQRQRIALVR ALLKKPKIII LDEATSALDI
ETEQLFVKNL LNKFKNSTII IITHRLSNVI NADKILVFEK GDLSEQGDHE SLLKNKSVYY
SLLNNEEK