Gene A9601_14091 details


Gene Information

Locus tag: A9601_14091
Symbol:
ID: 4718130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1180203
End bp: 1182002
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 27%
IMG OID: 640079130
Product: hypothetical protein
Protein accession: YP_001009800
Protein GI: 123968942
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
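
The coordinate and length fields in this record can be cross-checked with simple arithmetic: the inclusive span from Start bp to End bp should equal the gene length, and the number of codons minus one stop codon should equal the protein length. The sketch below is illustrative only; the function name `check_cds_record` and its return keys are hypothetical and not part of any database API.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return consistency checks for the length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    span_bp = end_bp - start_bp + 1          # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_whole_codons": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record above.
checks = check_cds_record(1180203, 1182002, 1800, 599)
print(checks)
```

For the values listed here, 1182002 - 1180203 + 1 = 1800 bp and 1800 / 3 - 1 = 599 aa, so all three checks hold.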


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACCA GAAACGTAGA AAATATTGAT ATTAATTTGC CGAATTTAAT TTATGCGCTT 
TGGAAAAAAT TAAAAGAACA AAGAAAAATG CAAATTATTT TTCTTTTTTG TTTTGTATTG
GCTAGCGCAT TTTCCGAAGT TTTTTCATTA GGTTCAGTAT TGCCATTTTT ATATGTATTA
ATAAACCCAA TAGGACTTTG GAATTTAACT TTCTTTAGAA ATATTTTTAT ATTTTTGGGT
ATTAATAATC CTAATTACTT ATTACTCCCA ATGACAGTAA TTTTTTGTCT TTGTATAGTT
TTTGCAGCTT TTTTTAGATT AGTCACTATT TGGCTAAACT GCAGATTGTC TGCTGCAATA
GGTTCAGATT TGAGTTGTGA GGTTTTTACA AGAACTATTT TTCAACCATA TAAATACCAT
TTAGAAAGAA ATAGTAGTGA ATTAATTGCG GCGATTAACA TTCATATTCC TCAATCTATT
TATTCAATAA ATTTATTTTT TAAATTAATA AGTAACGCAA TTATTGCTTC AAGTATCATA
ATTGCATTGT TAATTATCAA TCTGAAGATT GCCTTATCAT TAATAATTGT TTTTGGATTT
GCTTATCTAT TGATTTCTAT TTTTATAAAA AATAAACTTG CTGCAAATAG TTTGTTTGCA
GTAAATGCAA CTCAAAATCA ATTATCAATA ATACAAGAAA GTTTAGGTGG TATAAGAGAC
TTGATAATTG ATCAAAATTT TAATTATTAC ATTAAAAAAT TTATTAAATA TGATAAACCA
TTAAGAATAA GAGATTTACA AAATGAGTTT TTGGGTTCTT TCCCTAAATA TGCCTTAGAA
GCTCTTGGTA TGATTCTTAT AGCTGTTTTA GGTTTCTTAA TTAAATCTTT ATCTCCTAGC
ACAGTTAATG CAATACCACT TCTAGGAACT ATTGCTCTTG GGGCACAAAG ATTACTACCT
TCTCTACAGC AAGTATTTAC AAGTTGGTCT GCAATAAAAG CTAAGCAAGA AAATTTAAAA
AAAATACTTG ATATACTTGA TCGCCAAAAC TATAACAAAA ATTTTAATTT AGGATATTCT
GAAATTTCCT TTAACAAGGA ATTAAGACTT TCATCAATAA GTTTTAAACA TTTAAATCAA
AAAAAATCAA TATTTGAAAA TATCCACTTA ACTATTAATA AAGGCGAATG TCTAGGAATT
ATTGGTACAA CAGGTAGTGG TAAAAGTACT TTTATAGATA TAGTTATGGG TTTGCTAATA
GGCTCACAGG GGTACTTGAA AATTGATGGT CTAAATTTAT ATTCAGGGAA AGATAGTTCA
AGAAAAATAA GGGCTTGGAT GTCAAAAATT GCGCATGTTC CTCAAAGTAT TTTTCTTTCT
GATAGCACTA TTGCTGAAAA TATAGCTTTT GGTATTGAAT TAAATAATAT TGATTACAGA
AAATTAAAAA ATGCTATTGA AGCAGCTCAA CTTAATGATT TTATTGAAAG TCTGCCGAAT
AAATACAACA CTTTTGTGGG AGAAAGAGGG GTTAAATTAA GTGGAGGTCA AAGGCAAAGA
ATTGGTATTG CTAGAGCATT TTATAAGAAT CCACAAATTT TAATTTTAGA TGAAGCAACA
AGTGCATTAG ATATTAGAAC AGAGAGGAAA ATAATGGAAA AAGTAAATTG CCTAAGTAAA
GACTTAACTA TCATTATTAT TGCTCATCGT CACTCAACTT TAAAAAACTG TGATAGGGTT
ATTGAGATTA ATGGAGGTAA GATAATCAAA GAAGGTTTAC CTAAAGATGT GTTATATTAA
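
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the same space-separated 10-base blocks; the function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence_text):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines between 10-base blocks) is ignored.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as a small example;
# pass the full sequence to compare against the GC content field.
first_line = "ATGACTACCA GAAACGTAGA AAATATTGAT ATTAATTTGC CGAATTTAAT TTATGCGCTT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```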
 
Protein sequence
MTTRNVENID INLPNLIYAL WKKLKEQRKM QIIFLFCFVL ASAFSEVFSL GSVLPFLYVL 
INPIGLWNLT FFRNIFIFLG INNPNYLLLP MTVIFCLCIV FAAFFRLVTI WLNCRLSAAI
GSDLSCEVFT RTIFQPYKYH LERNSSELIA AINIHIPQSI YSINLFFKLI SNAIIASSII
IALLIINLKI ALSLIIVFGF AYLLISIFIK NKLAANSLFA VNATQNQLSI IQESLGGIRD
LIIDQNFNYY IKKFIKYDKP LRIRDLQNEF LGSFPKYALE ALGMILIAVL GFLIKSLSPS
TVNAIPLLGT IALGAQRLLP SLQQVFTSWS AIKAKQENLK KILDILDRQN YNKNFNLGYS
EISFNKELRL SSISFKHLNQ KKSIFENIHL TINKGECLGI IGTTGSGKST FIDIVMGLLI
GSQGYLKIDG LNLYSGKDSS RKIRAWMSKI AHVPQSIFLS DSTIAENIAF GIELNNIDYR
KLKNAIEAAQ LNDFIESLPN KYNTFVGERG VKLSGGQRQR IGIARAFYKN PQILILDEAT
SALDIRTERK IMEKVNCLSK DLTIIIIAHR HSTLKNCDRV IEINGGKIIK EGLPKDVLY
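
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below is an illustration, not the database's own method: it builds the codon table from the standard NCBI amino-acid string (tables 1 and 11 share the same codon assignments and differ only in permitted start codons), and the names `CODON_TABLE` and `translate_cds` are hypothetical. It does not handle alternative start codons or ambiguous bases such as N.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence_text):
    """Translate a coding sequence, dropping one terminal stop codon if present.

    Whitespace between blocks is ignored; the length must be a multiple of 3.
    """
    bases = "".join(sequence_text.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("ATGACTACCA GAAACGTAGA AAATATTGAT"))  # MTTRNVENID
```

The first ten residues produced here match the start of the protein sequence listed above (MTTRNVENID).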