Gene A9601_01061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_01061 
Symbol 
ID4716789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp106295 
End bp107368 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content32% 
IMG OID640077804 
Productserine protease 
Protein accessionYP_001008501 
Protein GI123967643 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.057763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCATTA GACATCCTGC AAAAGTAATA TCCCAAACAT TAAGCAAGCC AAAAATTAAT 
GAAGTTAATT TATATTCGAA CAAATCTTTT ATAACTAAAG CTGTAGAAAG AACCGGTGCA
GCTGTGGTGA CAATTGATAC TCAAAGATAT GTTAAAAAAA GAAATTTTCC AAGAAATTCT
CAACTATTTT TAGACCCATA TTTTGAAAGA TTTTTTGGAT TAGATTTAAA TAACGAAAAT
CGACCAAGGA TAGAGCAAAA CCAAGGCAGT GGATTTATAT TTGCAGATGG ACTTGTAATG
ACCAATGCTC ATGTAGTGAA TGGATCAGAT AAGGTAATTG TTGGTTTAAC CAATGGCAAA
AAATTAAACG CTAAACTGAT AGGTCAAGAC TCTTTTACTG ATTTAGCTGT GCTAAAGATT
GAAGGGAAAG GGCCTTGGCC AAAAGCAAAA TTGGGCGATT CTGCAAAGAT TAAAGTTGGT
GATTGGGCTA TAGCAGTTGG AAATCCATTT GGATTGGAAA ACACAGTTAC GCTTGGTATT
ATTAGTAATC TAAATAGAAA CGTAAATCAA TTAGGAATAT ATGATAAAAA ACTTGAACTG
ATACAAACTG ACGCTGCTAT TAATCCTGGC AATTCTGGAG GTCCACTGTT GAATAGCGAT
GGAGAAGTAA TTGGTATTAA TACCTTGATA AGATCAGGTC CAGGAGCCGG TTTGAGTTTC
GCAATCCCAA TTAATAAAGC TAAAGAAATT GCCTATCAAC TTTTAAACAA TGGGAAAGTA
ATACATCCTA TGATTGGAAT TAGCCTAATA GAAGAAAGTG TTTCTGAGAG AAAAAATAAT
GTCGTAAAAG TTGGATATGT AGTACCGAAC AGTCCAGCTG AAAAAAGTGG AATCAAGATA
GATGATATTT TAATTAAAAT AGACAATAAA GATATTGAAA CCGCATCAGA CGTAATAGAA
CAAATTAGTA AAAATGGTAT CAAAAAACAA GTAAATATAT TATTAAAGCG TAAAAATAAA
TTTATTAAAT TAAAAGTAAT ACCAACTGAT ATTACTAATC TACAAAATAA ATAA
 
Protein sequence
MGIRHPAKVI SQTLSKPKIN EVNLYSNKSF ITKAVERTGA AVVTIDTQRY VKKRNFPRNS 
QLFLDPYFER FFGLDLNNEN RPRIEQNQGS GFIFADGLVM TNAHVVNGSD KVIVGLTNGK
KLNAKLIGQD SFTDLAVLKI EGKGPWPKAK LGDSAKIKVG DWAIAVGNPF GLENTVTLGI
ISNLNRNVNQ LGIYDKKLEL IQTDAAINPG NSGGPLLNSD GEVIGINTLI RSGPGAGLSF
AIPINKAKEI AYQLLNNGKV IHPMIGISLI EESVSERKNN VVKVGYVVPN SPAEKSGIKI
DDILIKIDNK DIETASDVIE QISKNGIKKQ VNILLKRKNK FIKLKVIPTD ITNLQNK