Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Maqu_2259 |
Symbol | |
ID | 4654899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008740 |
Strand | - |
Start bp | 2532107 |
End bp | 2533585 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639812233 |
Product | protease Do |
Protein accession | YP_959524 |
Protein GI | 120555173 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.661037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAAA AAAATATAGA AGCTGCCCGT ATGCCTTCCC TGGCCAGGCC CGGTGCCATG CTGGGCGTTC TGCTGATGCT GGCTGCCATG GTGTCGGTAT TCTGGAGCCA GGGCGTCGCA GCCAGAGGCC TGCCGGATTT CACTGAACTG GTAGAAGATA ACTCCAGTGC GGTGGTGAAT ATCAGTACCA CGACGGATCC GGGCAGCCAG AGCTCTGGCT TCCATGGGCT CCCTTTTGAT GAAAGGCAAC TGGAGCAGTT GCCGCCTTTC CTTCAGGATT TTTTCCGCGG CCCGCAGTCT CCTTTTGGTG GTTCCCCTCG GCCGCAGCAG CCGCGCAGGA GCATGGGCTC CGGCTTTATT GTGTCAGCCG ATGGTTATGT ACTGACCAAC AATCACGTGG TTGAGGGTGC CGATGAAGTG ATTGTGCGCC TGAATGACCG CCGCGAGTTT TCCGCCACTA TCGTGGGTAC CGATCCCCGC TCGGATATGG CGGTTCTCAA AATCGAGAAT GGTGAAGACC TGCCGGTGGT CAGCGTTGGA CGTTCCCGGG ATCTGAAAGT CGGCGAATGG GTTTTCGCGA TCGGCTCGCC ATTCGGGTTT GACTACACGG TGACGGCGGG CATTGTCAGT GCTCTGGGCC GCTCGCTGCC ATCCGAGAAC TACGTGCCGT TTATCCAGAC CGACGTTGCC ATTAACCCCG GTAACTCCGG TGGCCCGCTG TTCAACCTGG AAGGCGAAGT GGTGGGCATA AACTCCCAGA TTTACACCCG CTCCGGCGGC TTCATGGGGG TGTCGTTCGC CATTCCGATT GACGATGCCA TGAACGTATT CCGCCAGCTC CGTGACAAAG GTACCGTCGC CCGGGGTTGG TTGGGCGTGC TTATTCAGGA AGTGAATCGG GATCTGGCCG AGAGTTTCGG GCTGCGCCGT CCCCGCGGCG CGTTGATTGC CGAAGTGATG CCGGATTCAC CGGCGGAGAA GGGGGGCCTT GAGGCCGGTG ACATTGTGCT GGAATACAAT GGCGAGGATG TTCAGTTGTC CTCTGACCTG CCGCCCATGG TTGGCCGCAC ACCGGTAGGT GAATCGGCGC GCCTGACGGT GTTGCGCGGT GGTGATGAAA TCACCCTCGA CGTGGCAATT GGCAAGTTGC CGGAAGACGG CGATGACGCT GCCCAGCCCT TCACCGGTAG CCGCGACAAC AGTGCGGGCG CACCGCTTGG GTTGTCCGTT GAGCCACTGG CTCCCGAGAC TGCCCGTTCA GTGGGCGTGG AAGGCGGTGT TGTAGTCGCT GGGGTGGATC GTGGCCCGGC TTTTGAGGCT GGTATTCGTG CCAGGGACAT CATTACCGAG ATCAACCGCC AGCAGATTCG CTCGGTGGAG GACTTCCGGT CGGTGGTCCG CGACTTGCCG GAAAACCGGG CGGTTTCGGT CCGGATTGTA AGGCAGGGGA GGGCGATCTA CCTGGTCATG AAGCCCTGA
|
Protein sequence | MPEKNIEAAR MPSLARPGAM LGVLLMLAAM VSVFWSQGVA ARGLPDFTEL VEDNSSAVVN ISTTTDPGSQ SSGFHGLPFD ERQLEQLPPF LQDFFRGPQS PFGGSPRPQQ PRRSMGSGFI VSADGYVLTN NHVVEGADEV IVRLNDRREF SATIVGTDPR SDMAVLKIEN GEDLPVVSVG RSRDLKVGEW VFAIGSPFGF DYTVTAGIVS ALGRSLPSEN YVPFIQTDVA INPGNSGGPL FNLEGEVVGI NSQIYTRSGG FMGVSFAIPI DDAMNVFRQL RDKGTVARGW LGVLIQEVNR DLAESFGLRR PRGALIAEVM PDSPAEKGGL EAGDIVLEYN GEDVQLSSDL PPMVGRTPVG ESARLTVLRG GDEITLDVAI GKLPEDGDDA AQPFTGSRDN SAGAPLGLSV EPLAPETARS VGVEGGVVVA GVDRGPAFEA GIRARDIITE INRQQIRSVE DFRSVVRDLP ENRAVSVRIV RQGRAIYLVM KP
|
| |