Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0256 |
Symbol | |
ID | 4284120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 295234 |
End bp | 297402 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638139719 |
Product | heparinase II/III family protein |
Protein accession | YP_755487 |
Protein GI | 114568807 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.627284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGGA TTGTCGGACT GGTCGCTGCC GCCTTGCTGG CAACGACCTC CTGCGCGTCG GCGCAGCCAA ACCTGATCCT CACGGGCGAG GGCGTTGAAG CCTTCCGTGA AGCGGGGGCG CTGCCGCCAC TGATGCAACG GGCCCTGGAC GCCGCAACGC GACGGGTGGA GGCCAGCATC CAGGCCGGCC CGATTGTCCC GGAGCCTGTC GATCCCGGTG GCGGTTATTC GCACGAGCGC CACAAGGAAA ACTACAAGAT CATCCACGAT GCCGGCTTGC TCTTTCAGCT TACCGGAGAG CGGCGCTATC TCGAGCATGC CGAGACCTAC CTGCTGGCCT ATGCGGACAT GTATGGTGAC CTGCCATTGC ACCCTGAGCG TCGCAACCAG GCGCCGGGCC GGTTGTTCTG GCAAAGCCTG AACGAAGCTG TCTGGCTGGT CTATTCGATC CAGGGCTATG ACGCGATCCG AGCTGAACTC TCTGATGCGT CGCGCGATCG CATCGAGGCG GCCCTGTTCC GTCCCATGGC AGAATTCCTG TCGACAGAGT CGCCGCAGAC TTTCCAACGC ATCCACAATC ACGGCACCTG GGCCGCTGCA GCCGTCGGCA TGACCGGCTA TGCGCTGGAC GATCCTGCGC TTGTAGAGCG TGCGCTGTTT GGGCTCGAGC TCGACGGTGA GGCAGGCTTT TTGGCCCAGC TCGACCAGCT TTTTTCACCG GACGGGTACT ACACTGAGGG CCCCTATTAT CAACGCTACG CACTCATGCC GTTCGTGTTG TTCGGACAGG CGGTGCAGAA TAACGAGCCC GAGCGGGGCA TATTCGAGCA TCGCGACGGC ATTTTGCTCG AAGCCATCCT TTCGACCATC CACCAGAGCT ATGCGGGCCG TTTCTTTCCG ATCAATGATG CGATCCGGGA AAAGGGTCTC GACACGGTTG AGGTCGTCTA CGGGGTCGCG GCCGCCTACT CCCTGACGGG AGATACCGGT CTGCTGTCGA TTGCCGATCA GCAGGGCGCG ACGGTTCTGA CCGGCGACGG GCTGGCCGTG GCGACCGACC TTGATGCCGG AATGGCCTCT CCCTTTGCCT TTGACACACG ACTGCTTCGC GATGGCCCTG ACGGAAATCA GGGCGGCCTT GCCATCCTGC GCATGGGCGC GGGAGAGCTC GCGCAGACGC TTGTGGCCAA GCACACCGGT CAGGGCATGG GGCACGGGCA TTTCGACAAG CTGTCGCTCA TTCTCTACGA CGGCGGACAG GAGATCCTGA CCGATTATGG CGCCGCCCGC TTCTTGAATG TCCCCTCCAA GGATGGTGGA CGTTATCTGC CGGAGAACAG CAGCTGGGCC CGCCAGACCA TTGCCCACAA TACCGTGGTC CTGAACGAGA CCAGCCATCA TGGCGGTGAC TGGCGTCGCG CTCAGGAGAG CTGGCCGAGC ATTGCCCTGT TCGAGCAGCG CGATGGCGTC AATGTGGTTT CGGCGAGTAT CTCAGATGCC TATCCGGATG CGACGCTGAC CCGAACCACT TTCCAGCTGC AGAGTGAAAA CACCGCCAGT CCGCTCATCC TGGACGTCTT TGACGTTGTT GCCGACGAGC CGTCGCAAAT TGACCTGCCC ACCTATTTCG CTGGCCAGCT GACGGATTTC GATATCGCGT TTGAACGCTT CGGAAGTCAG CAAACGGCGC TGGGGGCGGG CAATGGCTAT CAGCATTTGT GGGTGGAAGC CCAAAGCGAG ATTGCGGCCG ATCGGGCACG CTTCACCTGG CTGACGGAAA ACCGGTTCTA CACCTATCAT GCCCTGGTCA GCGAGCCGTT CGAGGCTCGA ATTGTCAGAA CCGGCGCGAA CGATCCCGAT TTCAATCTCC GGACCCAACA GGGCATCCTG TTGCGTGTTC CCAGCGCGGA ATCAGCCCGT TTCATTTCGG TGTTCGAACC GCATGGTCGA TACGACCCGG CCCAGGAAAT CACGGTCGCG AGCGAGAGCG GCATTGACGA GATCCGTTCC GTTGAAGGCG AGACCGCCAC GCTCATCCAG ATCGTTTTCA ACTCAGGTCA ACGCCTCAGC GTTGGCCTGG CTCATGATCT CCATGCCGGA GCTCACACGA TAAACGCCGA CGGCCATCGT TACGAATGGA CCGGGGCTTA CAACATATTT GGGGAGTGA
|
Protein sequence | MIRIVGLVAA ALLATTSCAS AQPNLILTGE GVEAFREAGA LPPLMQRALD AATRRVEASI QAGPIVPEPV DPGGGYSHER HKENYKIIHD AGLLFQLTGE RRYLEHAETY LLAYADMYGD LPLHPERRNQ APGRLFWQSL NEAVWLVYSI QGYDAIRAEL SDASRDRIEA ALFRPMAEFL STESPQTFQR IHNHGTWAAA AVGMTGYALD DPALVERALF GLELDGEAGF LAQLDQLFSP DGYYTEGPYY QRYALMPFVL FGQAVQNNEP ERGIFEHRDG ILLEAILSTI HQSYAGRFFP INDAIREKGL DTVEVVYGVA AAYSLTGDTG LLSIADQQGA TVLTGDGLAV ATDLDAGMAS PFAFDTRLLR DGPDGNQGGL AILRMGAGEL AQTLVAKHTG QGMGHGHFDK LSLILYDGGQ EILTDYGAAR FLNVPSKDGG RYLPENSSWA RQTIAHNTVV LNETSHHGGD WRRAQESWPS IALFEQRDGV NVVSASISDA YPDATLTRTT FQLQSENTAS PLILDVFDVV ADEPSQIDLP TYFAGQLTDF DIAFERFGSQ QTALGAGNGY QHLWVEAQSE IAADRARFTW LTENRFYTYH ALVSEPFEAR IVRTGANDPD FNLRTQQGIL LRVPSAESAR FISVFEPHGR YDPAQEITVA SESGIDEIRS VEGETATLIQ IVFNSGQRLS VGLAHDLHAG AHTINADGHR YEWTGAYNIF GE
|
| |