Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0473 |
Symbol | |
ID | 4284162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 551674 |
End bp | 553761 |
Gene Length | 2088 bp |
Protein Length | 695 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638139937 |
Product | endothelin-converting protein 1 |
Protein accession | YP_755704 |
Protein GI | 114569024 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3590] Predicted metalloendopeptidase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.190105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTTTGCTGAC CAGCGCCTGC GTTGCGCTGA TGGCGGCCTG CTCGCCTGCC ACCGAAACGC CGGAAACAGA CACTGGCACA ACCGAAGCGG CGATGCCCGA GACAAGCCAG CCCGCCGGTT CGCCGGACCT GGGAAGCTGG GGCGTCGAGA TCGAACACGT CTCCGACAGC GTCGCTCCGG GCGATGACTT CAACCGCTAC GTCAATGAAG GCTGGCTCGA CTCGACCGAA TTGCCCCAGG GCTTCTCATC CTTTGGCGGT TTCACCGAGC TCTACCTGCG CTCGGAAGAG CGGGTGGAAG GCATCATCCA GGAAGCCGTT GCCGCAAATG CCGAAGCCGG CACCCTGCAG CAACAGGTCG GCGACCTCTA CGCCTCCTAC ATGGATACCG ACACGCTGGC GGCCCGCGGT CTCGAGCCGG CCCGGCCGAT GCTCAATTCC ATTGCCGAAG CGCAGTCGCA TGACGACATC GCGACCCTGA TGGGCAGCCC CGGTACCGCG TCGATCTTCG GCGCCGGCGT CGGTCGTGAC CCGGGCAATC CCCAGCGCTA CATCGTCTCG GTCAGCCAGA GCGGTCTGGG CCTGCCGGAC AAGTCCTATT ATGAGCGCGA CGACGAGACC TTCGTCGGTT ATCGCGACGC CTATGTCGCC TACATGACCG ACGTCTTCGA GATGATCGGC ATGGACAATG CCGCCGAGCG CGCCCAGGGC GTGCTGGACC TGGAAACGCG CATCGCACAG ATCCACTGGA CCCGTGCGGA AAGCCGCGAC CGTGTGCGGA CCTATAATCT GATGAGCACG GATGACCTGG TCGCCTCGGC CCCGGGCTTC CCCTGGGGTG CCTTCATGAC CGCCCTGGAA TACGAGAATG AGACCGAAGT CGTGGTCCGC CAGGACACCG CCATCCAGTC GCTTGCCACA CTGTTCACCG AAATCCCGGT CGAAGCCTGG CAGGACTATC TCAGCTTCCG CTACATGTCG TCCAACCAGA ACCTGCTGAC GCCGGAATTC TACGAGCGCT CCTTCGACTT CTACAGCCGC ACCCTGCGCG GCACGGAAGA ACCCCGGGCC CGCGATCGTC GCGGCATCCA GTATGTGAAC GGCAATCTGG GTCAGGCCAT CGGCCAGATC TATGTCGAGC AATACTTCCC GCCCGAGCAC AAGGCGCAGA TGGAAGAGCT GGTCGAGTAT CTCCGTCGGG CCTTGCGCGA GCGCATCGAG ACGCTTGAGT GGATGGATGA CGAAACCCGC GTCCAGGCCT TCGACAAGCT GGAGAAATTC CTGCCGAAAA TCGGCTATCC CGACATCTGG CCGGATTATT CGGCGATCGA GATCCGGTCC GATGACCTGT TCGGCAACTC CCAGCGCGTC GCCGAATGGT TCCGCGCCGA CAGCCGTTCG CGCCTCGGCA GCCCGATCCG CGAATGGGAA TGGTTCATGT CACCGCAAAC GGTGAATGCC TACTACTCCT CCACCGCCAA CGAGATCGTC TTCCCGGCGG CCATCCTGCA GGGTCCCTTC TTTGACCCTT ATGCGGATGC AGCCGTGAAC TTCGGCGGTA TCGGTGCCGT CATCGGCCAC GAAATGGGCC ACGGCTTCGA CGATCAGGGC AGCCAGTCCG ATGGCGACGG TGTCCTGCGC AATTGGTGGA CCGACACCAG CCGCGAAAAC TTCGACGGCC TGACCAACCA GATCGTCGCC CAGTATGACG GCTTCTCCCC GGTCGAAGGC CAGAGTGTCG ATGGTCGCCT GACGCTCGGC GAGAATATCG GTGATATCGG CGGTCTTTCC ATGGCCCACC GCGCCTACCA GATGTATCTG GCCGATAATG GTGGCGAAGC TGAAGTACTC GATGGCTTCA CCGGCGATCA GCGCTTCTTC ATGGCCTGGG CTCAGGTCTG GCGGAATGTG CGGACCGAAG ACAGCCTGCG CGCACAGCTC CTGTCCGACC CGCACAGCCC GGCCCAGTAC CGGATCAACG GCGTCGTCCG CAATAATGAC GCCTGGTACG AGGCCTTCGG CGTGACCGAG GACCATGAAC TCTACCTGGC ACCGGAAGAT CGCGTATCGA TCTGGTAA
|
Protein sequence | MKKLLLTSAC VALMAACSPA TETPETDTGT TEAAMPETSQ PAGSPDLGSW GVEIEHVSDS VAPGDDFNRY VNEGWLDSTE LPQGFSSFGG FTELYLRSEE RVEGIIQEAV AANAEAGTLQ QQVGDLYASY MDTDTLAARG LEPARPMLNS IAEAQSHDDI ATLMGSPGTA SIFGAGVGRD PGNPQRYIVS VSQSGLGLPD KSYYERDDET FVGYRDAYVA YMTDVFEMIG MDNAAERAQG VLDLETRIAQ IHWTRAESRD RVRTYNLMST DDLVASAPGF PWGAFMTALE YENETEVVVR QDTAIQSLAT LFTEIPVEAW QDYLSFRYMS SNQNLLTPEF YERSFDFYSR TLRGTEEPRA RDRRGIQYVN GNLGQAIGQI YVEQYFPPEH KAQMEELVEY LRRALRERIE TLEWMDDETR VQAFDKLEKF LPKIGYPDIW PDYSAIEIRS DDLFGNSQRV AEWFRADSRS RLGSPIREWE WFMSPQTVNA YYSSTANEIV FPAAILQGPF FDPYADAAVN FGGIGAVIGH EMGHGFDDQG SQSDGDGVLR NWWTDTSREN FDGLTNQIVA QYDGFSPVEG QSVDGRLTLG ENIGDIGGLS MAHRAYQMYL ADNGGEAEVL DGFTGDQRFF MAWAQVWRNV RTEDSLRAQL LSDPHSPAQY RINGVVRNND AWYEAFGVTE DHELYLAPED RVSIW
|
| |