Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2901 |
Symbol | |
ID | 4286331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 3184484 |
End bp | 3186460 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638142396 |
Product | hypothetical protein |
Protein accession | YP_758120 |
Protein GI | 114571440 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAAC ATGGCGATCC GGACGTGGCC GATGTCACAA GTCCGGCTTC TGGCGACACT TCACCGGCAC TGTCTGATCC CACCGCCCTT GGCACGCTAC CGGATTCTGC GACTTTCACG CCGACGCCGA ACTTCGTTTC TGAACTGGCT CCACCGGCCC GCGGCACGGA AGTATTCGAG GCCGAGGGTG GGGTTCGCTA CGCGCTGTAT CAGCTCGAGC ACACCAATAT GGGGGAGGAT CCCCGGGAAT ATGTGCGGAT CATCTCGGAA CCCGTCACCT CGGGAGGGGT GGATTCGGTG GCCAACGGGA CCGTGGTTTT CAATCCGGCC TTCCAAACCC TGGAATTCCA CACCTTGCGC GTCGAGCGCG ACGGCAGTTT TGAGGACCGG ACAGACAGCA CGACCATTGA GTTCGCCCGC CGCGAGACCC GGCTTGAGCG CCGGATGTTC GACGGCCGGG CAACGGCTAT CGTCCGGTTC AGCGATATCC GCATCGGCGA CCGGGTCGAG TATTCCTACA CGATCACCGG TCGCAATCCG GCTCTGCCGG CCAATGACAG CCGGGAAATC CGGCTCGGTT TTGGCGCCCC TGTCGAGCGC ATGCTGGTCA CATCAACCTG GCCGAATTCG CGCCCGCCCC ATTATCGCCA ACTCGGGCCC GCCGCCGCGG CCAATGTAAC CGCCTCCAGT GAGGGTCCCG CCCAGACGCT CCGCTTTGGC CCGATACCGA CAGCGACTTT CACCGGGGAG CGCAGCGCGC CGTCCTGGAT CCGTCAGAGT CCGTCTCTGC AACTGTCTGA TTTTGCAAAT TGGGAGGATG TATCCCGGTG GTCGGCACCC ATGTATCAAC CAGGATCGTC TGACGCCGTT GATGAGATTG CCGACAGGAT CAGGGCCGAG CATGCCGATC CGGCCGATCA GCTGGTGGCA GCCTTGCGCT TCACCCAGGA CGAGATCCGC TACCTCGCGA TCAGCTTTGG CGCAGGTGGC TACGTCCCGG CCATGCCGCC AGAGACGCTC GAACGGCGCT ATGGCGATTG CAAAGCCAAG ACGGTCTTGC TCATTGCCCT GCTTGAGGCC CTGGATATCG AGGCGGAAGC GGCTCTCGTT CACACCAGTC AGGGCCGGGC ATTGCTCGAT GGCCTGCCCC GGCACACCGC GTTCAATCAT GTGATCGTAC GCGCTTATTT GGACGGTGAG GCCTATTGGC TTGACGGGAC ACGACGCGAG CAGGGCGGAC GGCTCGACAC CCTGGACCAA CCCGATTTCG GCTTCGCCTT GCCGATCAAC ACGGACGGGG CTGCACCCGT TGCAATGGAA CCGACCCAGT TCGGCGAATC TTTCTTTGAA GGCGAGGAAG CGCTGACAAT CCATTCGATC AACGGCGATG CAACACTGGA GGTGACCTAT ACCGCCCGCA ACCTCGGGGC CGATCGCGCC CGGACCTCAT TGTCGCGAAC CGGTCGAGCC GACCTGCAGG AGCGCTTCCT TGATCAGTAT GGCGCGCGAT TTGGCGCTGT CTCTTCGACC CGGGATCTGA CCATCGAAGA CGACCGCGAA ACCAATGTAT TGATCATGCA TATGCAGATC GGGATCGAAA ACGTGCTGAC ACCCCATGAG GATGGGGAAC GTCTTCAGGC GAGTGCACGG ATGCAGATGC GCTCGCCAAC GAACGGCAAT GCCGAACGCA ATCGTCGCTT CCCTCTGACG CTGACCTATC CGGTGCACCA AACCGCTCGC CTCGTTGTGG ACCTGCCAGA TGCCATGGCT GACTGGACGC TGGAGCCGGA GGCGCGACAA TTGACCGTCG ACGGGATCGA TTTCTCAACC GTCCGGAGCC GGGACGGCAA TCGCATCACG CTGGACTACA CCTTGCTGGT CGACCGCCCC TATCTGCCCC CAGAAGAAGC CGCCGCAGCC CTGGCCCTTG GCGACCAGAT CAACCAGTTG ACCCGGTGGT CCCTGGTCTC GCCATAA
|
Protein sequence | MAQHGDPDVA DVTSPASGDT SPALSDPTAL GTLPDSATFT PTPNFVSELA PPARGTEVFE AEGGVRYALY QLEHTNMGED PREYVRIISE PVTSGGVDSV ANGTVVFNPA FQTLEFHTLR VERDGSFEDR TDSTTIEFAR RETRLERRMF DGRATAIVRF SDIRIGDRVE YSYTITGRNP ALPANDSREI RLGFGAPVER MLVTSTWPNS RPPHYRQLGP AAAANVTASS EGPAQTLRFG PIPTATFTGE RSAPSWIRQS PSLQLSDFAN WEDVSRWSAP MYQPGSSDAV DEIADRIRAE HADPADQLVA ALRFTQDEIR YLAISFGAGG YVPAMPPETL ERRYGDCKAK TVLLIALLEA LDIEAEAALV HTSQGRALLD GLPRHTAFNH VIVRAYLDGE AYWLDGTRRE QGGRLDTLDQ PDFGFALPIN TDGAAPVAME PTQFGESFFE GEEALTIHSI NGDATLEVTY TARNLGADRA RTSLSRTGRA DLQERFLDQY GARFGAVSST RDLTIEDDRE TNVLIMHMQI GIENVLTPHE DGERLQASAR MQMRSPTNGN AERNRRFPLT LTYPVHQTAR LVVDLPDAMA DWTLEPEARQ LTVDGIDFST VRSRDGNRIT LDYTLLVDRP YLPPEEAAAA LALGDQINQL TRWSLVSP
|
| |