Gene Information |
Locus tag | Rmar_2001 |
Symbol | |
ID | 8568658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2332125 |
End bp | 2334965 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003291270 |
Protein GI | 268317551 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
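The length fields above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2332125, 2334965)
print(length)                  # 2841
print(protein_length(length))  # 946
```

Both values agree with the Gene Length (2841 bp) and Protein Length (946 aa) fields of this record.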
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.890669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAGC GGACCGACAT CAAGCGGATC CTGCTGATCG GATCGGGTCC AATCGTCATC GGGCAGGCCT GCGAGTTCGA CTACTCCGGC AGCCAGGCCG CCCGAGCCTT GCGTAAAGAA GGCTACGAGG TCATTCTGGT CAACTCGAAC CCGGCCACAA TCATGACCGA CCCGATCACG GCCGATCGGG TCTATCTGCA GGAGTTGACG CCGGAATCCA TCCGCCGCAT CGTCGAGAAA GAGCGTCCCG ACGCCGTGCT GCCCACGATG GGCGGGCAGA CGGCGCTGAA CCTGGCCGCC CAGCTCCACG AAGAGGGCTT CTGGGAGGCG ATGGGCGTGG AGATCATCGG CGTGGACATC GAGGCGATCC AGATCACGGA GGATCGCCAG AAATTCCGCG ACCTGATGGA GCAGATCGGC ATCGATCAGG CCCGGAGCCG CACGGCCCGT AGCCTGCTGG AAGCCAAAGA GATCCTGCAG GAGCTGGGCG GGCTGCCCGT GGTGATCCGC CCGTCATTCA CGCTGGGCGG CACCGGCGGC GGTATCGTCT GGACGATGGA GGAGTTCGAC CGCAAGGTGA CGCGCGGGCT GGAGCTTTCG CCCGTCCACC AGGTGCTCAT CGAGGAAAGC GTCTACGGCT GGAAGGAATA CGAGCTGGAG CTGCTGCGCG ACGCGAACGA CAACGTGATC ATCGTCTGTC CCATCGAAAA CCTCGACCCG ATGGGCGTAC ACACGGGCGA CTCGATCACA GTGGCGCCCG CGCAGACGCT CACCGACAAA CAGTACCAGC GCATGCGCGA CGCGGCGATC AAGATGATGC GCTCGATCGG CAAGTTTGCC GGCGGCTGCA ACGTGCAGTT TGCCGTCGAG CCGCACACCG GGCGCATGAT CGCCGTCGAG ATCAACCCGC GCATGTCGCG CTCGTCGGCG CTGGCCTCGA AGGCCACGGG TTACCCGATC GCCAAGGTGG CAGCCCGCCT GGCCGTGGGT TACACGCTCG ACGAATTGCC CAACGACGTG ACGGGCACCA CGAGCGCCTG TTTCGAGCCG TCGATCGACT ACGTGGTCGT CAAGATCCCG CGCTGGAATT TCGAAAAGTT CGAGGGCGTC GACGAGGAGC TGACCACGCA GATGAAGGCG GTAGGCGAGG TGATGGCCAT CGGCCGCACC TTCCCCGAAG CGCTCCAGAA GGCCTGGCAG AGCCTGGAAA ACGGCTATGC CGGCCTCGGC GCCGACCGGG AAGACCCGTC GCGCGAAGAG GTGCGCGCCC GTCTCAAGAA GCCCTACTGG GACCGCACGC TGCAAATTCG CAACGCCTTC CGGCTGGGCG CCTCGGTCGA GGAGATCCAC GACATTACCT ACATCGATCC GTGGTTCCTC TACCAGATCG AGGACATCGT CAAAATCGAA CGGGAGCTGG AGCGGCGTAC GCTCGACCAG CTCGACGCCG ACTTCCTGCG GCTGGTCAAG CAGTACGGCT TCTCGGACGT GCAGATTGCC TACCTGCTGC AAGGGAACGT GACCGAGGAG GACGTGCGGG CGCGGCGTAA GGCGCTGGGC ATCACGCCCA CGTTCCGGCT CGTCGACACC TGCGCCGCCG AATTTCCGGC CCAGACGCCC TACTACTACA GTACCTACGA GACCGAGAAC GAAAGCGAGG TCACCGACCG CGAAAAGGTC ATCATTCTGG GTGCCGGACC CAACCGCATC GGCCAGGGCA TCGAGTTCGA CTACTGCTGC GTGCACGGGG TGCTGGCCGC CAAGGAGATG GGCTACGAGG CCATCATGAT CAACTGCAAT 
CCGGAGACGG TATCGACCGA CTTCGACGTG GCCGACAAGC TCTACTTCGA GCCCGTCTTC TGGGAGCGGG TGCTCGACAT CATCGAGCAC GAAAACCGGC ACGGCAAGCT TAAAGGGGTG ATCGTGCAGC TCGGCGGCCA GACGGCGCTC AAGCTGGCCC GCAAGCTCCA TGAGCACGGC ATCCCGATCC TGGGCACGTC CTTCCCGATG ATGGACCTGG CCGAGGAGCG GAGCAAGTTC TCGGCGCTGC TGCGCGAGCT GGAGATCCCG TATCCGCCTT ACGGGGCGGC GCGGACGGTG GCCGAGGCCG TCGAGGTGGC CGAGCGCATC GGCTACCCGA TCCTGATCCG GCCCAGCTAC GTGCTCGGCG GCCAGGGCAT GCGCATCGCC ATCAACAAAG AAGAAGTCGA ACGCTATGTC CGCAACATCC TCAAACTCCT GCCCGACAAC GAGATCCTGC TGGACCTCTT CCTGGAGAAC GGGATCGAGG TGGACGTCGA TGCCGCCTGC GACGGCGAGG AGGTCTGGAT TGCCGGCATC ATGCAGCATA TCGAACCGGC CGGCGTCCAC TCGGGCGACT CGACGGCCGT GTTGCCGCCC TTCTCGCTGT CGGAGGAGGT GCTGAACACG ATCCGCCGCT ATACCGAGGA CATCGCCCGG CGCCTGCAGG TGGTGGGCCT GATCAACGTG CAGATGGTGG TCAAAGACAA CGTGGTCTAC GTGATCGAGG CCAACCCGCG GGCGTCGCGG ACCATGCCCT TCGTGGCCAA GGCGACGGGC GTACCGGTGG CCAAGATCGG CACGCAGCTC ATGCTGGGCC GCAAGCTCCG GGAGTTCCGC GAGGCTGGCC TGCTCGAATC GAAGCTCAAG GGCTACGCGA TCAAAGAGCC GGTCTTCTCC TGGGACAAAT TCCCGGAGGT GCCCAAGGAG CTGGGGCCGG AGATGAAGTC CACCGGCGAG GCCATCGCCT TCGTCGAAAC GCTCACCGAC GAACACTTCC GCCGGCCCTA CGCCATCCGC AACCTCTACC TGAGTCGATA G
|
Protein sequence | MPKRTDIKRI LLIGSGPIVI GQACEFDYSG SQAARALRKE GYEVILVNSN PATIMTDPIT ADRVYLQELT PESIRRIVEK ERPDAVLPTM GGQTALNLAA QLHEEGFWEA MGVEIIGVDI EAIQITEDRQ KFRDLMEQIG IDQARSRTAR SLLEAKEILQ ELGGLPVVIR PSFTLGGTGG GIVWTMEEFD RKVTRGLELS PVHQVLIEES VYGWKEYELE LLRDANDNVI IVCPIENLDP MGVHTGDSIT VAPAQTLTDK QYQRMRDAAI KMMRSIGKFA GGCNVQFAVE PHTGRMIAVE INPRMSRSSA LASKATGYPI AKVAARLAVG YTLDELPNDV TGTTSACFEP SIDYVVVKIP RWNFEKFEGV DEELTTQMKA VGEVMAIGRT FPEALQKAWQ SLENGYAGLG ADREDPSREE VRARLKKPYW DRTLQIRNAF RLGASVEEIH DITYIDPWFL YQIEDIVKIE RELERRTLDQ LDADFLRLVK QYGFSDVQIA YLLQGNVTEE DVRARRKALG ITPTFRLVDT CAAEFPAQTP YYYSTYETEN ESEVTDREKV IILGAGPNRI GQGIEFDYCC VHGVLAAKEM GYEAIMINCN PETVSTDFDV ADKLYFEPVF WERVLDIIEH ENRHGKLKGV IVQLGGQTAL KLARKLHEHG IPILGTSFPM MDLAEERSKF SALLRELEIP YPPYGAARTV AEAVEVAERI GYPILIRPSY VLGGQGMRIA INKEEVERYV RNILKLLPDN EILLDLFLEN GIEVDVDAAC DGEEVWIAGI MQHIEPAGVH SGDSTAVLPP FSLSEEVLNT IRRYTEDIAR RLQVVGLINV QMVVKDNVVY VIEANPRASR TMPFVAKATG VPVAKIGTQL MLGRKLREFR EAGLLESKLK GYAIKEPVFS WDKFPEVPKE LGPEMKSTGE AIAFVETLTD EHFRRPYAIR NLYLSR
|
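The gene sequence is listed 5' to 3' on the coding strand, so it can be translated directly without reverse-complementing, even though the gene lies on the minus strand of the replicon. The sketch below is one way to check the protein sequence and the GC content against the gene sequence. It assumes the sequence contains only A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The names `translate` and `gc_percent` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and last twelve bases of the gene sequence in this record.
print(translate("ATGCCGAAGC GGACCGACAT CAAGCGGATC"))  # MPKRTDIKRI
print(translate("CTGAGTCGATAG"))                      # LSR
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.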