Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1990 |
Symbol | carB |
ID | 5103377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1924613 |
End bp | 1927753 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507878 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001192054 |
Protein GI | 146304738 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.950925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGGAA TTAAGAAGGT TCTTGTCGTT GGTTCAGGAC CCATAAAGAT AGCCGAGGCT GCGGAATTCG ACTACAGCGG ATCCCAATCG CTTAAGGCCT ACAAGGAGGA AGGTATCGAG ACAGTACTGG TTAACTCCAA CGTTGCCACG GTTCAGACCA GTTACGGAAT GGCAGATAAG CTCTACATGA TTCCTGTAAC ATGGTGGTCC GTAGAGAAGG TCATTGAGCA GGAGAGACCA GATGCCATTG CCATCGGATT TGGAGGACAG ACTGCACTAA ACGTGGGAGT TGACCTCTAC AAGAGGGGAA TTCTCCAGAA GTATGGGATT AAGGTACTCG GAACTCCCAT CGAGGGTATA GAGAAGGCAT TGAGCAGGGA GAAGTTCAGG GAGACAATGA TAGACGTTAA GATCCCAGTT CCTCCCAGCT TTTCCGCTAA GAGCTCTGAG GAAGCCCTAG AGAAGGCAGA GGAGATTGGC TACCCCGTAA TGGTCAGGGT AAGCTTTAAC TTGGGAGGCA GAGGTTCTAC GGTGGCCTGG GATAGGGAAT CCCTTAGGAA GAGTATAGAC AGGGCATTCT CGCAGAGCTA CATTGGAGAA GTCCTAGTGG AAAAGTACCT GCACCATTGG AAGGAGTTAG AGTACGAGGT TGTGAGAGAC TCTAAGGGGA ACTCCGCAGT TATAGCATGC GTGGAAAACC TGGATCCCAT GGGGGTTCAC ACGGGAGAAT CCATAGTTAT CACCCCATGC CAGACGCTGG ATAACAAGGA ATTCCAAGAG ATGAGGACTT TGTCCATGAA AGTGGCAGAA TCCATCAGTC TAGTGGGTGA GTGTAACGTA CAGTTCGCAC TGGATCCCAA CAGCTACACC TATTACGTTA TAGAAACCAA CCCTAGGATG TCTAGATCCA GTGCCCTCGC GAGTAAGGCA ACAGGTTATC CCCTTGCCTA CGTTTCCGCA AAACTTTCCC TAGGCTACAC GCTGGAGGAG ATCCTGAACA AGGTATCAGG GGCGACGTGC GCCTGTTTCG AGCCCAGCTT AGATTACGTT GTCATGAAAA TACCAAGGTG GGACCTGCAG AAGTTTGAGG AAGTAGATAC TAGCCTAAGC ACGGAGATGA AGAGCGTTGG AGAAATCATG AGCATAGGGA GGTCCTTTGA GGAGGCCCTT CAAAAGGGAT TGAGAATGCT AGATATTGGG GAACCAGGGG TGGTTGGAGG AAGATACTAC ATGTCCACTG CCCCCAAGGA GGATGCACTC AAGAAACTTG GCAAGAGAAT TCCCTACTGG CCAATATGGG CAGCTAAGGC ATTCAAGGAG GGAGCGTCAG TTGAGGAGGT GTACAAGGCC ACAGGGGTAG ACAGGTTCTT CCTAAGGAAG ATAGAGAACC TTGTGAAAAC TTTTGAAGAG ATCAAGGGAT CTAAACTCGA CCTAGAGAAG CTGAAATACC TTAAGACACT TGGTTTCAGC GACGAACAGG TTGCCATGGC CACCAACTCC AGCGAAGATG AGGTGTGGAA GGTCAGGTCC TCCAATCGCA TCCTTCCAAA CATAAAGCTA ATTGATACTC TAGCTGGAGA ATGGCCAGCC GTCACCAACT ACCTTTACCT AACATACGTT GGCTCTGAGG ATGATATTGA GCTGAACTCC ACTGGGAACA ACCTTCTCAT AGTGGGGGCA GGAGGTTTCA GAATTGGAGT TTCAGTCGAG TTTGATTGGG GAGTGGTGGA GTTGATGAAG GCTTCTCTGA AGTACTTTGA CCAGGTAGCC GTGCTGAACT ATAACCCCGA AACAGTTTCC ACGGACTGGG ACGTGACCAG GAAGCTCTAC TTTGACGAGA TATCAGCCGA GAGAGTTCTA GACATAATAA ACAAGGAGAA ATACACTTAC GTTGCCACAT TCGCGGGTGG ACAATTGGGT AATAACATTG CAAAGAAGCT GGAGGAGAGA GGGGTGAAGC TTTTCGGAAC CTCTGGATCC TCGGTAGACA TGGCAGAGGA CAGGGAGAAG TTTTCCAAGC TCCTAGATAT GCTAGGAATT AAGCAACCTG AGTGGATCTC AGCTAAGGAT CCCCAAGAGA TAAGGAAATT TGTGGAACAC GTTGGTTTCC CAGTATTGGT CAGGCCGAGT TATGTGCTGA GTGGATCGTC CATGAGCATT GTCTGGACCG AGCAGGAGCT CGATGAGGTC ATTCAGAGGG CCAGATTATC CACAAAGTAT CCTGTGGTGA TAAGCAAGTT CTTGGATAAC GCGTTTGAGG CTGAAGTTGA TGCGGTGAGC GATGGTAAGG GCGTCCTTGG AGTCGTGATG GAACATGTGG AAGAGGCTGG AGTCCACAGC GGTGACAGCA CCATGTCCAT ACCCACAAGG AAAATCACTC CCGTTGTCGA AGATCTCAAG AAGATTTCCC TTACCCTTGC CAGGGAGATT GGAGTTAGGG GTCCATTCAA CTTGCAGTTC GTGATAAAGG ACAACGTGCC CTACGTTATA GAGATGAATC TCAGGGCAAG TAGATCAATG CCTTTCTCCA GCAAGGCCAA GGGAGTCAAC CTAATGGAGA TGGCTGTTAA GGGCATCTTG CAGGGTCTCA ACCTGGATGA GTTCATGGAG CCTGAGAGCA AATCATGGGC AGTAAAGTCA GCCCAATTCT CGTGGACTCA ACTTAAGGAC TCCTATCCAT TCCTAGGCCC CGAGATGAGA AGCACGGGAG AGGCTGCGTC CCTTGGTACA AGCTTCTACG ATGCGTTGCT CAAGAGTTGG TTATCATGCT CTCCCAACAG ACTGCCAAGG CAAGGCAAGG TTGCCCTAGT TTACGGTAAG GGGAATGGAG AGTACTTAAT GGAGGCTGGG AAAAACCTAG AAAGATATGG TCTAGTCGTA AAATCGTTGG ATTTCGGTGA GGTACCAGGG TTTGAAACGC TTAAACAGGA GGAGGCACTA TCCCTGATTA GAAAGGGACA GGTTGATCTT GTGGTGACTA ACGGTTACAT GAAAAGCCTA GACTACAGCA TCAGGAGAAC TGCCGTTGAC CTGAACATTC CAATCATCCT CAACGGAAGG TTGGGGAAGG AGGTCTCCCA GGCAATGTTA CTCAACGAGA TGACCTTTTA TGAGATGAGG AGGTATGGAG GTGGAATCTA A
|
Protein sequence | MKGIKKVLVV GSGPIKIAEA AEFDYSGSQS LKAYKEEGIE TVLVNSNVAT VQTSYGMADK LYMIPVTWWS VEKVIEQERP DAIAIGFGGQ TALNVGVDLY KRGILQKYGI KVLGTPIEGI EKALSREKFR ETMIDVKIPV PPSFSAKSSE EALEKAEEIG YPVMVRVSFN LGGRGSTVAW DRESLRKSID RAFSQSYIGE VLVEKYLHHW KELEYEVVRD SKGNSAVIAC VENLDPMGVH TGESIVITPC QTLDNKEFQE MRTLSMKVAE SISLVGECNV QFALDPNSYT YYVIETNPRM SRSSALASKA TGYPLAYVSA KLSLGYTLEE ILNKVSGATC ACFEPSLDYV VMKIPRWDLQ KFEEVDTSLS TEMKSVGEIM SIGRSFEEAL QKGLRMLDIG EPGVVGGRYY MSTAPKEDAL KKLGKRIPYW PIWAAKAFKE GASVEEVYKA TGVDRFFLRK IENLVKTFEE IKGSKLDLEK LKYLKTLGFS DEQVAMATNS SEDEVWKVRS SNRILPNIKL IDTLAGEWPA VTNYLYLTYV GSEDDIELNS TGNNLLIVGA GGFRIGVSVE FDWGVVELMK ASLKYFDQVA VLNYNPETVS TDWDVTRKLY FDEISAERVL DIINKEKYTY VATFAGGQLG NNIAKKLEER GVKLFGTSGS SVDMAEDREK FSKLLDMLGI KQPEWISAKD PQEIRKFVEH VGFPVLVRPS YVLSGSSMSI VWTEQELDEV IQRARLSTKY PVVISKFLDN AFEAEVDAVS DGKGVLGVVM EHVEEAGVHS GDSTMSIPTR KITPVVEDLK KISLTLAREI GVRGPFNLQF VIKDNVPYVI EMNLRASRSM PFSSKAKGVN LMEMAVKGIL QGLNLDEFME PESKSWAVKS AQFSWTQLKD SYPFLGPEMR STGEAASLGT SFYDALLKSW LSCSPNRLPR QGKVALVYGK GNGEYLMEAG KNLERYGLVV KSLDFGEVPG FETLKQEEAL SLIRKGQVDL VVTNGYMKSL DYSIRRTAVD LNIPIILNGR LGKEVSQAML LNEMTFYEMR RYGGGI
|
| |