Gene Msed_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1990 
SymbolcarB 
ID5103377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1924613 
End bp1927753 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content49% 
IMG OID640507878 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001192054 
Protein GI146304738 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.950925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGAA TTAAGAAGGT TCTTGTCGTT GGTTCAGGAC CCATAAAGAT AGCCGAGGCT 
GCGGAATTCG ACTACAGCGG ATCCCAATCG CTTAAGGCCT ACAAGGAGGA AGGTATCGAG
ACAGTACTGG TTAACTCCAA CGTTGCCACG GTTCAGACCA GTTACGGAAT GGCAGATAAG
CTCTACATGA TTCCTGTAAC ATGGTGGTCC GTAGAGAAGG TCATTGAGCA GGAGAGACCA
GATGCCATTG CCATCGGATT TGGAGGACAG ACTGCACTAA ACGTGGGAGT TGACCTCTAC
AAGAGGGGAA TTCTCCAGAA GTATGGGATT AAGGTACTCG GAACTCCCAT CGAGGGTATA
GAGAAGGCAT TGAGCAGGGA GAAGTTCAGG GAGACAATGA TAGACGTTAA GATCCCAGTT
CCTCCCAGCT TTTCCGCTAA GAGCTCTGAG GAAGCCCTAG AGAAGGCAGA GGAGATTGGC
TACCCCGTAA TGGTCAGGGT AAGCTTTAAC TTGGGAGGCA GAGGTTCTAC GGTGGCCTGG
GATAGGGAAT CCCTTAGGAA GAGTATAGAC AGGGCATTCT CGCAGAGCTA CATTGGAGAA
GTCCTAGTGG AAAAGTACCT GCACCATTGG AAGGAGTTAG AGTACGAGGT TGTGAGAGAC
TCTAAGGGGA ACTCCGCAGT TATAGCATGC GTGGAAAACC TGGATCCCAT GGGGGTTCAC
ACGGGAGAAT CCATAGTTAT CACCCCATGC CAGACGCTGG ATAACAAGGA ATTCCAAGAG
ATGAGGACTT TGTCCATGAA AGTGGCAGAA TCCATCAGTC TAGTGGGTGA GTGTAACGTA
CAGTTCGCAC TGGATCCCAA CAGCTACACC TATTACGTTA TAGAAACCAA CCCTAGGATG
TCTAGATCCA GTGCCCTCGC GAGTAAGGCA ACAGGTTATC CCCTTGCCTA CGTTTCCGCA
AAACTTTCCC TAGGCTACAC GCTGGAGGAG ATCCTGAACA AGGTATCAGG GGCGACGTGC
GCCTGTTTCG AGCCCAGCTT AGATTACGTT GTCATGAAAA TACCAAGGTG GGACCTGCAG
AAGTTTGAGG AAGTAGATAC TAGCCTAAGC ACGGAGATGA AGAGCGTTGG AGAAATCATG
AGCATAGGGA GGTCCTTTGA GGAGGCCCTT CAAAAGGGAT TGAGAATGCT AGATATTGGG
GAACCAGGGG TGGTTGGAGG AAGATACTAC ATGTCCACTG CCCCCAAGGA GGATGCACTC
AAGAAACTTG GCAAGAGAAT TCCCTACTGG CCAATATGGG CAGCTAAGGC ATTCAAGGAG
GGAGCGTCAG TTGAGGAGGT GTACAAGGCC ACAGGGGTAG ACAGGTTCTT CCTAAGGAAG
ATAGAGAACC TTGTGAAAAC TTTTGAAGAG ATCAAGGGAT CTAAACTCGA CCTAGAGAAG
CTGAAATACC TTAAGACACT TGGTTTCAGC GACGAACAGG TTGCCATGGC CACCAACTCC
AGCGAAGATG AGGTGTGGAA GGTCAGGTCC TCCAATCGCA TCCTTCCAAA CATAAAGCTA
ATTGATACTC TAGCTGGAGA ATGGCCAGCC GTCACCAACT ACCTTTACCT AACATACGTT
GGCTCTGAGG ATGATATTGA GCTGAACTCC ACTGGGAACA ACCTTCTCAT AGTGGGGGCA
GGAGGTTTCA GAATTGGAGT TTCAGTCGAG TTTGATTGGG GAGTGGTGGA GTTGATGAAG
GCTTCTCTGA AGTACTTTGA CCAGGTAGCC GTGCTGAACT ATAACCCCGA AACAGTTTCC
ACGGACTGGG ACGTGACCAG GAAGCTCTAC TTTGACGAGA TATCAGCCGA GAGAGTTCTA
GACATAATAA ACAAGGAGAA ATACACTTAC GTTGCCACAT TCGCGGGTGG ACAATTGGGT
AATAACATTG CAAAGAAGCT GGAGGAGAGA GGGGTGAAGC TTTTCGGAAC CTCTGGATCC
TCGGTAGACA TGGCAGAGGA CAGGGAGAAG TTTTCCAAGC TCCTAGATAT GCTAGGAATT
AAGCAACCTG AGTGGATCTC AGCTAAGGAT CCCCAAGAGA TAAGGAAATT TGTGGAACAC
GTTGGTTTCC CAGTATTGGT CAGGCCGAGT TATGTGCTGA GTGGATCGTC CATGAGCATT
GTCTGGACCG AGCAGGAGCT CGATGAGGTC ATTCAGAGGG CCAGATTATC CACAAAGTAT
CCTGTGGTGA TAAGCAAGTT CTTGGATAAC GCGTTTGAGG CTGAAGTTGA TGCGGTGAGC
GATGGTAAGG GCGTCCTTGG AGTCGTGATG GAACATGTGG AAGAGGCTGG AGTCCACAGC
GGTGACAGCA CCATGTCCAT ACCCACAAGG AAAATCACTC CCGTTGTCGA AGATCTCAAG
AAGATTTCCC TTACCCTTGC CAGGGAGATT GGAGTTAGGG GTCCATTCAA CTTGCAGTTC
GTGATAAAGG ACAACGTGCC CTACGTTATA GAGATGAATC TCAGGGCAAG TAGATCAATG
CCTTTCTCCA GCAAGGCCAA GGGAGTCAAC CTAATGGAGA TGGCTGTTAA GGGCATCTTG
CAGGGTCTCA ACCTGGATGA GTTCATGGAG CCTGAGAGCA AATCATGGGC AGTAAAGTCA
GCCCAATTCT CGTGGACTCA ACTTAAGGAC TCCTATCCAT TCCTAGGCCC CGAGATGAGA
AGCACGGGAG AGGCTGCGTC CCTTGGTACA AGCTTCTACG ATGCGTTGCT CAAGAGTTGG
TTATCATGCT CTCCCAACAG ACTGCCAAGG CAAGGCAAGG TTGCCCTAGT TTACGGTAAG
GGGAATGGAG AGTACTTAAT GGAGGCTGGG AAAAACCTAG AAAGATATGG TCTAGTCGTA
AAATCGTTGG ATTTCGGTGA GGTACCAGGG TTTGAAACGC TTAAACAGGA GGAGGCACTA
TCCCTGATTA GAAAGGGACA GGTTGATCTT GTGGTGACTA ACGGTTACAT GAAAAGCCTA
GACTACAGCA TCAGGAGAAC TGCCGTTGAC CTGAACATTC CAATCATCCT CAACGGAAGG
TTGGGGAAGG AGGTCTCCCA GGCAATGTTA CTCAACGAGA TGACCTTTTA TGAGATGAGG
AGGTATGGAG GTGGAATCTA A
 
Protein sequence
MKGIKKVLVV GSGPIKIAEA AEFDYSGSQS LKAYKEEGIE TVLVNSNVAT VQTSYGMADK 
LYMIPVTWWS VEKVIEQERP DAIAIGFGGQ TALNVGVDLY KRGILQKYGI KVLGTPIEGI
EKALSREKFR ETMIDVKIPV PPSFSAKSSE EALEKAEEIG YPVMVRVSFN LGGRGSTVAW
DRESLRKSID RAFSQSYIGE VLVEKYLHHW KELEYEVVRD SKGNSAVIAC VENLDPMGVH
TGESIVITPC QTLDNKEFQE MRTLSMKVAE SISLVGECNV QFALDPNSYT YYVIETNPRM
SRSSALASKA TGYPLAYVSA KLSLGYTLEE ILNKVSGATC ACFEPSLDYV VMKIPRWDLQ
KFEEVDTSLS TEMKSVGEIM SIGRSFEEAL QKGLRMLDIG EPGVVGGRYY MSTAPKEDAL
KKLGKRIPYW PIWAAKAFKE GASVEEVYKA TGVDRFFLRK IENLVKTFEE IKGSKLDLEK
LKYLKTLGFS DEQVAMATNS SEDEVWKVRS SNRILPNIKL IDTLAGEWPA VTNYLYLTYV
GSEDDIELNS TGNNLLIVGA GGFRIGVSVE FDWGVVELMK ASLKYFDQVA VLNYNPETVS
TDWDVTRKLY FDEISAERVL DIINKEKYTY VATFAGGQLG NNIAKKLEER GVKLFGTSGS
SVDMAEDREK FSKLLDMLGI KQPEWISAKD PQEIRKFVEH VGFPVLVRPS YVLSGSSMSI
VWTEQELDEV IQRARLSTKY PVVISKFLDN AFEAEVDAVS DGKGVLGVVM EHVEEAGVHS
GDSTMSIPTR KITPVVEDLK KISLTLAREI GVRGPFNLQF VIKDNVPYVI EMNLRASRSM
PFSSKAKGVN LMEMAVKGIL QGLNLDEFME PESKSWAVKS AQFSWTQLKD SYPFLGPEMR
STGEAASLGT SFYDALLKSW LSCSPNRLPR QGKVALVYGK GNGEYLMEAG KNLERYGLVV
KSLDFGEVPG FETLKQEEAL SLIRKGQVDL VVTNGYMKSL DYSIRRTAVD LNIPIILNGR
LGKEVSQAML LNEMTFYEMR RYGGGI