Gene Information |
Locus tag | Msed_0147 |
Symbol | |
ID | 5105000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 118141 |
End bp | 119673 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506050 |
Product | carbamoyl-phosphate synthase L chain, ATP-binding |
Protein accession | YP_001190248 |
Protein GI | 146302932 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
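The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(118141, 119673)
print(length, protein_length_aa(length))  # 1533 510
```

The 1533 bp span holds 511 codons. Dropping the terminal stop codon leaves the 510 residues reported as the protein length.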
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.012 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCCT TTAGTAGAGT TTTGGTTGCA AACAGGGGAG AAATTGCAGT AAGGGTAATG AAGGCAATAA AGGAAATGGG AATGACAGCA ATAGCTGTTT ACTCTGAGGC TGACAAGTAC GCAGTCCACG TTAAGTATGC CGATGAAGCT TATTATATTG GACCCTCGCC GGCCTTGGAA AGTTACCTCA ACATACCCCA CATCATTGAC GCAGCGGAGA AGGCTCACGC TGACGCTGTT CATCCTGGAT ATGGATTCTT GTCGGAGAAT GCTGACTTCG TGGAGGCAGT TGAAAAGGCA GGAATGACTT ACATAGGTCC CTCTGCTGAG GTCATGAGAA AGATAAAGGA TAAGCTGGAT GGGAAAAGGA TAGCCCAGTT ATCTGGTGTC CCCATTGCCC CTGGCTCGGA TGGCCCCGTA GAATCCATTG ACGAGGCTCT TAAGTTGGCT GAGAAGATAG GATACCCCAT CATGGTTAAG GCCGCTAGCG GGGGTGGTGG AGTAGGTATA ACAAAGATAG ATACACCTGA CCAGCTCATT GACGCATGGG AAAGAAACAA GAGGTTAGCT ACACAAGCCT TCGGACGATC TGATCTATAC ATAGAAAAAG CCGCCGTAAA CCCTAGGCAC ATTGAGTTTC AGTTAATTGG CGATAAGTAC GGCAACTATG TCGTTGCTTG GGAGAGGGAA TGTACTATTC AGAGAAGAAA CCAGAAGTTG ATAGAGGAGG CACCATCTCC AGCAATCACA ATGGAAGAAA GGTCACGAAT GTTCGAGCCT ATATACAAAT ATGGGAAGTT AATTAATTAC TTTACCCTGG GTACTTTCGA GACAGTTTTC TCTGATGCCA CAAGGGAGTT CTACTTCCTT GAGCTGAACA AAAGGCTTCA GGTAGAACAC CCAGTTACTG AGTTAATATT CAGAATTGAT CTGGTAAAGC TACAGATAAG GCTAGCTGCA GGAGAACATT TGCCATTCAC GCAGGAGGAA CTCAACAAGA GGGCGAGAGG TGCAGCAATA GAGTTCAGGA TAAATGCCGA GGATCCAATA AATAATTTCA GCGGAAGCTC AGGTTTCATT ACGTACTACA GGGAGCCCAC GGGTCCTGGA GTGAGAATGG ATAGCGGTGT AACGGAGGGA AGCTGGGTAC CTCCTTTCTA CGACTCTCTA GTATCGAAGT TGATTGTGTA TGGAGAAGAC AGGCAATACG CAATACAAAC TGCCATGAGG GCACTAGACG ATTACAAGAT TGGCGGAGTC AAAACGACTA TACCGCTATA CAAGCTCATC ATGAGGGATC CCGACTTTCA GGAAGGAAGG TTCAGTACTG CCTATATTTC CCAGAAGATT GACTCAATGG TTAAGAAACT GAAGGCCGAA GAGGAGATGA TGGCTTCAGT GGCCGCAGTT CTTCAGAGCA GGGGACTCCT TAGAAAGAAG GCTTCAGCTC CTCAGGAGCA GGCGAAACCA GGCTCAGGAT GGAAGAGTTA CGGTATCATG ATGCAGAGCA CTCCTAGGGT GATGTGGGGA TGA
|
Protein sequence | MPPFSRVLVA NRGEIAVRVM KAIKEMGMTA IAVYSEADKY AVHVKYADEA YYIGPSPALE SYLNIPHIID AAEKAHADAV HPGYGFLSEN ADFVEAVEKA GMTYIGPSAE VMRKIKDKLD GKRIAQLSGV PIAPGSDGPV ESIDEALKLA EKIGYPIMVK AASGGGGVGI TKIDTPDQLI DAWERNKRLA TQAFGRSDLY IEKAAVNPRH IEFQLIGDKY GNYVVAWERE CTIQRRNQKL IEEAPSPAIT MEERSRMFEP IYKYGKLINY FTLGTFETVF SDATREFYFL ELNKRLQVEH PVTELIFRID LVKLQIRLAA GEHLPFTQEE LNKRARGAAI EFRINAEDPI NNFSGSSGFI TYYREPTGPG VRMDSGVTEG SWVPPFYDSL VSKLIVYGED RQYAIQTAMR ALDDYKIGGV KTTIPLYKLI MRDPDFQEGR FSTAYISQKI DSMVKKLKAE EEMMASVAAV LQSRGLLRKK ASAPQEQAKP GSGWKSYGIM MQSTPRVMWG
|
| |
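The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the whitespace between the 10-character blocks is display formatting only. It also assumes that translation table 11 assigns sense and stop codons in the same way as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCCACCCT TTAGTAGAGT TTTGGTTGCA"
print(translate(prefix))  # MPPFSRVLVA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence verifies the whole record in the same way. `gc_percent` on the full gene sequence can be compared with the reported GC content.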