Gene Msed_0147 details


Gene Information

Locus tag: Msed_0147
Symbol:
ID: 5105000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 118141
End bp: 119673
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 47%
IMG OID: 640506050
Product: carbamoyl-phosphate synthase L chain, ATP-binding
Protein accession: YP_001190248
Protein GI: 146302932
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
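
The start, end, gene length and protein length fields are related by simple arithmetic. The Python sketch below shows that relationship under two assumptions that the record itself does not state: the coordinates are 1-based and inclusive, and the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(118141, 119673)
print(length_bp, "bp")                    # gene length
print(protein_length(length_bp), "aa")    # protein length
```

With the values from this record the two results agree with the listed gene length and protein length.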


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACCCT TTAGTAGAGT TTTGGTTGCA AACAGGGGAG AAATTGCAGT AAGGGTAATG 
AAGGCAATAA AGGAAATGGG AATGACAGCA ATAGCTGTTT ACTCTGAGGC TGACAAGTAC
GCAGTCCACG TTAAGTATGC CGATGAAGCT TATTATATTG GACCCTCGCC GGCCTTGGAA
AGTTACCTCA ACATACCCCA CATCATTGAC GCAGCGGAGA AGGCTCACGC TGACGCTGTT
CATCCTGGAT ATGGATTCTT GTCGGAGAAT GCTGACTTCG TGGAGGCAGT TGAAAAGGCA
GGAATGACTT ACATAGGTCC CTCTGCTGAG GTCATGAGAA AGATAAAGGA TAAGCTGGAT
GGGAAAAGGA TAGCCCAGTT ATCTGGTGTC CCCATTGCCC CTGGCTCGGA TGGCCCCGTA
GAATCCATTG ACGAGGCTCT TAAGTTGGCT GAGAAGATAG GATACCCCAT CATGGTTAAG
GCCGCTAGCG GGGGTGGTGG AGTAGGTATA ACAAAGATAG ATACACCTGA CCAGCTCATT
GACGCATGGG AAAGAAACAA GAGGTTAGCT ACACAAGCCT TCGGACGATC TGATCTATAC
ATAGAAAAAG CCGCCGTAAA CCCTAGGCAC ATTGAGTTTC AGTTAATTGG CGATAAGTAC
GGCAACTATG TCGTTGCTTG GGAGAGGGAA TGTACTATTC AGAGAAGAAA CCAGAAGTTG
ATAGAGGAGG CACCATCTCC AGCAATCACA ATGGAAGAAA GGTCACGAAT GTTCGAGCCT
ATATACAAAT ATGGGAAGTT AATTAATTAC TTTACCCTGG GTACTTTCGA GACAGTTTTC
TCTGATGCCA CAAGGGAGTT CTACTTCCTT GAGCTGAACA AAAGGCTTCA GGTAGAACAC
CCAGTTACTG AGTTAATATT CAGAATTGAT CTGGTAAAGC TACAGATAAG GCTAGCTGCA
GGAGAACATT TGCCATTCAC GCAGGAGGAA CTCAACAAGA GGGCGAGAGG TGCAGCAATA
GAGTTCAGGA TAAATGCCGA GGATCCAATA AATAATTTCA GCGGAAGCTC AGGTTTCATT
ACGTACTACA GGGAGCCCAC GGGTCCTGGA GTGAGAATGG ATAGCGGTGT AACGGAGGGA
AGCTGGGTAC CTCCTTTCTA CGACTCTCTA GTATCGAAGT TGATTGTGTA TGGAGAAGAC
AGGCAATACG CAATACAAAC TGCCATGAGG GCACTAGACG ATTACAAGAT TGGCGGAGTC
AAAACGACTA TACCGCTATA CAAGCTCATC ATGAGGGATC CCGACTTTCA GGAAGGAAGG
TTCAGTACTG CCTATATTTC CCAGAAGATT GACTCAATGG TTAAGAAACT GAAGGCCGAA
GAGGAGATGA TGGCTTCAGT GGCCGCAGTT CTTCAGAGCA GGGGACTCCT TAGAAAGAAG
GCTTCAGCTC CTCAGGAGCA GGCGAAACCA GGCTCAGGAT GGAAGAGTTA CGGTATCATG
ATGCAGAGCA CTCCTAGGGT GATGTGGGGA TGA
 
Protein sequence
MPPFSRVLVA NRGEIAVRVM KAIKEMGMTA IAVYSEADKY AVHVKYADEA YYIGPSPALE 
SYLNIPHIID AAEKAHADAV HPGYGFLSEN ADFVEAVEKA GMTYIGPSAE VMRKIKDKLD
GKRIAQLSGV PIAPGSDGPV ESIDEALKLA EKIGYPIMVK AASGGGGVGI TKIDTPDQLI
DAWERNKRLA TQAFGRSDLY IEKAAVNPRH IEFQLIGDKY GNYVVAWERE CTIQRRNQKL
IEEAPSPAIT MEERSRMFEP IYKYGKLINY FTLGTFETVF SDATREFYFL ELNKRLQVEH
PVTELIFRID LVKLQIRLAA GEHLPFTQEE LNKRARGAAI EFRINAEDPI NNFSGSSGFI
TYYREPTGPG VRMDSGVTEG SWVPPFYDSL VSKLIVYGED RQYAIQTAMR ALDDYKIGGV
KTTIPLYKLI MRDPDFQEGR FSTAYISQKI DSMVKKLKAE EEMMASVAAV LQSRGLLRKK
ASAPQEQAKP GSGWKSYGIM MQSTPRVMWG
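
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below shows one way to strip that layout, compute GC content, and translate the CDS so the result can be compared with the listed protein sequence and the reported 47% GC content. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). The helper names `clean_sequence`, `gc_content` and `translate` are hypothetical, chosen for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a cleaned CDS in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence listed above.
head = clean_sequence("ATGCCACCCT TTAGTAGAGT T")
tail = clean_sequence("ATGCAGAGCA CTCCTAGGGT GATGTGGGGA TGA")
print(translate(head))  # should match the start of the protein sequence
print(translate(tail))  # should match the end of the protein sequence
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_content` gives values to check against the protein sequence and GC content fields of this record.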