Gene Msed_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0344 
Symbol 
ID5105502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp298619 
End bp300067 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content48% 
IMG OID640506250 
Productcarboxypeptidase Taq 
Protein accessionYP_001190445 
Protein GI146303129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCAAG AAATTCTTGA AAAGTACAAG AGGGTCTGGT CCCTGAACTA CTCTCAAGCC 
CTTCTAGCGT GGGATCTGGA AACCTACATG CCAGAAGAGG ACTCAGCGTT AAGGGGAGAA
GTACTGGCTA ACATCTCCAC CATGATCAGG GAAATGACCA TGGCACTTGA ACCTGACGTG
GAAAAGGTGA GGGAAGAGGA CTTAGACGAC TTCGGGAAGG GCGTAATTCG GGTACTGAGG
AGGTCACTTA GGTTCTACAA GCTTGTCCCC AAGGAGATCA CAGAGGAACT GGACAGGTTA
ACCTCCCAAA GCGTGGTAGT ATGGAGGGAG AGTAAAAGGA AAGGGGACTT CAACCTGTTC
AAGCCTTACC TGGAGAGGAT AGTGGAGCTT CAGAGGAGTA TTGCTGAGAA ACTGGGTTAC
GAGGGACATC CCTATAACGC GTTAGTTGAT CTCTACGAGG AAGGGATAAC GGTGACCGAT
CTAGATGCCG TGTTCTCTCA GTTACTTCCA GATCTGAGGA CCATCCTGGA AAAGGTGTTG
GCCGAGGGCT ATTTTCCCTC TAATCATCCG CTCAAGGAAA TGAGCTATGA CCCAAAGGTT
ATGGAGGAGG TGAATAGGGA GGTCCTGAAG ATTCTCAACA TGCCCACGAA AACGTTTAGG
ATGGACGTAT CCGCACACCC GTTCACGATT AGAATATCAT CTAAGGACGT AAGGATTACG
ACCCGATATG AGGGGATAGA CTTCAGGAGT ACCATATTCT CGGTAATACA TGAATCAGGA
CATGCCATGT ACGAGCTTAT GGTTGATCCT GCATATGAGA TGACTCCAGT TGCCGGTGGA
GCCTCAACAG GGATTCATGA GTCTCAATCG AGGTTCTGGG AGAACATAGT GGGTAGAAGT
AGGGAGTTCA CTAACATCCT CTACCCCATC CTTAAGAGTA AGCTCCCAAT CAAGGATGAC
CAGGAGTCGC TGTACAGGTA CTTCAACATG GTGTCGCCAA GCCTCATTAG GGTAGATGCA
GACGAAGTCA CATACAACTT TCACATTGCC CTCAGGTACG AAATTGAGAA GAACCTTATC
TCCGGTAAGC TGAGCGTAAG CGATCTTCCG TCAATGTGGA ACGACTTCAT GGATAAGTAC
CTGGGAGTCA GACCCAAGCA TGATGGAGAA GGGGTTCTGC AGGACATCCA CTGGTCACAG
GGTAGTTTCG GCTACTTCCC GACGTACACC TTGGGAAACG TTCTAGCTGC GACCATCTAC
CATTTCATAG AGGACTTGCC TACGAAGGTG AGCAGGGGAG ACGTGAACGG AATAAGGGCT
TTCCTGTCGG AGAAGATATG TAAGTATGGG GCTGTTTATC CACCTAAGGT TCTCTTAACC
AAGGCATTCG GTGAGGTCTA TAACCCAAAG AGACTATCGT CCTATCTTGA AAAGAAATAC
ATAGCCTAA
 
Protein sequence
MLQEILEKYK RVWSLNYSQA LLAWDLETYM PEEDSALRGE VLANISTMIR EMTMALEPDV 
EKVREEDLDD FGKGVIRVLR RSLRFYKLVP KEITEELDRL TSQSVVVWRE SKRKGDFNLF
KPYLERIVEL QRSIAEKLGY EGHPYNALVD LYEEGITVTD LDAVFSQLLP DLRTILEKVL
AEGYFPSNHP LKEMSYDPKV MEEVNREVLK ILNMPTKTFR MDVSAHPFTI RISSKDVRIT
TRYEGIDFRS TIFSVIHESG HAMYELMVDP AYEMTPVAGG ASTGIHESQS RFWENIVGRS
REFTNILYPI LKSKLPIKDD QESLYRYFNM VSPSLIRVDA DEVTYNFHIA LRYEIEKNLI
SGKLSVSDLP SMWNDFMDKY LGVRPKHDGE GVLQDIHWSQ GSFGYFPTYT LGNVLAATIY
HFIEDLPTKV SRGDVNGIRA FLSEKICKYG AVYPPKVLLT KAFGEVYNPK RLSSYLEKKY
IA