Gene Msed_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1455 
Symbol 
ID5104825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1422222 
End bp1424129 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content45% 
IMG OID640507343 
Productamino acid permease-associated region 
Protein accessionYP_001191536 
Protein GI146304220 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGAG AAAAGTCTCA AAAAGCAGGG GATTTTGGGT TAGAATCCGA TAAGCAACTA 
AGAAGATCTC TAGGAAAGTT CGAGTTGCTA TATCTTTCAC TGGGAGGAAT CATAGGATCA
GGATGGTTAT TCGGAGCCCT ATATACCGCG GAAGATGCTG GTGGATCTGC TATATTGTCT
TGGATAATCG CAGGAGTACT CGTTCTTTTC GTCGGTCTAG CTTACTCTGA ATTAGGTTCT
GCCATACCAA AGTCCGGCGG TATAGTGAGG TATCCGCATT ATTCTCATGG AGGTGTAGCT
GGATATATTA TCACGTGGAC CTATTTTCTT TCCGCGGCAT CGGTTCCTGC CATAGAGGCC
ACAGCCACAG TAACCTATCT ATCTAGCTTG GTTCCAGCAC TTACAGTAAA CGGAGTGCTA
ACTCCACTGG GCATACTGAC AGCATATCTA TTCCTTCTAT TCTTCTTCTT CCTTAATTAC
ATCGGTGTAA ATATACTAGG AAAAGTTACA CACGGTGCAG GTTGGTGGAA GTTACTGATT
CCCTCAATAA CCGTAGTAAT TCTTCTTATA TTTTATTTCC ATCCAGCTAA CTTCACTCTT
GGCGGAGGAT TTTTCCCTTC TGCTTCTAAT GTTGCTGCTG GTTCATCAGG GATCTATGGA
TTTTCTGCAG TACTTTACGC AATTCCGACT ACCGGTGTAA TATTTTCCTA CCTAGGTTTT
AGACAGGCAG TAGAATATGG TGGTGAAGGT AGAAATCCAA AGAAAGATAT TCCCTTTGCG
GTCATGGGCT CCCTAGTAAT TGCGCTTATT CTATACACAC TTTTACAGGT AGCATTTATA
GGAGCAATAA ACTGGAACGC GCTCACTGTT ACAAGGGGAA ATACAACAGT ACCTGTTACG
CCAGGCAACT GGACAGAGTT AGGTCAAACT GCCATTTCAT CGGGACCATT CTATCAGATT
TTCAAACTAG CGGCTCCTCT AGGTCTATTA TCATTAATTT TCAGTGGATG GGCATACATA
CTGCTTCTAG ATGCTGTGAT TTCTCCGAGC GGGACAGGGT GGATCTATAC TGGTACCAGT
ACTAGGACAA TGTATGGCTT CGCTACAAAC GGCTACTTGC CTGGTATCTT TCTAAAGGTG
GGTAAAACTA GGATTCCAAT TTACTCTTTG ATAGCCGCGA CTATCATTGC CGCAATATTC
ATGTTACCCT TCCCATCTTG GCAATCGCTG GTTGGTTTCA TAAGTTCGGC CACAGTCTTT
ACCTACATAA TGGGAGGGAT AGGGCTTGAG ACTCTCAGGA AGACCGCCCC AGAACTCAAT
AGGCCGTACA AGTTGCCCTT AGCAAGGGTT ATAGCGCCAA TTGCTACACT TGCGGCGGGC
CTGATAGTGT ATTGGTCAGG TTTTGCCACC CTGTTCTACG TTATCACTGG GATATTCTTG
GGATTTGCCT TATTTTTTGG CTACTATGCC TTCAAGGTTA TGGGAATTAA TAAGGCGTTT
TCCGCTATCG TAGGATTGGT AAACATAGTG GTGACCCTAG TGTTAGCCTT TGAATTCTAC
GGTGCCACCT CCGGTCTAAC TGCAGCGAAC AATGTGGCGT TCTTGATCTA TATCCTAGTC
ATGGCAGGCC TAGTAGCGTT TGATGTGGGA GTGCTTCATG CATTTGGCAA GGGTGAAGAT
GTGAAAAGGG AGATAACTGC TAGCTACTGG TTGCTAGCCT ACATTTTCGT AGTAGCCATC
ATTTCATACT TTGGAGGTTT CGGACTAAAT CCGGTGATTC CATTCCCCGA GGACACCATT
GTGGCTGCAG TGGTTACTCT AGCAGCCCAC TATGGGGCAG TGAAAAGCGG ATTTAGAACT
CAGGCCATAC AAGATATCCT AGAGGAAACA AGGGAGACCC CACCCTAA
 
Protein sequence
MSGEKSQKAG DFGLESDKQL RRSLGKFELL YLSLGGIIGS GWLFGALYTA EDAGGSAILS 
WIIAGVLVLF VGLAYSELGS AIPKSGGIVR YPHYSHGGVA GYIITWTYFL SAASVPAIEA
TATVTYLSSL VPALTVNGVL TPLGILTAYL FLLFFFFLNY IGVNILGKVT HGAGWWKLLI
PSITVVILLI FYFHPANFTL GGGFFPSASN VAAGSSGIYG FSAVLYAIPT TGVIFSYLGF
RQAVEYGGEG RNPKKDIPFA VMGSLVIALI LYTLLQVAFI GAINWNALTV TRGNTTVPVT
PGNWTELGQT AISSGPFYQI FKLAAPLGLL SLIFSGWAYI LLLDAVISPS GTGWIYTGTS
TRTMYGFATN GYLPGIFLKV GKTRIPIYSL IAATIIAAIF MLPFPSWQSL VGFISSATVF
TYIMGGIGLE TLRKTAPELN RPYKLPLARV IAPIATLAAG LIVYWSGFAT LFYVITGIFL
GFALFFGYYA FKVMGINKAF SAIVGLVNIV VTLVLAFEFY GATSGLTAAN NVAFLIYILV
MAGLVAFDVG VLHAFGKGED VKREITASYW LLAYIFVVAI ISYFGGFGLN PVIPFPEDTI
VAAVVTLAAH YGAVKSGFRT QAIQDILEET RETPP