Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1033 |
Symbol | |
ID | 5104333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 956433 |
End bp | 957971 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640506929 |
Product | ABC transporter related |
Protein accession | YP_001191122 |
Protein GI | 146303806 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.264472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTGTAA GGATAAGGGA CCTTAAGGTA ACTTATCTGG GAAGAGGAAG GCCCTCCCTT CAGGTGGACA GCCTGGATAT CAAGGAGGGG GAGTCGGTCC TAGTACTCGG CAAGTCCGGT TCAGGGAAGT CTACCCTGGT GAGCTCCTTG AATGGTGTGA TCCCCAACCT AATCTCGGCC AAGGTTGAGG GAGAGATCAC GGTTTTCGGG AGGGACCCCA GGAAGACACC TGTTCACGAA ATGGCCAAGC TAGTGGGAAC CCTCCTGCAG GACCCTGAGG CTCAGGTTTT CCATCACCTG GTTCGAGACG AGATCGCCTT CGGACCGGAG AACTTTGCCC TTCCTAGAGA GGAGATCCTG TCACGGGTTG AGGAGTCCGC AAGGGTCACA GGGGTCTCCC ACCTCATGAT GAGGGAGACA TCCTCACTGT CCGGGGGTGA ACTTCAGAGG ACTGTGCTTG CCTCCGTCTT AGCCTTGAGG CCTAGGGCGC TCATCCTTGA TGAGCCCACG TCCAGCATTG ACCCCCAAGG GACAGCGGAG ATCCTGGGTC TCCTGAGGTC GCTAAGGAAC TCCGGCGTGA GCATGATAAT TGTGGAGCAT AAGGTTGAGA GGGTTCTGCC TTACGTGGAT AGGGTTATCC TAGTGGATGG AGGAAGGGTT GCCCTGAACG TCGAGAAGGC CAGATTAATG GAGCACGTCG ACCTGTTAAC CAGGGCAGGG GTTGAGGTAC CCGAGTATTA CCTTCATATG AAAAGGTACG GCGTAACTCG GGATTCCCTC TCGACGTATA GGCGAAGTCC GATCCCCAGG GTGAGGGGTG GGAGCATCTC ACTCTTCGCG AGGGTTAAGG TTTGGACTAA GGAAGGAAAG GTCCTAGTCG ACACCGAGAT TCAACTGAGG AAGGGCGAGA TAGTTGCCCT CATGGGAAGG AATGGGGCAG GGAAGACGAC TCTCCTGAAG GCGATCATGG GCCTTCTGGA CACCAAGTTG AGGAGCGAGG TTCACCTGGT CGTGAGTGGG AAGGACATCT CCAGGTCTAG GTATTACGAG AGGGGAAGTT ACGTCGCATA TTTACCTCAA AACTTCGACG TAATGTTTGT CAGGAGAACT GTGGAGGACG AGATTAAGGC CTCCTCCAAC GATCCAGAGC AATACCTCAA GTTATTCTCG TTGAACCAAG TAAGGAAAGA GGATCCCTTA ACCCTATCCT TTGGTCAGAG GAGGAGAGTA GCCATGGCCT CTATCCTCGG AAGGGGGCAG AGGGTGGTCC TGATGGATGA GCCCACGAGT GGACAGGATT GGTATCATAG GGAGAACCTG GGGAAAGAGT TGAGGGAACT GGGGAAGAGG GGAATATCAA CGCTCGTGGT CACACACGAT TCAAGGTTCG TAGACAAGTT CTGCGATAGG GTGATCGTGA TGGACCAGGG AAGAATCGTG ACTGAGGGAA CGCCAGAGGA GGTGTTCAGA GTGGGGATCG TGACTCCGCC CACTGAGTAC CTGGTTGAAG CTGGAACCTG GAATCCGCTG GAGGGATAA
|
Protein sequence | MFVRIRDLKV TYLGRGRPSL QVDSLDIKEG ESVLVLGKSG SGKSTLVSSL NGVIPNLISA KVEGEITVFG RDPRKTPVHE MAKLVGTLLQ DPEAQVFHHL VRDEIAFGPE NFALPREEIL SRVEESARVT GVSHLMMRET SSLSGGELQR TVLASVLALR PRALILDEPT SSIDPQGTAE ILGLLRSLRN SGVSMIIVEH KVERVLPYVD RVILVDGGRV ALNVEKARLM EHVDLLTRAG VEVPEYYLHM KRYGVTRDSL STYRRSPIPR VRGGSISLFA RVKVWTKEGK VLVDTEIQLR KGEIVALMGR NGAGKTTLLK AIMGLLDTKL RSEVHLVVSG KDISRSRYYE RGSYVAYLPQ NFDVMFVRRT VEDEIKASSN DPEQYLKLFS LNQVRKEDPL TLSFGQRRRV AMASILGRGQ RVVLMDEPTS GQDWYHRENL GKELRELGKR GISTLVVTHD SRFVDKFCDR VIVMDQGRIV TEGTPEEVFR VGIVTPPTEY LVEAGTWNPL EG
|
| |