Gene Information |
Locus tag | Msed_2204 |
Symbol | |
ID | 5105424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2115023 |
End bp | 2116066 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640508097 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001192266 |
Protein GI | 146304950 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
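The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2115023
END_BP = 2116066


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1044 347
```

Under these assumptions the result agrees with the listed Gene Length (1044 bp) and Protein Length (347 aa).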
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000400167 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCGTTT TGAGAAGGAT TGCTATCCTT ATCCCGACAT TGATCGGACT TACCCTTCTT GTGTTCGTAC TAATCCATCT TCAGGGTAAT AATTTAATTC TTTCTGAATA TCTAAATCCT AGATTAACCG GTCAGGCAAG AGAACTCGCT ATTCAAAGAT TAACTCAAGA ATTTCACCTG AATCAACCGG TGTATATACA GTACTTTTAT TGGTTAGCGC AAGTTTTTAG CGGGAACTTT GGCTACACCA ACACTCCCAT ATTTAGCGGA CCAGTCTCGA CTGCAATTGT ACTTTTCCTC CCAAACACGG TAATATTATC CCTCTTTGCT GCCCTGCTGA TCTGGCTGAT TGGGATTCCC CTTGGCGTAT TCTCGGCAGT GAATAGGGAC TCGGCAGCTG ATCAGGGAAT AAGAGTTTTC TCCTTCACTC TCTACTCTAT GCCAATTTAC TTGATCGCTA TTGCCCTAAT CCTTATCCTT GGGGTGTATA CGGGTATATT ACCCTTCAGC GGAGAAGTCT CTCCTCAACT TGTTTCCGGT CTACCCTGGT ACGTTAACGG AATATCTTAT CCCACCCATG TCCTTCTAAT TGACGCAATC ATACACGGGG ATTTCGCAGT AGCATGGAAC GCATTCCTAC ATCTAATAAT GCCAGCCCTT ACATTAGCCT TGGCGGTTAT GGCTGGGATT ATCAGAATAT TGAGAGCCAG CATGCTTGAA ACTCTAGAGC AGGACTACAT TAAACTGGCC AGAGCTAAGG GTGTGCCTGA AAAGGTCGTT AACAATCTTC ACGCAAGAAA GAGCGCAATG CTTCCAGTAG TTACGTCGTT TGGATACACA GTCGCAGGGT TACTGGGAGG GGTAGTAGTG GTTGAGACGG TATTCGATTT TCCTGGAATC GGGTATTGGA CAACGCAAGC ATTGTTGAAC GATGACGTAG GCGGCGTCAT GGCATCAACC CTAATATTCG GTATAATACT GGTAGTAACG ACTTTAGTGC TGGACATCAT CTACGCAATC ATAGATCCAA GGATTAGATA TTGA
|
Protein sequence | MFVLRRIAIL IPTLIGLTLL VFVLIHLQGN NLILSEYLNP RLTGQARELA IQRLTQEFHL NQPVYIQYFY WLAQVFSGNF GYTNTPIFSG PVSTAIVLFL PNTVILSLFA ALLIWLIGIP LGVFSAVNRD SAADQGIRVF SFTLYSMPIY LIAIALILIL GVYTGILPFS GEVSPQLVSG LPWYVNGISY PTHVLLIDAI IHGDFAVAWN AFLHLIMPAL TLALAVMAGI IRILRASMLE TLEQDYIKLA RAKGVPEKVV NNLHARKSAM LPVVTSFGYT VAGLLGGVVV VETVFDFPGI GYWTTQALLN DDVGGVMAST LIFGIILVVT TLVLDIIYAI IDPRIRY
|
| |
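The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used by the database: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons), and it handles only the ATG start seen in this record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard genetic code in NCBI base order (T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGTTCGTTT TGAGAAGGAT TGCTATCCTT"
print(translate(prefix))  # MFVLRRIAIL
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.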