Gene Msed_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1121 
Symbol 
ID5104154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1050494 
End bp1051966 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content45% 
IMG OID640507014 
Productamino acid permease-associated region 
Protein accessionYP_001191207 
Protein GI146303891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0304101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGACA AACCTTTCAA ATTGGCGAAG GTAATAGGAC CCGTGGCCAT AATAGCCTCG 
GCAGTAAGTC AAGAATACGG AGCTGGTATC AACGCGGTAG CCACACAGAG TATTGGATCA
TATCCTGCCA TACTCAACCT TGTTCCAGCA ATCATGTTCA TAACTGGATT GCTCATGCTT
CCCAAGGTAT TCATGTATCA GAAGTTTGGC AAGGTGGCAA GCAGAAGCGG AGGACAATAC
GTCTGGATAT CTAGAACTAC CACTCCGGAG GTGGGATTTA TTGTTCACTT TCTATACTGG
ATTGGAATAG TATCTGCCAT AGGGTTCATT AGCTACACTG TTGGATCTAC CCTAGCTTCA
ACACTCGTGT CATTGGGGAT ATCCTCTGGG GCGTGGTTCG CTACATTTAC AGGGCATATT
GTGCTGGGGT TGGCCCTAAT ATGGTCCTTC TTCTTAATTC ATTACACGGG CGTGAGAAGC
TATGGAGTTG TGGTAACTCT GCTGTTTGCC CTCGTTTTGC TAGGGGCAAT CATATCAATG
GTTGCGGGTT TCGGCACCGC TAATTCTGTC TACACTGGTT ATTTATCGAG TCAAATATTT
CATGGAACGA TTCCCAGTTA CACTACACCT CCCCTAACTT ACTCGGATAT TTTCGGTACA
GTAACCCTGT TCATTTTTGC GTATGCGGGC ATAAGCGCGG CCCCTCTCCT AGGTGGAGAG
GCTAAGGATC CCAAAAAGGA CATGCCAAGG GGTATATTCC TAGCGTGGTT GATTGCGTTA
GTCCTGTTTA CCTTAGTTTC GCTTGCAGTC TTCCACGCAA TAACTGGAGG GCAAGTGTTT
GCGTTAATAA AATCAAAGTA TTCCTATTAC GCTACCATTC CTGGCATACT GAGCATATCT
GAACCGAAAC TTATCGGAGC TATATTCTCA ATCATAGTTA CAATTATTAT AATGAAGACA
ATCATGCCCC AGTTACTTAC CTCCAGTAGA ACGCTCTTTG CCTGGGGCCA AGACAAGATA
CTTCCTGAGG TCTTCACTCA CACTAACAAG TTTAAGGCAC CCGACTTCTC CCTGCTGGTA
TGCGCGCTAT TTGCATCAAT ATACCTAGTT TATACAACTA GCGTGGGTGT GTCCGCTGTG
GACGTAAGAT CCCTCTCTGT CCTACTTGAG ATGATGGCTC TCGGGGCAGG GGTACTTCTT
ATCTCGACCA AGAGTAGCAA GAAAGAATGG GAAAAGGAAG TGACGACAAT AGGTGCGATC
ATAGCAGGGT TAGCAGGTAT AATAGTCACG CTTATTATTA TTCCAAGCGT CGCCGTTGTA
CCCCACGTTT CAATTCTCCT TCAACCCTCG TTTCAAGTGA TATTGGTTAT AGTGATAGGT
TTCCTCATCT ATGAAATCGC AAAAATGTAT AACAAACGGA CTAAAAACAT CGATCTAAAT
GATCTAATAA AGAAAGAGCT ACCCCTGGAA TGA
 
Protein sequence
MSDKPFKLAK VIGPVAIIAS AVSQEYGAGI NAVATQSIGS YPAILNLVPA IMFITGLLML 
PKVFMYQKFG KVASRSGGQY VWISRTTTPE VGFIVHFLYW IGIVSAIGFI SYTVGSTLAS
TLVSLGISSG AWFATFTGHI VLGLALIWSF FLIHYTGVRS YGVVVTLLFA LVLLGAIISM
VAGFGTANSV YTGYLSSQIF HGTIPSYTTP PLTYSDIFGT VTLFIFAYAG ISAAPLLGGE
AKDPKKDMPR GIFLAWLIAL VLFTLVSLAV FHAITGGQVF ALIKSKYSYY ATIPGILSIS
EPKLIGAIFS IIVTIIIMKT IMPQLLTSSR TLFAWGQDKI LPEVFTHTNK FKAPDFSLLV
CALFASIYLV YTTSVGVSAV DVRSLSVLLE MMALGAGVLL ISTKSSKKEW EKEVTTIGAI
IAGLAGIIVT LIIIPSVAVV PHVSILLQPS FQVILVIVIG FLIYEIAKMY NKRTKNIDLN
DLIKKELPLE