Gene Msed_0444 details


Gene Information

Locus tag: Msed_0444
Symbol:
ID: 5105440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 398865
End bp: 400277
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 47%
IMG OID: 640506350
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001190545
Protein GI: 146303229
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
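
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; both assumptions are consistent with the numbers in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(398865, 400277)
print(length_bp, protein_length(length_bp))  # prints: 1413 470
```

Both results match the Gene Length (1413 bp) and Protein Length (470 aa) fields.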


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGG TATTGAGAGA CCTGCTAAGC AGGAAAACTT TCATTTTCTC GGTAGTCGTA 
ATCCTTTTCT TTGCTGCTAT TGCCCTTTTA GCTCCAGTTC TAACGTCATA CAACAATCCT
TATCTAGTTT CACAACAGTT CGTTGCGGCC CCCTACGCTG TTCCTTCGTG GGCCACAATC
TTCCCTCAAT ATCATGGTCT TCCCCCTGAC GTTAGAATGA CGTTGTCTCC CTCTGAGACC
TCAGGACTTA CCCCCATTAA CTCGTCCATG GTTAAGGTGG AGGTTCCTGC AGGGGGTCAG
GTTAATCTGA CTTACGCCAT TAGATGGAAT TGGAGTTCTC CATACAACGT ACTTCTGTCC
TTTACCCTAG TTACGCCATC TACAAGCGAC TTTAACGTAA ATCTTTACAT GAACAATATC
AACTTCATGG AACTGTCACC TTTACCGATA CCTCCTGCAG TTAGCGTTAC TCCCGGGAAG
GCCAATTACG TTACCTTCTC TTCAGAAACT ATAAACCCAA GCAATTCTCC GTACGTAAGC
TCCCTTCCCT TCCAAGATCA GCCCTTAGCA TCACTTGAGT TCCCCAAGGC TGTACTGCCT
AAACCTGGGA TATATTACCT GATTATATCA TTCCAGAATA CAGGAAATTC ACCTGAGACT
TTCCTTGTTT CCAACCCACA TTACTCCTCT CTTGGTTACG CTTATGGTAG GTTGGGAACA
GATGATAATG GCGCGAGCGT GTTCTCGGAG TTCGTGTATG GAGCGAGGTT CGATCTCTAT
TTAGCCCTTG TAGCCTCAGC CCTTATTATA GGAATAGGAC TCATAATTGG GCTGATAGCG
GGCTACGTGG GCGGTTTCAC GGATCTGGCC CTGAATGCTC TTACAGACTT CTTTCTGTTA
ATACCGGGTT TACCTCTCTT GATTGTTTTG ATCTCTATCT TCGATCTCAC TGGGGTCATA
GTTAACGTGA ACAAGGCCGT CCTTATACTA CTCATCATCT CGTTGTTATC ATGGCCTGGG
ACTGCTAAGA TAATTAGGGG ACAGACACTA AGCCTCAGGA ACAGAACCTT CGTGGAAGCT
TCTAGGGCTC TGGGCGAGGG AAGGTTTAGG ATCCTGTTCA GACATATAGT TCCCAACCTG
ATGGGAATTC TATTTGCTCA ACTGGCATAT GACGTTCCAG GCGTTATCCT GGCTGAGTCG
GGTCTCGACT TCCTGGGCCT GGGAATTACA GAGTTCCCGA CCTGGGGAAA CATGCTTGGA
TTTGCCACCA ATGATTTGTC CTTTGCCAAT GGGTTTGCAT GGTGGTGGGT GCTTCCACCT
GGAATTGGGA TAATATTGTT AAGTACAGCG TTCTACTATT TCGGGACAGC AATGCTTGAC
GTCCTTAGTC CCTACAAGCT TAGGGGTGAA TGA
 
Protein sequence
MNKVLRDLLS RKTFIFSVVV ILFFAAIALL APVLTSYNNP YLVSQQFVAA PYAVPSWATI 
FPQYHGLPPD VRMTLSPSET SGLTPINSSM VKVEVPAGGQ VNLTYAIRWN WSSPYNVLLS
FTLVTPSTSD FNVNLYMNNI NFMELSPLPI PPAVSVTPGK ANYVTFSSET INPSNSPYVS
SLPFQDQPLA SLEFPKAVLP KPGIYYLIIS FQNTGNSPET FLVSNPHYSS LGYAYGRLGT
DDNGASVFSE FVYGARFDLY LALVASALII GIGLIIGLIA GYVGGFTDLA LNALTDFFLL
IPGLPLLIVL ISIFDLTGVI VNVNKAVLIL LIISLLSWPG TAKIIRGQTL SLRNRTFVEA
SRALGEGRFR ILFRHIVPNL MGILFAQLAY DVPGVILAES GLDFLGLGIT EFPTWGNMLG
FATNDLSFAN GFAWWWVLPP GIGIILLSTA FYYFGTAMLD VLSPYKLRGE
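
The sequences above are displayed in space-separated blocks of 10 characters, 60 per row. A minimal sketch of how such a listing might be joined into one continuous string and its GC fraction computed is given below. It is not part of the record; the function names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGAACAAGG TATTGAGAGA CCTGCTAAGC AGGAAAACTT TCATTTTCTC GGTAGTCGTA"
seq = clean_sequence(first_row)
print(len(seq))  # prints: 60
print(round(gc_fraction(seq), 2))
```

Running the same two functions on the full gene sequence should give a length of 1413 and a GC fraction comparable to the GC content field in the Gene Information table, although that comparison has not been verified here.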