Gene Msed_2204 details

Gene Information

Locus tag: Msed_2204
Symbol:
ID: 5105424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2115023
End bp: 2116066
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 44%
IMG OID: 640508097
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001192266
Protein GI: 146304950
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
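
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Coordinates taken from the record for Msed_2204.
length = gene_length_bp(2115023, 2116066)
print(length, protein_length_aa(length))  # prints "1044 347"
```

Under these assumptions the computed values agree with the Gene Length (1044 bp) and Protein Length (347 aa) fields.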


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000400167
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCGTTT TGAGAAGGAT TGCTATCCTT ATCCCGACAT TGATCGGACT TACCCTTCTT 
GTGTTCGTAC TAATCCATCT TCAGGGTAAT AATTTAATTC TTTCTGAATA TCTAAATCCT
AGATTAACCG GTCAGGCAAG AGAACTCGCT ATTCAAAGAT TAACTCAAGA ATTTCACCTG
AATCAACCGG TGTATATACA GTACTTTTAT TGGTTAGCGC AAGTTTTTAG CGGGAACTTT
GGCTACACCA ACACTCCCAT ATTTAGCGGA CCAGTCTCGA CTGCAATTGT ACTTTTCCTC
CCAAACACGG TAATATTATC CCTCTTTGCT GCCCTGCTGA TCTGGCTGAT TGGGATTCCC
CTTGGCGTAT TCTCGGCAGT GAATAGGGAC TCGGCAGCTG ATCAGGGAAT AAGAGTTTTC
TCCTTCACTC TCTACTCTAT GCCAATTTAC TTGATCGCTA TTGCCCTAAT CCTTATCCTT
GGGGTGTATA CGGGTATATT ACCCTTCAGC GGAGAAGTCT CTCCTCAACT TGTTTCCGGT
CTACCCTGGT ACGTTAACGG AATATCTTAT CCCACCCATG TCCTTCTAAT TGACGCAATC
ATACACGGGG ATTTCGCAGT AGCATGGAAC GCATTCCTAC ATCTAATAAT GCCAGCCCTT
ACATTAGCCT TGGCGGTTAT GGCTGGGATT ATCAGAATAT TGAGAGCCAG CATGCTTGAA
ACTCTAGAGC AGGACTACAT TAAACTGGCC AGAGCTAAGG GTGTGCCTGA AAAGGTCGTT
AACAATCTTC ACGCAAGAAA GAGCGCAATG CTTCCAGTAG TTACGTCGTT TGGATACACA
GTCGCAGGGT TACTGGGAGG GGTAGTAGTG GTTGAGACGG TATTCGATTT TCCTGGAATC
GGGTATTGGA CAACGCAAGC ATTGTTGAAC GATGACGTAG GCGGCGTCAT GGCATCAACC
CTAATATTCG GTATAATACT GGTAGTAACG ACTTTAGTGC TGGACATCAT CTACGCAATC
ATAGATCCAA GGATTAGATA TTGA
 
Protein sequence
MFVLRRIAIL IPTLIGLTLL VFVLIHLQGN NLILSEYLNP RLTGQARELA IQRLTQEFHL 
NQPVYIQYFY WLAQVFSGNF GYTNTPIFSG PVSTAIVLFL PNTVILSLFA ALLIWLIGIP
LGVFSAVNRD SAADQGIRVF SFTLYSMPIY LIAIALILIL GVYTGILPFS GEVSPQLVSG
LPWYVNGISY PTHVLLIDAI IHGDFAVAWN AFLHLIMPAL TLALAVMAGI IRILRASMLE
TLEQDYIKLA RAKGVPEKVV NNLHARKSAM LPVVTSFGYT VAGLLGGVVV VETVFDFPGI
GYWTTQALLN DDVGGVMAST LIFGIILVVT TLVLDIIYAI IDPRIRY
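
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which start codons they allow). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third fastest (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in the record.
print(translate("ATGTTCGTTT TGAGAAGGAT TGCTATCCTT"))  # prints "MFVLRRIAIL"
print(translate("ATAGATCCAA GGATTAGATA TTGA"))        # prints "IDPRIRY"
```

The two fragments reproduce the first ten and last seven residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 347 residues.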