Gene Msed_1346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1346 
Symbol 
ID5103405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1319266 
End bp1320414 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content48% 
IMG OID640507235 
Productmajor facilitator transporter 
Protein accessionYP_001191428 
Protein GI146304112 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.217122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAG AGACCTTCCT AGCAATTGGT TTCATTGTGA TGTGCTTCAA CTCCTTATAT 
CAGTATTCAT GGAATGCTCT GGAGCCTCTC CTGAGGACAG GCTTCTCTGT TTCCGTGGTG
CAGATAGCCC TTGGCTTCAC CCTATTCAGT GTGTTCTCCT CCTTCTTCCA GCCCCTGGGA
GGTCATTTCG CTGATAGGGA TGGTCCTAGG AACGTGGGCA TCGTGGCATC GGTCCTGGCC
TCACTGGGGT TTCTTGGAAC TTACCTCTCG CCTAACATTC TCTACTTCTA CGTATTCTGG
TCCCTTGGAA GTATTGGTGA GGGAATACTT TACGGGATAG CAGCAAACCT CGCCATGAAA
TGGTTTATCG ACAGAATGGG TTTTGCCACG GGTATCGTAT CCATGGGATT TGGACTAGGC
TCGGTGGTGG CTAACCCCCT AATACTTCAC GTGGATAATT ACAAGATAGT CACGCTAACA
ATAGGTCTCT CTGAACTCGT AGTGGTCACG GTCCTGATGT CCCTTATTAG TTATCCCGCA
TCAAGTAAAG GAAGACCACC AGGGCAGGTG ATCTTCACTA CCAAGTTCTG GCTAATCTAC
GTCTCCTTCG TGGGCGCAGT TATTCCACTG ACTGCCATCT CCTCTCAACT TGCTGTTCTA
GGTAAAAACC TCTCACAGGA GGAACTTACG ATCCTTATCT CGATTTTTCC CTTACTCAGT
GGTGGGATGA GGCCAATCAT GGGAAGGATA GCTGACAAAG TTGGAATAGT GAGGACTACC
CTTATACTTA ACGCGATCCT GCTGGTGGGC TCCTTGACTC TCCTGGTCGG CCAATTGATC
CCAACGACGG TTCTGGTGGG ATTTGCTGGC GGGTCCATGA TCACCTTGTA CTTTAACGTT
GCAGGAGAGA TCTTTGGTAC TAGGTTCTCC ACGGTTAATA GCGGTATCCT GTATACGGGA
AAGGCACTTG GTGGGGTTCT GGGCAGTTTC GTTTTCGCCT ATCTTTATAC GTTAAACGTG
ACCACATCTG AGATTTATTT AGTTCTCGGA AGCCTTGTGG GGGTGCTTGC CTTGATTCCT
GTTATTCCTA GGATTAGAAC TGGTCAAAGG GTGATGGGTC AGCACGGCAA TCAGGGAAAC
ATGAAGTGA
 
Protein sequence
MKRETFLAIG FIVMCFNSLY QYSWNALEPL LRTGFSVSVV QIALGFTLFS VFSSFFQPLG 
GHFADRDGPR NVGIVASVLA SLGFLGTYLS PNILYFYVFW SLGSIGEGIL YGIAANLAMK
WFIDRMGFAT GIVSMGFGLG SVVANPLILH VDNYKIVTLT IGLSELVVVT VLMSLISYPA
SSKGRPPGQV IFTTKFWLIY VSFVGAVIPL TAISSQLAVL GKNLSQEELT ILISIFPLLS
GGMRPIMGRI ADKVGIVRTT LILNAILLVG SLTLLVGQLI PTTVLVGFAG GSMITLYFNV
AGEIFGTRFS TVNSGILYTG KALGGVLGSF VFAYLYTLNV TTSEIYLVLG SLVGVLALIP
VIPRIRTGQR VMGQHGNQGN MK