Gene Msed_0462 details


Gene Information

Locus tag: Msed_0462
Symbol:
ID: 5105458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 416383
End bp: 417741
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 48%
IMG OID: 640506368
Product: major facilitator transporter
Protein accession: YP_001190563
Protein GI: 146303247
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
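
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(416383, 417741)
print(length)                  # 1359
print(protein_length(length))  # 452
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.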


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.042885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTAT GGACAGATGC CATGGAACCT GAAAGATTTG GAGTTGATTC TGTAAAGGCG 
GTCTCATCCC AGTTTGTGGG ATTCCTTCTC GACTCCTACG ATCTAACAAT GATTTTGAGT
ATAGCTCCCA TCTTAGCCAA GGTTCTTCTA CCTCCCGAGT CCCCCCTGCT TGCTACCTTT
AACATAATTC TGAGTTACTC CTTAACAATC ATCTTTAGAC CTCTTGGCTC AGCCATATTT
GGAAACCTTG GCGATAAGAT TGGGAGGAGA GCTGACCTAA TCATCACAGT CCTTGGCCTA
GGCTTGGTAA GTGCACTCAC TGCTGCCCTT CCCACATACT CGGAGGTGGG AATACTCTCC
TTCGTTCTTT TTGTTCTCCT CAGGGTACTG GTTGGGATTT TCGCTGGCGG CGAGTACTCC
GCGGGCCATC CCTTTGCCAT GGAGTGGACC CCTTTCAAGT GGAGAGGTTT GATAAGCGGA
TTTGTACAGG GAGGTTTTTC CTTTGGAGCT GCCCTGGCAG CTGTAGTTGA GGGAGCCTTC
GTGGGAATAT ACGGGTTAAA GGGAGTGGAG GACTTCGCAT GGAGATATGT TTTCCTTACT
GCTCTAGCAC CCGCGGTCAT CGCCCTAGCT GTGAGGCTAT CAATGAAGGA GACTCCGGTG
TTTCAGGACG TGAAGAACAG GAACATGGTC AGGAAAAGTC CGCTCACAGA TCTGTTTAGA
AAGCCCTACA GGAGGGACTT CTTTCAGGTT ATGGTGTATA TGACTGGTAT GTTCTTTTAC
GCCTATTCCC TATTTGCCTT CGTGCCAGCG ATTCTTGAAC ACGCTCCCTC GGTGTTCTCC
CTTCAAGAGG CTGAGAGCAT CTACTCCTAC GGTACTTATG CAGCCTTTGC AGGTGCAGTG
ACGTTTGGGG CCCTTTCCCA GTACCTAGGG AGGAGAAGAT TGACACTGAT TTGGGTCTTC
ATTACCTTAA TTCTCTCTGT CCCTGTATAC TATCTTCTCT TTACCTCCGC GAAGTCAGGT
AATGTGGTCG GAGCATCCTT AGCTTCCGTG TTGATAGGAA TAATAACTCA GGCTCCCTGG
GGAGTAATAC CCATCTATCT CTCTGAGAGA TTCAAGGCCT CAATGAGAGC TTCAGGTGTA
GGTTTTGGGT ATTCATCTGG AATATTTGTG GGAGGTTGGT TTAGCATATA CGTGGAACTT
ATGCACGAAT ACCTATTCAA GGGAATTGAC ACTCCAGAGA ACGTTTGGTT CTCTACAGCA
GCTTTGCTAA TCCTAGGTGC AATATTTGTG GGGATAGGAC AATACCTGGG TCCAGAAACC
CTGGGAACGA GACTAACCGA AGAACCCCAA AAGGTATAA
 
Protein sequence
MILWTDAMEP ERFGVDSVKA VSSQFVGFLL DSYDLTMILS IAPILAKVLL PPESPLLATF 
NIILSYSLTI IFRPLGSAIF GNLGDKIGRR ADLIITVLGL GLVSALTAAL PTYSEVGILS
FVLFVLLRVL VGIFAGGEYS AGHPFAMEWT PFKWRGLISG FVQGGFSFGA ALAAVVEGAF
VGIYGLKGVE DFAWRYVFLT ALAPAVIALA VRLSMKETPV FQDVKNRNMV RKSPLTDLFR
KPYRRDFFQV MVYMTGMFFY AYSLFAFVPA ILEHAPSVFS LQEAESIYSY GTYAAFAGAV
TFGALSQYLG RRRLTLIWVF ITLILSVPVY YLLFTSAKSG NVVGASLASV LIGIITQAPW
GVIPIYLSER FKASMRASGV GFGYSSGIFV GGWFSIYVEL MHEYLFKGID TPENVWFSTA
ALLILGAIFV GIGQYLGPET LGTRLTEEPQ KV
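
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain-text input copied from this page; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as shown on this page.
first_line = "ATGATATTAT GGACAGATGC CATGGAACCT GAAAGATTTG GAGTTGATTC TGTAAAGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Passing the full gene sequence block through `clean_sequence` gives a string whose length can be compared with the Gene Length field, and `gc_content` of that string can be compared with the GC content field.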