Gene Msed_1013 details

Gene Information

Locus tag: Msed_1013
Symbol:
ID: 5105612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 933229
End bp: 934542
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 50%
IMG OID: 640506912
Product: major facilitator transporter
Protein accession: YP_001191105
Protein GI: 146303789
COG category:
COG ID:
TIGRFAM ID:
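
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(933229, 934542)
print(length)                     # 1314
print(protein_length_aa(length))  # 437
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.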


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00471133
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATCCAA ATCTTCCCAG TAAGAACCTG GACAAATTCT CCTGGTCCAA GGTTCATACC 
TTCGCCTTCA TAGCTTTCTC TGCCGGATTT TTCCTAGAGG CTTACATTTT TGGTATGGCC
TCCATAGCAA CTGGCTGGGT AAGCGTTCCT AAGTTCCTCA CGAGCACTCT CCTGGCATGG
GCACCCCTCT GGCTAATCAT AGGGATTATG GTGACCGGTC CCCTTTCCGA TAGACTTGGC
AGGAAGACCA TGTTCTACAT CACCATGGCC TTATATGGGG TTGGCGCAGT GGGACTGGTT
TTCAGCGGGA CCTACTATCT AATCCTTCTT TTCCTTGCCA TGATGCTCTT CGCCGCTGGG
GGTGAGATGA ACACAATCAT GGTCGCGACC CACGAGATAA TGCCCAGGAA ACACAGGAGC
AAGGCCTTCT TCCTGGAGCT CAACTTCATT AACGTTGGAG GTTTCGTACT AGGTCTCGTG
GGTTACCTGG TTCAGAACCA ATCGGTGTTT TTCCAGAGAC TCATGATAGG TGTCACAGTC
CTCATAGTCC TCGTGGTTCT GATGTACACC AGGCTCAAGA TTCCCGAGTC CATACGGTGG
CTCGAGAAAC AGGGTAGACT TGAAGACGCG GACAAGGAAA TCAAGAAGTA CTTCGGAGAT
ATAAAGATAA TATCGCAGGA TGAATTGAGG CCAAAGATCA CGGTTAAGAA GCTTCCCATG
TGGTTTAAGC TCCTGGTCGT AATCCTGGTT GCGGCTGCCA ACACCATCGG TTACGGTCTC
ATGACCTACG TCCTGGGGCC CTACTACTTT CCGAGTCAGA CTGCCATGAT CATACTAGTG
ACCAACCTAG CTGAGATGCT TGTGGGACTG GTGATAGGCG TGTTCGCTGA CTCCCTGAGC
AGGAAACTAC TCCTCCTCAT CTCCTTTGTG GGGGCGACGG GATTCACGTT TCTGATCATG
GGGACAATAC CCATGTGGAG CAAGAGTCTG ACCCTATTCT ACTCTCTACT GGTGTTGCTA
AACGTGTTCG TAGGGATCTC CTATCTTACG GAAGATGCCC TCAAGAGCGA GATCTGGCCT
ACCCTCAAGA GGGGTACAAT AACGGCGGTG GCAAGGTTCA TATCCATTGG AGCTTACATT
CCCACGATTT ACCTTACAAG TAACTTCAGC ATTTTCCAGT ACACTCTGTT CAACGGGCTA
GTATGGGCAG TGGGTATGGT AGCAGCCATA TTATGGTTCG TGAAGGGGTA TGAGACCGGT
AAGGGAATCA GTGTCGATGA AATTTCGGAG GAAGTAGAAG GAACGAAGGT GTGA
 
Protein sequence
MDPNLPSKNL DKFSWSKVHT FAFIAFSAGF FLEAYIFGMA SIATGWVSVP KFLTSTLLAW 
APLWLIIGIM VTGPLSDRLG RKTMFYITMA LYGVGAVGLV FSGTYYLILL FLAMMLFAAG
GEMNTIMVAT HEIMPRKHRS KAFFLELNFI NVGGFVLGLV GYLVQNQSVF FQRLMIGVTV
LIVLVVLMYT RLKIPESIRW LEKQGRLEDA DKEIKKYFGD IKIISQDELR PKITVKKLPM
WFKLLVVILV AAANTIGYGL MTYVLGPYYF PSQTAMIILV TNLAEMLVGL VIGVFADSLS
RKLLLLISFV GATGFTFLIM GTIPMWSKSL TLFYSLLVLL NVFVGISYLT EDALKSEIWP
TLKRGTITAV ARFISIGAYI PTIYLTSNFS IFQYTLFNGL VWAVGMVAAI LWFVKGYETG
KGISVDEISE EVEGTKV
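
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, assuming plain whitespace is the only formatting to strip. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is just the first three blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three blocks of the gene sequence, used here only as sample input.
example = "ATGGATCCAA ATCTTCCCAG\nTAAGAACCTG"
seq = clean_sequence(example)
print(len(seq))  # 30
print(gc_fraction(seq))
```

Running the same two helpers on the full gene sequence would give a length and GC fraction to compare with the Gene Length and GC content fields.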