Gene Msed_1033 details


Gene Information

Locus tag: Msed_1033
Symbol:
ID: 5104333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 956433
End bp: 957971
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 54%
IMG OID: 640506929
Product: ABC transporter related
Protein accession: YP_001191122
Protein GI: 146303806
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID:
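
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption about the database convention, though it is consistent with the values listed) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Sketch: cross-check the coordinate and length fields of the record.
# Assumption: start/end are 1-based, inclusive coordinates.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon

length_bp = gene_length(956433, 957971)
print(length_bp, protein_length(length_bp))  # 1539 512
```

Both results agree with the Gene Length and Protein Length fields of the record.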


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.264472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTTGTAA GGATAAGGGA CCTTAAGGTA ACTTATCTGG GAAGAGGAAG GCCCTCCCTT 
CAGGTGGACA GCCTGGATAT CAAGGAGGGG GAGTCGGTCC TAGTACTCGG CAAGTCCGGT
TCAGGGAAGT CTACCCTGGT GAGCTCCTTG AATGGTGTGA TCCCCAACCT AATCTCGGCC
AAGGTTGAGG GAGAGATCAC GGTTTTCGGG AGGGACCCCA GGAAGACACC TGTTCACGAA
ATGGCCAAGC TAGTGGGAAC CCTCCTGCAG GACCCTGAGG CTCAGGTTTT CCATCACCTG
GTTCGAGACG AGATCGCCTT CGGACCGGAG AACTTTGCCC TTCCTAGAGA GGAGATCCTG
TCACGGGTTG AGGAGTCCGC AAGGGTCACA GGGGTCTCCC ACCTCATGAT GAGGGAGACA
TCCTCACTGT CCGGGGGTGA ACTTCAGAGG ACTGTGCTTG CCTCCGTCTT AGCCTTGAGG
CCTAGGGCGC TCATCCTTGA TGAGCCCACG TCCAGCATTG ACCCCCAAGG GACAGCGGAG
ATCCTGGGTC TCCTGAGGTC GCTAAGGAAC TCCGGCGTGA GCATGATAAT TGTGGAGCAT
AAGGTTGAGA GGGTTCTGCC TTACGTGGAT AGGGTTATCC TAGTGGATGG AGGAAGGGTT
GCCCTGAACG TCGAGAAGGC CAGATTAATG GAGCACGTCG ACCTGTTAAC CAGGGCAGGG
GTTGAGGTAC CCGAGTATTA CCTTCATATG AAAAGGTACG GCGTAACTCG GGATTCCCTC
TCGACGTATA GGCGAAGTCC GATCCCCAGG GTGAGGGGTG GGAGCATCTC ACTCTTCGCG
AGGGTTAAGG TTTGGACTAA GGAAGGAAAG GTCCTAGTCG ACACCGAGAT TCAACTGAGG
AAGGGCGAGA TAGTTGCCCT CATGGGAAGG AATGGGGCAG GGAAGACGAC TCTCCTGAAG
GCGATCATGG GCCTTCTGGA CACCAAGTTG AGGAGCGAGG TTCACCTGGT CGTGAGTGGG
AAGGACATCT CCAGGTCTAG GTATTACGAG AGGGGAAGTT ACGTCGCATA TTTACCTCAA
AACTTCGACG TAATGTTTGT CAGGAGAACT GTGGAGGACG AGATTAAGGC CTCCTCCAAC
GATCCAGAGC AATACCTCAA GTTATTCTCG TTGAACCAAG TAAGGAAAGA GGATCCCTTA
ACCCTATCCT TTGGTCAGAG GAGGAGAGTA GCCATGGCCT CTATCCTCGG AAGGGGGCAG
AGGGTGGTCC TGATGGATGA GCCCACGAGT GGACAGGATT GGTATCATAG GGAGAACCTG
GGGAAAGAGT TGAGGGAACT GGGGAAGAGG GGAATATCAA CGCTCGTGGT CACACACGAT
TCAAGGTTCG TAGACAAGTT CTGCGATAGG GTGATCGTGA TGGACCAGGG AAGAATCGTG
ACTGAGGGAA CGCCAGAGGA GGTGTTCAGA GTGGGGATCG TGACTCCGCC CACTGAGTAC
CTGGTTGAAG CTGGAACCTG GAATCCGCTG GAGGGATAA
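
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. The helper names are hypothetical, and only the first displayed line is used as input here; the full sequence would be pasted in the same way.

```python
# Sketch: normalise the block-formatted sequence and compute GC fraction.

def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()

def gc_content(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)

# First displayed line of the gene sequence, copied from the record.
first_line = "TTGTTTGTAA GGATAAGGGA CCTTAAGGTA ACTTATCTGG GAAGAGGAAG GCCCTCCCTT"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 3))
```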
 
Protein sequence
MFVRIRDLKV TYLGRGRPSL QVDSLDIKEG ESVLVLGKSG SGKSTLVSSL NGVIPNLISA 
KVEGEITVFG RDPRKTPVHE MAKLVGTLLQ DPEAQVFHHL VRDEIAFGPE NFALPREEIL
SRVEESARVT GVSHLMMRET SSLSGGELQR TVLASVLALR PRALILDEPT SSIDPQGTAE
ILGLLRSLRN SGVSMIIVEH KVERVLPYVD RVILVDGGRV ALNVEKARLM EHVDLLTRAG
VEVPEYYLHM KRYGVTRDSL STYRRSPIPR VRGGSISLFA RVKVWTKEGK VLVDTEIQLR
KGEIVALMGR NGAGKTTLLK AIMGLLDTKL RSEVHLVVSG KDISRSRYYE RGSYVAYLPQ
NFDVMFVRRT VEDEIKASSN DPEQYLKLFS LNQVRKEDPL TLSFGQRRRV AMASILGRGQ
RVVLMDEPTS GQDWYHRENL GKELRELGKR GISTLVVTHD SRFVDKFCDR VIVMDQGRIV
TEGTPEEVFR VGIVTPPTEY LVEAGTWNPL EG
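
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted initiation codons. It also assumes that the input begins at the annotated start codon, so the first codon (TTG in this gene) is read as methionine. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order (standard code; table 11
# uses the same assignments and differs in allowed start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at the annotated start codon, so the
    first codon is emitted as 'M' whatever its usual assignment.
    """
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)

# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGTTTGTAA GGATAAGGGA CCTTAAGGTA"))  # MFVRIRDLKV
```

The output matches the first ten residues of the protein sequence listed above.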