Gene Msed_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1368 
Symbol 
ID5103427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1337397 
End bp1339268 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content48% 
IMG OID640507257 
Producthypothetical protein 
Protein accessionYP_001191450 
Protein GI146304134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCCT GGATTCTCCT GATTGTTCTG GTCTTGGGGT TACTCCCAGG TGTAGGGGCT 
TTTTCTTCAC CAACCATGCC TCACAACTTC ACCCTATACA ACATAACGGC GATGCAGGGT
CTAGATCCGA AGTACTACTC CTTCGAGGCT GTAGGGTATC TCCCACCCAA CGAGACTCCT
GTCACTGTCA CGGTAGCCAA GAACCTAATA TTCAACGATA CGGCGTGGAA ACCCTACTAC
TTGAACGTTC ACATTCCCAA TGGATCCTAC AGCTTCATCC TCATGAACGT TAGCGTGAAG
GAAGAGAATG GAACCCAGTT TGATCGCCCC TTCTACATTT TCGTGAACGG CATACCGGTG
TTCTGGGGAT CTACACAGGA GATTCAGAAC TCGAGTGCCT CTGTAGACCT AACCATGTTT
GAAAACCTCC TACACGGAAA CGTAACCTTT GAACCTGTCC TCGTGAATTT CTACGACGCA
AAGGTGAACA TAACAGGGAT CTATCTAGTC AACATAACTC TGTCCCTCTA CCCAGGACAG
GCCCCGAGTA ACTTGCCCAA CGAGTTCATA CCTCTCTTCG TGAATGGGAC TTTCAACTAC
AACTACTCGT ACGTCATTCT TAACCCTAAT CAGGACACTA TTACTTCTTC GGTTAAGTTA
CCTAACGGCA CGTATAGGAT GTCAGCTTTC CTTTACGAGG AAGGCGGAGG ACTAGACGAG
TTCTGGTACA GTAACGAGCC GGCCACCAGG GACATCCTGC TCTACTATGA CGGTCTCCTC
GGAGGAGTAG TTCCTCCTTA CGAAACAATA TACACTGGTG GTATTGATCT GTTCTGGTGG
AAGCCACTTT CCAGCATTAA CACGCTGGCA TTTCACACGC CCTATCAGGT GGATCTTACT
CCTCTTCTGG CGCTGGGATC TAATGCTAAC GTCACAGTGA CAGTCTCTAA CTTAGGAACT
GCGAAGGAAC TAACGGGTAG TTCCTCCTTC GACTGGGACC TATCAGGGTT CCTGGCTCTG
TGGGTAAATC AGAGCAATCC TCTAATCTCT GGGCAGGTAG TGAAGGCCTA TACTAGGTTT
ATTGACTCCT CGCCCATCTT TGTTGGCGGT TTCTCAGGGG TTCATTATCA GGAGGGAGGT
AGCTACACCC TCACTTACTC CTCAATTCTA AGGTTCCTTC ACGGTACCGA GATGGCAACA
GTCTCCCAGA CAGGAAGATT CTACGCCTCC CAGACCTTCA ACAACATCTA TCAATTCGCA
TATCTGGACG AAACCTTCAA GGAAATTGCA AATGAGACCG GGTTCTACTC CTCCTCCATG
TATCTGGCGG GCAACTACCC CGTGACACTG CAGATTTCAG CGTTTGCCAC TCCAATAACC
TCACCTAACG TGATACCCTT CAATCTGTCA TATGCGCAGA ACGGATCCAT TCAACTGGGT
GCGAATTACC TGTACTCATT TAGCCTTAAC GGTTACGTCA CGAGGCAATC CCTTCAAGAG
AACTTGACCG CTCAGGGAGG TTTCTCTGGG ATAATTGAGG TGATCAATAG TTATGGCGGA
GCCGTTCTCG TTAAGCTCAC GTCCAATAAC GCCCTAACTC AAAAGTACCT CACCTTCATC
TATCAGGAAC CGGGTGTAAC CGAGTTCAGG GAAAACTTCT TCGCCATGGC CGGACAGAAC
AGCTCCGTGA ATGCTACGGG CTACTACCTG AAAATACAGA GGAGTTTTAC ACCACTAACT
GACCCTGCCT ACCAGGAAGT TGTAAGCTTC ACTGAAGATC ATCTTACTGT AGATCATCTG
GTGGCATATC ATCGAAGTAT TCTGGCGGAA CTTCCTCGCT TTGCTCTTCT TCCACATCCC
TCCCCTTTTT AG
 
Protein sequence
MKSWILLIVL VLGLLPGVGA FSSPTMPHNF TLYNITAMQG LDPKYYSFEA VGYLPPNETP 
VTVTVAKNLI FNDTAWKPYY LNVHIPNGSY SFILMNVSVK EENGTQFDRP FYIFVNGIPV
FWGSTQEIQN SSASVDLTMF ENLLHGNVTF EPVLVNFYDA KVNITGIYLV NITLSLYPGQ
APSNLPNEFI PLFVNGTFNY NYSYVILNPN QDTITSSVKL PNGTYRMSAF LYEEGGGLDE
FWYSNEPATR DILLYYDGLL GGVVPPYETI YTGGIDLFWW KPLSSINTLA FHTPYQVDLT
PLLALGSNAN VTVTVSNLGT AKELTGSSSF DWDLSGFLAL WVNQSNPLIS GQVVKAYTRF
IDSSPIFVGG FSGVHYQEGG SYTLTYSSIL RFLHGTEMAT VSQTGRFYAS QTFNNIYQFA
YLDETFKEIA NETGFYSSSM YLAGNYPVTL QISAFATPIT SPNVIPFNLS YAQNGSIQLG
ANYLYSFSLN GYVTRQSLQE NLTAQGGFSG IIEVINSYGG AVLVKLTSNN ALTQKYLTFI
YQEPGVTEFR ENFFAMAGQN SSVNATGYYL KIQRSFTPLT DPAYQEVVSF TEDHLTVDHL
VAYHRSILAE LPRFALLPHP SPF