Gene Msed_1419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1419 
Symbol 
ID5104790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1385892 
End bp1387136 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content47% 
IMG OID640507308 
Productnucleotidyl transferase 
Protein accessionYP_001191501 
Protein GI146304185 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0198372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00414302 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAATGGT CTCCTGAGGA TATAAAGGTT ATCATTCCCA TTGGGGGAGA AGCAACGAGA 
ATGCGCCCTT TAACCGTGGA AACCTCGAAA GCGACCGTGA GGCTTCTGAA TAGACCCCTC
CTCGAGTTTC CGATTCTCGA GCTCGCAAAG CAGGGAGTAA AGGAGTTCAT TTTTGGCGTT
AAGGGGTACG TTAATTACAA GTCTCTCTTT GACACCTTCA AGGAGGGTAT AGGATTTTCA
GCTAGGTACA GGATTAAACC AAGGGTGCAC TTCAAGTATC AACCTAGAGT CGACAGCGTT
GGTAACGCGG ACTCCGTAAG GATCAACATG GACTATTACA GGATAGATGA CATCACGCTC
GTGATCCAGG GAGATAACCT GATCAAGCTG GACCTAAAGA AGCTAGTGGA CTATCACCTG
TCAAAGGGAG CGATAATGAC TATCGTGCTC AAGAAGTGGC ACGACGTGAG GGAATTCGGG
GTTGCGGACC TTGGGGAAGA CATGAAAATA AGGAAGTTCG TTGAGAAACC CAAGGAAGGA
GAGGCTCCAT CTAACCTGAT CAACACGGGA GTCTACGTCT TGTCTCCTAA GATAAGGGAT
ATCTTCGCGA GTGATGAGGT TTCGGCCATG AGAGAGGAGG GAAAGATGGA CTTTGGGAAG
GACATAATTC CCTGCCTAAT CCAGAAAGGC TACCCAGTTT ACGGTTACGT TACTGACTCT
CTCTGGTTTG ACGTCGGGAC TCCTGAGAGG TACTTGGAAG CCATGAGGGT ACTTCTAGAA
AGCCTGGACG AACATGAAAT GGGTGGGAAG AGGATAGACC AGTCCAAGAG GATATTTGTC
CAGGGAACGA GCCCTGACTC CATAAGGAGG AGAAACGTGA TAGCCATGAA GTATAGAAAG
GGTAGGATGA AAATAGAAGG GAGCGTGCTC ATAGGAAGGC ACTGCCAGAT CGGGAACAAC
GTTTACCTAG AGAACTCCAC GATTGACAAC TTCTCGATCC TAAGAAATAA CGTCAGGGTT
GTGAGGAGCT CCATCATGGA CAGGGCATTC ATTGGCGAAG GAGTAGTGAT AGAGAACTCG
GTCATAGCTA GACATGTGGA AATTAGGGGA GGGGCTAGGA TAATTGGGAG TGTCATAGGT
GACGATGTGG TGATAGATGC TGACACTGAG ATAGTGAACT CGAAGATATA TCCGCACAAA
GTTATTAACG CGAATAGTAA AATACACGAT ACTGTACTGA CTTAA
 
Protein sequence
MQWSPEDIKV IIPIGGEATR MRPLTVETSK ATVRLLNRPL LEFPILELAK QGVKEFIFGV 
KGYVNYKSLF DTFKEGIGFS ARYRIKPRVH FKYQPRVDSV GNADSVRINM DYYRIDDITL
VIQGDNLIKL DLKKLVDYHL SKGAIMTIVL KKWHDVREFG VADLGEDMKI RKFVEKPKEG
EAPSNLINTG VYVLSPKIRD IFASDEVSAM REEGKMDFGK DIIPCLIQKG YPVYGYVTDS
LWFDVGTPER YLEAMRVLLE SLDEHEMGGK RIDQSKRIFV QGTSPDSIRR RNVIAMKYRK
GRMKIEGSVL IGRHCQIGNN VYLENSTIDN FSILRNNVRV VRSSIMDRAF IGEGVVIENS
VIARHVEIRG GARIIGSVIG DDVVIDADTE IVNSKIYPHK VINANSKIHD TVLT