Gene Mlab_0469 details


Gene Information

Locus tag: Mlab_0469
Symbol:
ID: 4795239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 446863
End bp: 448083
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID: 640099126
Product: hypothetical protein
Protein accession: YP_001029910
Protein GI: 124485294
COG category: [S] Function unknown
COG ID: [COG4260] Putative virion core protein (lumpy skin disease virus)
TIGRFAM ID:
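
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(446863, 448083)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```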


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.87951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCT TCTCAAAAAC AAACAAAACC ATCGGCTCCG GTTCGGATAT CGAGGGTGCA 
GAATCCAGAA AAGGCTTCTA CTGGGTCGAT GATCAGAAGG GTGACAACGT CATCTGGCGT
CTCCCAAGAA ACGTTATGTG GAACGACAAC GTGCTCGTCC GCGAGGATGA GTATGGTATC
TTCTTCCGGG ACGGGAAAGC TCTCGTCGTT TTCGACCGCC CCGACAGATA TGCCCTGACG
ACCGAGAACA TCCCGGTCCT GAAAAGTATT CTTGGAACTG TTGTTGGAAA CGTTCAGATC
GGAGAGTTCT ACTGGGCACA GAAGCGTGAG TTCCGGGATA AGTTCGGGAC TCCGCAGCCT
CTTGCATTCC GCGATGTGGA CTTCGGTGTT GTCCAGCTCA GAATCTTCGG TCAGTTCTCC
TACAAAGTTG TCGATCCGCT GCTTCTGATC ACCCAGTTCG TCGGAACAAA AGGCCTGACG
AAATCCGAGG AGATCGTCGA GTGGCTGAAA TCGCAGATCG TGATGATCTT AAACGATACC
CTCGGCGAGC TGAAAGCAAA GAAGCAGATG GGTGTTCTGG ATATGCCTGC ATATCTGCAG
GAGATCGAGC AGCTCTGCCT TGGCAAACTG ACGACCGAGA CAGAAGTGTA CGGTCTGAAG
ATCATGAAGT TTGCCGGCCT GAACATCAAC ATGCCCGAAG AGGTTCAGGA AGCGATCAAC
AAACGCGGAG CAATGTCTGC TCTGGGCGTG AACTATCTCC AGTATGAGTC CGGAAAAGCT
ATCGAAGGCA TCGGACAGGG AGCCGCCCAA GGCGGAGAAG GCTCCGGATT TGCCATGATG
GGTGCAGGAA TGGGCGCCGG AATGAGCATG GGCGGCATGA TGACCCAGAG CATGGCAGGT
GCAGGAGGCC AGCCGGCTCC CTTTGGCGGT CAGCCGGGAG CAGGCCAGGC TGCAGCACAA
CAACCGACCG GGAAAATGGA GACATGCAGC AACTGCGGAG CAAAGGTCCC GGCAGGCACG
AAGTTCTGCC CGGAGTGCGG CCAGAAGATG GTGCCTGCGG GCGGTTCAAC CTGCACAAAC
TGCGGAGCGA CTCTTGCACC AGGCGCTAAA TTCTGCCCCG AGTGCGGTAC AAAAGTCGAG
ACCATCAGGA GATGCCCGAA ATGCAATGCC GTGGTCCCTG CCGGAACAAA GTTCTGTCCT
GAATGCGGAC AGAAGCTCTA A
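
The gene sequence is printed in blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the values in the Gene Information section. The following is a minimal sketch of that step, demonstrated on the first two blocks only. The helper names are hypothetical, and how the database rounds its GC percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence above.
first_blocks = clean_sequence("ATGAGTTTCT TCTCAAAAAC")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.3
```

Passing the full listing through `clean_sequence` should give a string whose length matches the stated gene length, and `gc_fraction` of that string can be compared with the stated GC content.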
 
Protein sequence
MSFFSKTNKT IGSGSDIEGA ESRKGFYWVD DQKGDNVIWR LPRNVMWNDN VLVREDEYGI 
FFRDGKALVV FDRPDRYALT TENIPVLKSI LGTVVGNVQI GEFYWAQKRE FRDKFGTPQP
LAFRDVDFGV VQLRIFGQFS YKVVDPLLLI TQFVGTKGLT KSEEIVEWLK SQIVMILNDT
LGELKAKKQM GVLDMPAYLQ EIEQLCLGKL TTETEVYGLK IMKFAGLNIN MPEEVQEAIN
KRGAMSALGV NYLQYESGKA IEGIGQGAAQ GGEGSGFAMM GAGMGAGMSM GGMMTQSMAG
AGGQPAPFGG QPGAGQAAAQ QPTGKMETCS NCGAKVPAGT KFCPECGQKM VPAGGSTCTN
CGATLAPGAK FCPECGTKVE TIRRCPKCNA VVPAGTKFCP ECGQKL
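
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and it assumes the input contains only A, C, G, and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGTTTCT TCTCAAAAAC AAACAAAACC"))  # MSFFSKTNKT
```

The ten residues printed here match the first block of the protein sequence. Translating the full gene sequence in the same way can be used to check the rest of the listing.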