Gene Hlac_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2108 
Symbol 
ID7400628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2098409 
End bp2099479 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID643709178 
Productmethionine synthase 
Protein accessionYP_002566755 
Protein GI222480518 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0620] Methionine synthase II (cobalamin-independent) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.112744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0912449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGAA ACCCGTCTGC CAACCGCGAA CAGTTCCGCC CGCACGACCA CCCGAACGAC 
GCGTTCCTCC TGACGACCGT CGTCGGCTCG TACCCGAAGC CGAAGTGGCT CAACCGGGCC
GACGAGCTCG TCGACGACCC CGACTCGAAG TTCGATGTGT CGGACCTCGA AGAGGCCCAC
GACGACGCCT GTCGACTGAT CACCCACGAA CACGAGCGCG CCGGGCTCGA CACGGTGGTC
GACGGCGAGA TGCGCCGCAA CGAGATGGTC GAGTTCTTCG CCGACCGCAT CGACGGCTAC
GAGTTCAACG GCCCCGTGAA GGTGTGGGGT CACAACTACT TCGACAAGCC CTCGGTCGTC
GAGGAGGTCG AGTACGACGA GCCGTGGCTC GTCGACGAGT TCGAGTTCAC CTCGTCGGTC
GCCGAGCATC CCGTCAAGGT CCCGATCACG GGGCCCTACA CGCTCGGGTT CTGGGCGTTC
AACGAGGCGT ACCCCTCCAC CGAGGAACTC GTGTACGATC TGGCCGACCT CGTCAACGAG
GAGGTCGAGA AGCTCGTCGA GGCTGGCGCG CGCTACATCC AGATCGACGA GCCCGCGTTG
GCGACGACGC CGGAGGACCA CGCCATCGTC GGCGAGGCCC TCGAACGCAT CGTCTCGGGC
ATCGACGAGG AGGTCCGGAT CGGCCTCCAC GTCTGTTACG GCGACTACTC GCGGATCTAC
CCCGAGATCA ACGACTACCC GATCGACGAG TTCGACGTGG AGCTGTGTAA CGGCGACTTC
GAGCAGATTC CCACGTTCAC CGACCCCGAG TTCGAGCCCG ACCTCGCGCT CGGCGTCGTC
GACGCCCACA CGGCGGAGAT CGAGTCAGTC GAGGAGATCA AAGCGAACAT CCGGCAGGGC
CTGCGCGTCG TCCCGCCGGA GAAGCTCACG ATCTCGCCCG ACTGCGGGCT GAAGCTGCTC
CCGCGAGAGA TCGCGTACGG GAAGACCGAG AACATGGTCA CCGCGGCCCG CGAGGTCGAA
GCCGAGATCG ATTCCGGCGA GATCGACGTC GAGAACCCGC TCGACGACTG A
 
Protein sequence
MVRNPSANRE QFRPHDHPND AFLLTTVVGS YPKPKWLNRA DELVDDPDSK FDVSDLEEAH 
DDACRLITHE HERAGLDTVV DGEMRRNEMV EFFADRIDGY EFNGPVKVWG HNYFDKPSVV
EEVEYDEPWL VDEFEFTSSV AEHPVKVPIT GPYTLGFWAF NEAYPSTEEL VYDLADLVNE
EVEKLVEAGA RYIQIDEPAL ATTPEDHAIV GEALERIVSG IDEEVRIGLH VCYGDYSRIY
PEINDYPIDE FDVELCNGDF EQIPTFTDPE FEPDLALGVV DAHTAEIESV EEIKANIRQG
LRVVPPEKLT ISPDCGLKLL PREIAYGKTE NMVTAAREVE AEIDSGEIDV ENPLDD