Gene Hmuk_2135 details


Gene Information

Locus tag: Hmuk_2135
Symbol:
ID: 8411673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2042776
End bp: 2044518
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 68%
IMG OID: 645020476
Product: hypothetical protein
Protein accession: YP_003177955
Protein GI: 257388182
COG category:
COG ID:
TIGRFAM ID: [TIGR02537] archaeal flagellin N-terminal-like domain
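
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only and assumes that both coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2042776
end_bp = 2044518

# Assumption: both coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, leftover = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover, protein_length_aa)
```

Under these assumptions the result is 1743 bp with no leftover bases and 580 aa, which agrees with the Gene Length and Protein Length fields.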


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.11758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCGGG GCCTCGGCCG GCGCGGGCAG TCGTCGCCGA TCGCCGTGAT CCTCCTCGTC 
TCGATGGTCG TCGCCGGCTC GCTGGCCGTC GTGACGCTGG GTGCCCAGTC GCTGTCTGAC
ACCCGAGAGA CGATGGACGT CGAGCGCGCC GAGAAGGGCC TGACACAACT GGACTCGAAC
GTCGCGATGG TCGCGCTGGG GAGTGCCGGC GGACAGGAGC TCTCGCTGTC GAGGACGGAC
GGGGCCGCCT ATCGGCTCCG GGACGACGCG GGGCGGATGA CCGTCGCGGT GACGAACACG
TCCAACGACT CCACGAAGAC GGTGATGAAC GCGACGCTCG GCTCGATCGC CTACGAGAAC
GACGGCCGCT CGGTCGCGTA CCAGGGCGGC GGCGTCTGGA AGACGGACGG CGACGGCGGC
TCGCTGATGG TCTCGCCGCC GGAGTTTCAC TACCGGGACG CGACGCTGAC GCTCCCGCTG
GTGACCGTCT CCGGCGACGA GTCGCTCGAC GGCCGGATCG CGGTTCGGCC GGGCGGTCGC
TCGACGCAGC ACTTTCCGAA CGCCTCGGCC GACGAACAGT GGGTGAACCC GCTCGACGGC
GGCCGCGTCA ACGTGACCGT CAGGAGTGAG TACTACCGGG CGTGGGGCCG ATTCTTCGAA
GAGCGTACGG ACGGCGAGGC CACGCTCGAT CACGCGAACG AGACGGCGAC GGTGACACTC
GTCGTCCCGG CCGGGCCACA GACCGTCACG AACGCCGTCG CGGCCACGTC AGCGGGCGGC
GAGATCGTAC TGGCCGGAAG CGGCGATCAG ACCCGGACCG ACAGCTACAA CTCCTCGAAA
GGGACCGGGC TGTACGCCGA CACGAAGACG CACAACGGGT CGATCCGGAC GGCCGGCGAC
GTGACGGTCA AGGGCAACAG CCAGGTCAAC GGCTCGCTGG CGTCGGGAGG CAAGGTGACC
GTCAAAGGGA GCGGTGTGGT GACCCGAGAC GCCGGTTACA CCGACGACAT CAAGGTCACC
GGCAGCGGCG GTGTCGACGG CTCGATCGAA CAGCTCTCGG GCGTCGACGG GATCGGTCCG
ATCGACGCCG TCGTCGATCG ACGGTACGAG AACGCGACCG GGGACAACGA CAACGGCGAC
ACGAGCGCGA TCACGGGGAC GACGCTGAGC GACGGCGACC AGACGCTCTC GGCCGGCGAG
TACCACCTCG ATCGGCTCGT TCTCGACGGC GAGACGCTGA CGCTGGACAC CGGGACCGGC
GGGACGATCA GTCTCGCGGT CCGTGACTAC GTCCAGCTGA AGAACGACGG CCGAATCCAC
GTCGTCGGAA ACGGGACGGT CCGCCTGTAC GTCGACGGAC AGGCGACGAC GGCGTCGAAC
CACCACTTCT CGATCGAGGG AAGCGGCGGC CAGATCGACA TCGACGAGGG GCAGAACGCC
TCCCAGTTCT GGCTCTACGG CCGCGAGGAC TTCCAGGGAC GGATCGACGG GACCTCCAGT
GACACCCATC TGTTCGAGGG CGTCGTCTTC GCACCCGGCG GCACGCTCGG CAGTAGCTCG
TTCACGGTCG AGAAAGGGAG CCTCTACGGC GGTGTCGTCA CCGGGAGCGT CACGATGGAC
AACGGCGGAC AGGTCCACTA CGACCGGTCG CTGAAGCGGG TCAACGCCGT CCCGCCGGCC
GAGAACATCG TCCGACTGAC CTACCTCCAC GTCTCGGAGA TCGAGATCGA AGCTCGCGAC
TGA
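
The gene sequence is printed in space-separated groups of 10 bases, so it has to be joined into one string before any calculation. The sketch below shows one way to do that and to compute a GC fraction like the one behind the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the way the database itself rounds the value is not known from this record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGTACCGGG GCCTCGGCCG GCGCGGGCAG TCGTCGCCGA TCGCCGTGAT CCTCCTCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence block instead of `first_line` gives a value that can be compared with the listed GC content.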
 
Protein sequence
MYRGLGRRGQ SSPIAVILLV SMVVAGSLAV VTLGAQSLSD TRETMDVERA EKGLTQLDSN 
VAMVALGSAG GQELSLSRTD GAAYRLRDDA GRMTVAVTNT SNDSTKTVMN ATLGSIAYEN
DGRSVAYQGG GVWKTDGDGG SLMVSPPEFH YRDATLTLPL VTVSGDESLD GRIAVRPGGR
STQHFPNASA DEQWVNPLDG GRVNVTVRSE YYRAWGRFFE ERTDGEATLD HANETATVTL
VVPAGPQTVT NAVAATSAGG EIVLAGSGDQ TRTDSYNSSK GTGLYADTKT HNGSIRTAGD
VTVKGNSQVN GSLASGGKVT VKGSGVVTRD AGYTDDIKVT GSGGVDGSIE QLSGVDGIGP
IDAVVDRRYE NATGDNDNGD TSAITGTTLS DGDQTLSAGE YHLDRLVLDG ETLTLDTGTG
GTISLAVRDY VQLKNDGRIH VVGNGTVRLY VDGQATTASN HHFSIEGSGG QIDIDEGQNA
SQFWLYGRED FQGRIDGTSS DTHLFEGVVF APGGTLGSSS FTVEKGSLYG GVVTGSVTMD
NGGQVHYDRS LKRVNAVPPA ENIVRLTYLH VSEIEIEARD
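
The protein sequence can be reproduced from the gene sequence with the listed translation table. The following is a minimal sketch, not the database's own method. It assumes the codon assignments of NCBI translation table 11, which match the standard code for internal codons, and it ignores the alternative start codons of that table because this gene begins with ATG. The function name `translate` and the use of "X" for unrecognized codons are illustrative choices.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Assumption: codon assignments of NCBI translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a whitespace-grouped DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTACCGGG GCCTCGGCCG GCGCGGGCAG"))
```

For the first 30 bases this yields MYRGLGRRGQ, the first group of the protein sequence above.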