Gene Huta_1703 details

Gene Information

Locus tag: Huta_1703
Symbol:
ID: 8383989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1696248
End bp: 1697513
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 49%
IMG OID: 644972770
Product: McrBC 5-methylcytosine restriction system component-like protein
Protein accession: YP_003130609
Protein GI: 257052776
COG category: [V] Defense mechanisms
COG ID: [COG4268] McrBC 5-methylcytosine restriction system component
TIGRFAM ID:
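
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1696248, 1697513)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both results match the Gene Length (1266 bp) and Protein Length (421 aa) fields listed for Huta_1703.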


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCTCT CTTTCAATCG TAACCAGGAG GAGTTTGAAG CCGACATCCA GTTGGGAGAA 
TACGAATCCA GCGAACCAAT TGAACTCTCA GAGATGGCCG TATCGATGCT GGAAAACGAA
GTGAATAGCG GAGAGGAAAA AGAGGGCGAC CGTATCAAGT TGCATTACAA CCGAGATGGG
GAAGCAATAC TCACTTCGAC CCAGTACGTT GGAGTCGTTT CGTTGAGAGA TGGACCCACT
ATTGAAGTCC GCCCGAAAGC GGCCGGCACA AACCTCCTGT ATCTTCTCCA ATACGCTCAT
GACACGACTG CGACCACGTT CGAATCACAG GCTCCGTATC AAGCAGGTCA CACTTTTCTC
GATGCATTTG GTGCACTCTA CGAAGCGGAA TTGCGGAGAA TTGTAGATCG AGGACTCTAC
ACGGACTACC GAAGAACCGA CGCTACCGAG TCTCATCTTC GCGGACGACT CGATATCCAT
CGCCAGCTAC AGCGACAACC ACCAGTTCCT ACTGCGTTTG AATGTACTTA CGACGAATTG
ACTCATGATA TTCTGGCGAA TCGAGCCATC CTACATGCTA CCACTGTCTT GCTAGGGGCG
GTCTCAGACC GTTCAATAAC CCAGTCGCTT CGTCAACATC AACAGTTGCT TCGCCGTCAG
GTTTCCCTTA CGCCTGTGAC GATACAGGAC ATAGAGCGTA TTGAACTCAA TCGTCTTGCT
GACCACTACG AGGACATTCT CCGACTTACT AAATTGGTGA TTAGGAACTC ATTCGTGTCG
GAACTCCAAG CCGGCTCGAG TGCGGCGTTT GCGATGTTAG TAAATATGAA TACGATATTC
GAGAACGCAG TTGAGCGTGC CTGTAAAGAA GTTCTGTCAG AGCGCGAAGA TTGGGAAGTG
AAATTCCAGG ATACGTCACA GAACTTAATC ACTGGCGGAA AACACACAGT GACACTTCAG
CCCGATATTA CGATATATGA CCCGGAAAAT ACGGTATCAC TCGTTGCTGA TGCGAAATGG
AAGAATGAGA GGCCGAAAAA CGCCGACTTT TACCAGATGA CGTCATACAT GCTCGCCAAC
AACGTACCGG GAATACTATT TTACCCCGAT TGTGGTGGAC TCAATGAGTC ACGTTCGACT
GTCACTGGTG GATTCCCCCT TTGGCTATCT GAACTACCTA CTGCTGTCCA AGTGAATTCC
TACGAAGATT TCGTCTCAGC TTTTGAGTCC GAAACGGCGG ATGCAATTTT TGGAATGGTG
GATTAG
 
Protein sequence
MSLSFNRNQE EFEADIQLGE YESSEPIELS EMAVSMLENE VNSGEEKEGD RIKLHYNRDG 
EAILTSTQYV GVVSLRDGPT IEVRPKAAGT NLLYLLQYAH DTTATTFESQ APYQAGHTFL
DAFGALYEAE LRRIVDRGLY TDYRRTDATE SHLRGRLDIH RQLQRQPPVP TAFECTYDEL
THDILANRAI LHATTVLLGA VSDRSITQSL RQHQQLLRRQ VSLTPVTIQD IERIELNRLA
DHYEDILRLT KLVIRNSFVS ELQAGSSAAF AMLVNMNTIF ENAVERACKE VLSEREDWEV
KFQDTSQNLI TGGKHTVTLQ PDITIYDPEN TVSLVADAKW KNERPKNADF YQMTSYMLAN
NVPGILFYPD CGGLNESRST VTGGFPLWLS ELPTAVQVNS YEDFVSAFES ETADAIFGMV
D
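
As a quick consistency check between the two sequences, the first line of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch: the helper names are hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in the start codons they permit, which is not modelled here).

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGTCCCTCT CTTTCAATCG TAACCAGGAG GAGTTTGAAG CCGACATCCA GTTGGGAGAA"
print(translate(first_line))  # MSLSFNRNQEEFEADIQLGE
print(translate("GATTAG"))    # D*
```

The first output matches the opening twenty residues of the protein sequence, and the final six bases of the gene encode the terminal aspartate followed by a stop codon.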