Gene Huta_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1887 
Symbol 
ID8384178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1894137 
End bp1895147 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content69% 
IMG OID644972955 
Productorc1/cdc6 family replication initiation protein 
Protein accessionYP_003130789 
Protein GI257052956 
COG category[L] Replication, recombination and repair
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1474] Cdc6-related protein, AAA superfamily ATPase 
TIGRFAM ID[TIGR02928] orc1/cdc6 family replication initiation protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAGG ACGCCCGCGT GCTCCGCGAG GAGTTCGTCC CGAACGACGT CGTCCACCGC 
GACGGCGAGG TCGACGCGCT CTCGGCCGTC CTCGAACCCG TCGTCGAGGG CGAACCGCCC
GAGTCCGCGC TGCTCACCGG CCCCTCGGGA GCCGGCAAGA CCACCATCGC GAAGTTCGTC
GTCGGCCGCC TCCGGGAGAC CGCCCTCGAC GTCGAGGCGA TCCACGTCAA CTGCTGGCAA
TCGTACACTC GCTTCAAAGC CCTCTACCGG ATTCTCGAGG GCCTCGGCCG GACGATCGAC
GTCCACCGCC AGTCGACGCC CCACGACGAA CTCCTCGATC GGCTCGAAGC CTACGACGGC
CCGCCCGTCA TCGTCACGCT CGACGAGGTC GACCAGCTCG AGGACGGCCA CCTGATCTAC
GACCTCTACC GCCTCCCCGC GTTCGCGGTC GTCCTGATCA CCAACGACGA GGAAGAGCTG
CTGGCCGGCC TCGACGAGCG CGTCCGGTCG CGGCTTCACA CCGCCGAGAC GATCCATTTC
GACCGCTACG ACGTCGAAGA GCTGACCGAC ATCATGGCCG ACCGCGTCGA CCACGGGCTG
GCTTCGGGGG CCGTCGACTT CGACCAGCTC CGGTGGATCG CCGACGCCGC CGCCGGCGAC
GCCCGCGTCG GGTTGAGTAT CCTCCGGAGC GCCGCACGGC GGGCCGACCG CGACGGTGCC
GATGCTATCG CCGCGTCCCA CATCGAGGCC GCGATCCCCG AAGCCCGCCG GGAAGTCCGG
TCGCGGGCCC TCGACGCACT GCACAAGGAG CAACGGAAAG TATTCGAGAT CCTCCGGGAG
AGCGACGGGC TCCCGCCGCG GGAGGTCTAC GATCGGTACG TCGCGGCGGT CGAGGATCCC
CGGACGAAGC GGACGGTCCG GTCGTGGCTC CAGAAAATCG AACAGTACAA CCTGGTCGAG
GCCGACGGGA GTGGCCCGAC CCGGACGTAT CGCGTCATCG CCGAGGAGTA G
 
Protein sequence
MIEDARVLRE EFVPNDVVHR DGEVDALSAV LEPVVEGEPP ESALLTGPSG AGKTTIAKFV 
VGRLRETALD VEAIHVNCWQ SYTRFKALYR ILEGLGRTID VHRQSTPHDE LLDRLEAYDG
PPVIVTLDEV DQLEDGHLIY DLYRLPAFAV VLITNDEEEL LAGLDERVRS RLHTAETIHF
DRYDVEELTD IMADRVDHGL ASGAVDFDQL RWIADAAAGD ARVGLSILRS AARRADRDGA
DAIAASHIEA AIPEARREVR SRALDALHKE QRKVFEILRE SDGLPPREVY DRYVAAVEDP
RTKRTVRSWL QKIEQYNLVE ADGSGPTRTY RVIAEE