Gene Hhal_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1867 
Symbol 
ID4711200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2041308 
End bp2042378 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID639856339 
ProductDNA-directed DNA polymerase 
Protein accessionYP_001003433 
Protein GI121998646 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTGGC GCAAGATCAT CCACATCGAC ATGGACGCCT TCTACGCCTC GGTGGAGCAG 
CGCGACGACC CGAGCCTGCG CGGTCAACCC GTGGTGGTGG GCGGCTCCCC CGAAGGACGG
GGCGTGGTCG CCGCGGCGAG TTACGAGGCG CGGGCATTCG GCATCCGCTC CGCGCAGCCC
GCCGCCTGGG CCCGGCGCCG CTGTCCCGAG GCCATCTTCC TGCGCCCCCG GTTTGACCGC
TACCGCGCCA TTTCCCGGCA GATTCACGGC ATATTCGCCG ACTTCGCCAC CACCATCGAA
CCCCTATCCC TCGACGAGGC CTATCTGGAT GTCACCGGCT CGCAACGGTT CCGCGGCTCG
GCCACCCACA TGGCCCAGGC AATTCGCCGG CGCATCCGCG AGGAGACCGG GCTGACGGCC
TCGGCGGGGG TGTCGTACAA CAAGCTACTG GCCAAACTTG CCTCCGACGA AGGCAAACCC
GACGGCCTCT ATGTGGTCCC GCCCGAGGAC GGGCCGGCGT ACGTCGCGGC GCAGCCGATC
CGCCGTCTTC ACGGGGTCGG CCCCGCCACG GCGGCGCGCC TGGAGCGCCT GGGGATTCGG
CAGGTGGGCG ACCTGCTGGA CTGGGAACTC GCCGACCTGC ATGTGTTCCT GGGCAACCGC
GCCGGCACCC TGCACGACGC CGCCCGCGGT ATCGATCACC GGCCCGTCCG GCCGCGCCGC
TCGCGCAAAT CCATCGGCGC CGAGCGGACC TTCGGCGATG ACACCCGGGA TCTTGGGGAA
ATCCACCAGC GACTGGCGCC ACTGATCACC AAGGTGGCCA CCCGGCTGGA GCACCACGAG
CTGGTCGCCC GCACGGTGAC CCTGAAACTC CGCTACGCCG ACTTCGAGTC GATCACGCGC
CGGGTCTCGC CCCCCGGCCC GGTGGCACAG GCCGCGGACA TCGAGGCCCT GATCCCCGCC
CTGCTGGCCG AGACCGAGGC CGGCTCCCGG CCGGTGCGGT TGCTCGGCGT CAGCCTGTCG
GGACTGCAGC CGAAGCAGCG CGAACAGGAT CTGTTCAGCG CACTCACCTG A
 
Protein sequence
MTWRKIIHID MDAFYASVEQ RDDPSLRGQP VVVGGSPEGR GVVAAASYEA RAFGIRSAQP 
AAWARRRCPE AIFLRPRFDR YRAISRQIHG IFADFATTIE PLSLDEAYLD VTGSQRFRGS
ATHMAQAIRR RIREETGLTA SAGVSYNKLL AKLASDEGKP DGLYVVPPED GPAYVAAQPI
RRLHGVGPAT AARLERLGIR QVGDLLDWEL ADLHVFLGNR AGTLHDAARG IDHRPVRPRR
SRKSIGAERT FGDDTRDLGE IHQRLAPLIT KVATRLEHHE LVARTVTLKL RYADFESITR
RVSPPGPVAQ AADIEALIPA LLAETEAGSR PVRLLGVSLS GLQPKQREQD LFSALT