Gene Hhal_0020 details


Gene Information

Locus tag: Hhal_0020
Symbol:
ID: 4710198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 21085
End bp: 22296
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 64%
IMG OID: 639854476
Product: hypothetical protein
Protein accession: YP_001001617
Protein GI: 121996830
COG category:
COG ID:
TIGRFAM ID:
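
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes one terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 21085
END_BP = 22296
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 22296 - 21085 + 1 gives 1212 bp, and 1212 / 3 - 1 gives 403 aa, which agrees with the listed lengths.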


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCAA GCGCTATAGC CGGGTGTTGT AACAAGCTCA GGCACAGGAT TGTGGCCGGG 
GCGGCCCTGG TCACAGCGGG GAACGCTGTG GCCGGGTCGA CCCTGCCCGA ATCCGGTATC
CTCGACCCTG GACAGATCGA TGTCAGGGGC ACAGTGGGCT TGAGCTACTT CGGGGCCGGT
CAGGGCACCA CTATGGATGT TCTGCCCTCC CTGCGTACGG GGCTCCCCGG TCCGTTCGAT
GTGGCGGTTA CGGTGCCTTA TCGAAACGAT CTTGAGCAGG AGCAGTACGC CCTGCGTAGC
TCATTCCGCA CCGATTTCTC CTACCGGTTC CTCGATGATG GCCCTCGCCA GGCGGTTCTG
GGCGGCTACA TGACCTTCGA TCCCTCGGAT GCTGGGGAAG GTGTAGGGAG CGGGTCGCAC
AACTATGGGG TCAGCGCCGA CTACCGCGCA GAGGACATCG TGGGCACCGG GACCTTCTAT
CTGCGCGGGG CCGTGGAACG GCTGGACCAT CGCGACGATC CCGGCGCCGA TGAGGTCTCC
TACCGGCTCG CCAATCGCCT CACCGCCGAG ATCGGTCTAG GGCTGGATGT GGATGTCGAC
GCCGAGCCCT ACTTCGGTCT TCGAGGTACC CAAGGGCTAG GCTCGACGAC ACGTGACCAG
CAGAGCTTGT CCTTTCGTCC GGGCATTCGG TTTCGGTACA CCCCTAACAG CGAGGTCCAG
TTCCTGGCGC AATTCGATAC GGTACAGCGT AACGCCGAAC CGGAGCGGGC GATCTTCGTG
ACCTGGACCT ACCAGCACCG CCCTCCGGAG CGTGATCTGG ACAGCCTGCG GGAGCGTATC
TCCGCCAATG AGATGGCCAT CGAGCGCCTG GATCGACGGG TGAGCGACAT CGAACGCCGG
CTGTTAACAC GTACCGAGGT ACCGGAACCG GAGACGAGGG AGGGGGTGGT CGTGCTCAAT
CACTCCGGGA TCCCGGAGTT GACCACCTTG GTGGTGGACA CCCTGGAGAA CCTCGATCTC
TCTGTCGGGG ACACGCGTGA CGAAGACGAC GTCGCCCGTC GGGATCGCAC CAAGATCCTC
TACCGGCCGG GGCATGCGGA GCGGGCTCGG GAGATTGCCC GCGCGCTGCC GGGGAATCAG
CTGATCGAGC AGCGTGATGA CATGCCGAAT CAGGCCGAGA TCGCCGTCCT GATCGGCTTC
GATCTGGAGT GA
 
Protein sequence
MKASAIAGCC NKLRHRIVAG AALVTAGNAV AGSTLPESGI LDPGQIDVRG TVGLSYFGAG 
QGTTMDVLPS LRTGLPGPFD VAVTVPYRND LEQEQYALRS SFRTDFSYRF LDDGPRQAVL
GGYMTFDPSD AGEGVGSGSH NYGVSADYRA EDIVGTGTFY LRGAVERLDH RDDPGADEVS
YRLANRLTAE IGLGLDVDVD AEPYFGLRGT QGLGSTTRDQ QSLSFRPGIR FRYTPNSEVQ
FLAQFDTVQR NAEPERAIFV TWTYQHRPPE RDLDSLRERI SANEMAIERL DRRVSDIERR
LLTRTEVPEP ETREGVVVLN HSGIPELTTL VVDTLENLDL SVGDTRDEDD VARRDRTKIL
YRPGHAERAR EIARALPGNQ LIEQRDDMPN QAEIAVLIGF DLE
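
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is read in frame from its first base, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as start codons). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
# Compact codon table: bases in TCAG order, amino acids in matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGAAGGCAA GCGCTATAGC CGGGTGTTGT"))  # MKASAIAGCC
    print(translate("GATCTGGAGT GA"))                     # DLE
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence; the final TGA codon is the stop and is not included in the 403 aa protein.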