Gene Hhal_1692 details


Gene Information

Locus tag: Hhal_1692
Symbol:
ID: 4710036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1846822
End bp: 1848102
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 65%
IMG OID: 639856159
Product: hypothetical protein
Protein accession: YP_001003258
Protein GI: 121998471
COG category: [S] Function unknown
COG ID: [COG2718] Uncharacterized conserved protein
TIGRFAM ID:
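
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1846822, 1848102)
print(length_bp)                           # 1281
print(expected_protein_length(length_bp))  # 426
```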


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTCAATA TCATCGACCG GCGCCGCAAT CCGAAGGGCA AGAGCCTGGC CAACCGCCAG 
CGTTTTCTCC GGCGTGCCAA GCGCCAGGTT CTCGATGCAG TCAACGAGGC CTCCGGCCGG
CGCAAGGTGA CCGACGTGGC CGGCGGGGAG CAGATCACGA TCCCCGCCGA CGGGCTCCAC
GAGCCGAGCT TTCGCAAGGC GCCCGATCGC GGGGTGCGTG AGCACGTGGT CCCTGGTAAC
AAGGAGTACG TGGTCGGCGA TACCATCTCG CGCCCCGAGG GGCAGGGGGG TAGCGGGGGG
CGCGAGGGTA GCCCCGATGG CGAGGGGGAG GACGAGTTCA CCTTCGCCGT CAGTCGCGAC
GAGTTCCTCG ATCTCTTCTT CGAGGGGCTG GAGTTGCCGG ATCTGGTCAA GCGCCAGGTG
AAGAAGACCG AGCAGTACAC CCAGCAGCGG GCAGGGTACT CGGTCAGCGG CTCCCCTTCG
AACCTCAACG TCGAGCGGAC CATGCGCAAT TCCCTGTCGC GGCGCATCGC CCTGCGTCGG
CCGAAGGGGG AGGACCTGCA GGCACTGGAT GAGGAGATCG ACCGCCTGGA GCGTTCCGGA
GCTGAGCCCG AGCGGCTGCG TGAGCTCATC GAGCTACGCC GGAACAAGCA GGAGCGGTCC
CGGGCCATCC CGTACATCGA TCCGGTGGAT ATCCGTTACA ACCGCTTCGA CCACGTCCCG
CAGCCGATCT CCCAGGCGGT CATGTTCTGC CTGATGGACG TCTCGGGCTC CATGACTGAA
GAGATGAAGG ATCTGGCCAA ACGGTTCTTC ATGCTCCTCT ACCTCTTCCT CGAACGCCGC
TATCGGCATG TGGATATCGT CTTTATCCGC CACACCCATA TCGCCCAGGA GGTTGACGAG
GACACCTTCT TCTACTCCCG CGAGACCGGT GGGACGCTGG TCTCCCCGGC GCTGGCGATG
ATGCGCGACA TCGTTGACGA TCGTTATCCG GTCCAGGACT GGAATATCTA CGGAGCGCAG
GCCTCGGACG GCGACAACAC CCCGGCGGAC AATCCGGCCA CCACGCGGTT GATGGCCGAC
GGCATCCTCC CGCTGTGCCA GTACTTTGCC TACATCGAGG TGGGGGGTGG GCAAGCCTTC
CACGTGCCGT CCGATTTGTG GCGAGCCTAC GATCGCTTGG CTCGGGGGGA GTCGCCCCTG
GCCATGCGGC GGGTACAGAC CCGTGGCGAC ATCTTCCCGG TCTTCCGGGA TCTCTTTACG
CCGGCTGAGC TGAAGGCCTG A
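
A GC content figure such as the one reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in whitespace-separated blocks of ten bases. The helper name `gc_fraction` is illustrative, and the example runs on the first two blocks only, not on the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two 10-base blocks of the gene sequence above: 9 of 20 bases are G or C.
print(gc_fraction("TTGGTCAATA TCATCGACCG"))  # 0.45
```

Passing the full gene sequence to the same function gives the value for the whole gene, which can then be compared with the reported GC content.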
 
Protein sequence
MVNIIDRRRN PKGKSLANRQ RFLRRAKRQV LDAVNEASGR RKVTDVAGGE QITIPADGLH 
EPSFRKAPDR GVREHVVPGN KEYVVGDTIS RPEGQGGSGG REGSPDGEGE DEFTFAVSRD
EFLDLFFEGL ELPDLVKRQV KKTEQYTQQR AGYSVSGSPS NLNVERTMRN SLSRRIALRR
PKGEDLQALD EEIDRLERSG AEPERLRELI ELRRNKQERS RAIPYIDPVD IRYNRFDHVP
QPISQAVMFC LMDVSGSMTE EMKDLAKRFF MLLYLFLERR YRHVDIVFIR HTHIAQEVDE
DTFFYSRETG GTLVSPALAM MRDIVDDRYP VQDWNIYGAQ ASDGDNTPAD NPATTRLMAD
GILPLCQYFA YIEVGGGQAF HVPSDLWRAY DRLARGESPL AMRRVQTRGD IFPVFRDLFT
PAELKA
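
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a hand-built codon table. It assumes the standard amino acid assignments (which table 11 shares with table 1) and recognises only a subset of the table 11 start codons (ATG, GTG, TTG). All names are illustrative.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common table 11 initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate_cds("TTGGTCAATA TCATCGACCG GCGCCGCAAT"))  # MVNIIDRRRN
```

The same function applied to the last 21 bases of the gene, `CCGGCTGAGC TGAAGGCCTG A`, yields `PAELKA`, matching the end of the protein sequence, with the terminal TGA read as a stop.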