Gene Hhal_1844 details


Gene Information

Locus tag: Hhal_1844
Symbol:
ID: 4711341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2015808
End bp: 2017667
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 72%
IMG OID: 639856315
Product: ankyrin
Protein accession: YP_001003410
Protein GI: 121998623
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
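
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2015808, 2017667)
print(length)                  # 1860
print(protein_length(length))  # 619
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.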


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCCGAA CTCCCCACCG CGTCTTTGCC TTGCTGACCG CCGGGGTGGC TCTGGCCTCG 
TTGCTGACCG CCTGCAGTGG CTGGCCGTTG ACCCCGGCGC AGCAAGACGC CTTCAACGCG
GCCCGGGACG GCGATGTGGA CGCCCTGGAG CGGGCCCTGG ACGGGCCGGC CAAGCTGCAG
GCCACCGACG GGGCGGGGCG CGGTCTCCTG CACTACGCGG CGCTCAACGG TGAGGTGGCC
GTGGTGGACC TGCTCATCGA GCGCGGTGCC GGTGATGCTC CGGATGTCCG GGACGATGCC
GGCCGTACCC CCCTGCACGA GGCGGCTGCC GGAGGGCACG CCGAATCCGT CGGGTCGCTG
CTGGGGGCCG GGGCGGCGGT GGATCCCGTC GATGCAGAGG AGCGCAGTCC GCTGATGCTG
GCCGCCCGCA GTGGCGATCT GCAGGCGGTG GACGCCCTGC TGGAAGCCGG GGCCGAGCCG
GACCGCCGCG ACGCCGACGG GGCCACCGCG CTCTCCATGG CGTCCGCAGC CCGCTATCCC
CGCGTTGCGG CTCGGTTGCG TGAGGTCGGT GAGCGTCCGG CGGCCCCGTG GGCCCTGGTC
GGCGTAGTGC GTGCCGGTGA TGTCGAAACC CTGGCATGGA TGCTCGATCG TGGCTTGCCG
GTGGATGTCG CCTGGCAGGA CGGCGATAAC TTCCTCTCCC TCGCGGATAT CGCCGCGGAG
TGGAGCCCGC GCGCAGCGGC GCTGCTGCTG CGCCGTGGTG CTCAGCTCCA CGGCGAGGGG
GCGGCGGCCC TGGTCGCCGC GGCGCTCCAC GGCGATGCCG TGCAGGTGGC GGAGCTCCTG
GAGCAGGGAG TCGATCCCGA CGTCAACGTG CGCGGCGGCG ACGCCGAGGG GCGCACGCTG
CTGATGGAGG CGGCTCGGCT CGGCTATCGA GAGGTGGTCG AGGACCTGCT GGCGGCCGGT
GCGGATGCCG GCAGCGTGGA TGAGTACGGC GCCGATGCGC TTTGGCACGC TGCGGTCGGC
TACGTCTCTG CCGGTGACGA GCGGGCCGGG CTGGAGCGCG ACGCGGACCG CAGCGAGACC
GTGCAGGCCC TGCTGGATGC GGGCGTGGCC CCCGGCGGCG GTGACCGACA CGACTTTGCC
GCCATCCATG CAGCGGCGGC CTGGGCCGAC GCCGAGGCCG TTGAGGCGCT GCTCGATGCC
GGCGCGGGCC TGGAACAGAC TGATCGCAAT GGGCGCACGC CGTTGATCTT CGCTGCCCAG
GCCGGGAATG AGGCAACCCT CAGCGCCCTA TTGGCCCGGG GGAGTGATCC CCGGGCCGTG
GACCGGATCG GTGCCACAGC GCTGACCTAT GCCCGGCTTG CCGAGGCGCG CGGCATAGCG
GAAGAGGCTG TGGCGGCTGT GCGCGAGGCC GGGGGTGGCA AGGCGCCGGG GCCGCGACAG
GGGACGCCTG GGCTATCCGT CGGTGGTGAT CATCAGCGCC TGATGCGCCT CGATGCGCCG
GTAGCCATCT TCGATAGCGC CGGATACGAG GGGCATAGCG CCCGTTTTCT GCCGTACACC
ACGGCCTGCC GGGATCACGG TTGCCTGGTA CCGCCGGTGG TGGTGCTGCC GGCGGGGACG
ACGGTCGAGC TAGTGGCCCG GGAGGCTGCC GAAGACGGCC GGCTTCACGG CGTCTTTCTG
GTCCAGGAGC CGGGCTTGCG CGGCCTGTTG TTCGAAGACT ACCTGTTCGG CGCTTGGCCG
CCGGATGGGG CGGCGGTCAG TGAAATCAGC GTGGGCCGGG TGGCGCATAT CCGCGGCATG
GAGTGGGTCG AGGTCTTCGA CGTCGACCTG CCGGCGGCCA CAAGTGGGTT GTTCTGGTGA
 
Protein sequence
MCRTPHRVFA LLTAGVALAS LLTACSGWPL TPAQQDAFNA ARDGDVDALE RALDGPAKLQ 
ATDGAGRGLL HYAALNGEVA VVDLLIERGA GDAPDVRDDA GRTPLHEAAA GGHAESVGSL
LGAGAAVDPV DAEERSPLML AARSGDLQAV DALLEAGAEP DRRDADGATA LSMASAARYP
RVAARLREVG ERPAAPWALV GVVRAGDVET LAWMLDRGLP VDVAWQDGDN FLSLADIAAE
WSPRAAALLL RRGAQLHGEG AAALVAAALH GDAVQVAELL EQGVDPDVNV RGGDAEGRTL
LMEAARLGYR EVVEDLLAAG ADAGSVDEYG ADALWHAAVG YVSAGDERAG LERDADRSET
VQALLDAGVA PGGGDRHDFA AIHAAAAWAD AEAVEALLDA GAGLEQTDRN GRTPLIFAAQ
AGNEATLSAL LARGSDPRAV DRIGATALTY ARLAEARGIA EEAVAAVREA GGGKAPGPRQ
GTPGLSVGGD HQRLMRLDAP VAIFDSAGYE GHSARFLPYT TACRDHGCLV PPVVVLPAGT
TVELVAREAA EDGRLHGVFL VQEPGLRGLL FEDYLFGAWP PDGAAVSEIS VGRVAHIRGM
EWVEVFDVDL PAATSGLFW
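
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a short script. This is a sketch rather than the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard assignment string, which translation table 11 shares with the standard code (table 11 differs only in the alternative start codons it permits, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (same for tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGTGCCGAA CTCCCCACCG CGTCTTTGCC"))  # MCRTPHRVFA
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content field listed in the record.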