Gene Haur_1813 details


Gene Information

Locus tag: Haur_1813
Symbol:
ID: 5733671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2106410
End bp: 2108857
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 51%
IMG OID: 641278956
Product: ankyrin repeat-containing protein
Protein accession: YP_001544584
Protein GI: 159898337
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
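
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2106410
end_bp = 2108857
gene_length_bp = 2448
protein_length_aa = 815

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop
# codon that does not contribute a residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the span comes out at 2448 bp, which divides evenly into 816 codons, giving the 815 aa listed for the protein.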


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000177587
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTCA ATTCATTAAA AAAACTGCTC AAACACCAAG CCTACCACCA AATACTTGAG 
CATTTGCAAA CTGGCCCAAC TTGGAAGCAA AAGTCGCTTG ATCATGCCCT CTATCAGCTT
GCCAGCCAAC CAGCTGAAAC CCTGATCGCC ACATTACTCA GCGCTGGCGC AAACCCCAAC
CAATCACTGC AAGACAACCG CTATATCAGC TTGCATCGGG CAGCAATCTA TAATCGTTAT
GCGATCGCTG AGCTTTTGTT GAGCCATGGA GCCAACTGCC ATGCCCAAAG CGCTAAGCAA
CAAACGCCAC TACATTTGGC TTGTCTCAAC GGCCATTTCG AGCTAGCCCA ATTGCTTTGG
CATAACGGCG CTGATCTGCA TGCCCAAGCT GATTCAAACT TCACACCATG GTTATATGCA
GCCAGCAGTG GCAATCTCCA ACTACTCAGG TGGTTGTTAG ACCAAGCGGT TGATATTGAT
CAAACGACGG CTAATGGCAT CTCGGCCCTA ACGTTGGCAG CGTGGAATGG GCATCAAGCA
GCCGTCGAAT GGTTGTTAGC CAATGACGCA GCAATCGAAG GCCCGCCAAC CAGAACTCGC
ACCCCGTTGC ATGCTGCATT AAGTAAAGGC CATATGGCAA TTGCCAATTT GTTGCTTGAT
CACGGGGCAG CGGCCAACGC CATAACCAGC GAGGGCAATA GCTGCGTGTG TTTGGCTGCT
TGGCACAATG CTACTGATTT AGTTGATCGT TTATTGCAGC TTGGCTCACC CATTAATTGT
GCGATGGCAA AACCGCATGC GACCGCCTTG CATGCTGCGG CCCAACTCAA TAACCCTGCG
ATGGCCCAGC AACTTTTAGC CAACGGAGCG AAGATTAATG CACTCAATCC ACAAGGCTTG
ATGCCATTTC ACACAGCGAT CAGCACTATT TGCGCGACTA AAAGCGCCGA TCTCGCGTTG
ATCGAGGTTT TTCTGCGGAT GGGAGCCGAT CCTAACCAGC CAAGTTATGC AATCAAGCAG
CTTGGCAAGG AAACGCAGGT GCGCGAACAT TGGCGACCCT TAGGCTATGT GTTGGCCCAA
AAACGGCGTG ATTTAGCCGA GTGTTTATTG GCTTATGGTG CTGAGCTTAA TTTGCCAAGC
TATGGCAAGT GGGCGATGGA GCAAACACCA CTTGAAGTTG TGGCTAGCGC CGAGGTTATT
GATCAAGAGG CTGGCGAATT GTTTAGCTGG CTCCTCAGCC TCAAACCGCA AATTCCCCCA
AAGCTGCTGC CAAGCATGCT TATAACGCAA AAATTCGGCT TTGCCCAACA ATTAATCGCG
GCTGGAGCCG ATCTGCATGC TCCCGATGTT TTAGGCGCAG CGATTACCGC TAAAAACGAG
CACTATGTTC ACGATTTTTT GGCACGAGGC GTAAATGTTG ATAGCCCATA TCGCAATTAC
ACAACCGTGT TACAGCTAGC ACTGAGTTCT TATCCCCAAT TCGCGCTGCG ATTAATTGAG
GCTGGAGCCG ATTTAACCCA CTTGGTTGAT GAGAACCCAA GCCTCAATTA TCCAATGCTG
ATTCGCCAAC AACGCTCAAA TCGGCCAGCG ATCAGCAATT TAATGCTACT CGATTTGATT
GCCGAAGCCA TGCTAGCCCA GCTTGAGCCT GCAAACCCAG CTTATCAAGC CTTGCTCGAT
CAAGAGTTTG CCGAGCGCGT TTGTACAGCA AACGAAACAA CATGGCAGAT TTGGCTGGCA
CGCGGAGCCA ATATCCATGC GCTGAATCAT GCTGGATTGT TGGGGTTCAG CCAGCTTTGC
GTCCACGGCG ATTTAGCTGG CGCTCAAACA CTCTATGCCA ACGGCGCAAA TATCAATCAA
ACTGACCATT TTGGCCGCAC TGCCTTGCAC TGGGCAGTTG AGCGGCAACA GCTGGCAAGC
ATACAACAAT TGCTGGCTTG GGCTGCGGAT ATGCATAGTG CAACGCCCTA TGGCTACACG
GCGTTGCACT ATGCCGCCTT AGCCAATCGA CTGGATCTGG TTGAATTGTT GTTGCAAGCC
AAGGCTGATC CGACGGTGCA ACTCACGACG GGGCGCTTAC AAGGCTGGAC GGCATTACAC
TGCGCCTATG CAGTCGATAA TCAATCATTA ATTAAATTGC TGCACCCACT AACGCCCACA
ATCACGCCGC CAGAGCCAGG TTCGCAGCAT ATTCAAGGAA CCTACGACGT AACAATGGCG
CATAACGGCT GGCACAAACC ACGCCCAATT AGCCAGCAAA CCCAACGTTG CCCCGCTTGC
GCCGAGCACA TGCTCTACAA CACTGCTCAC AGCTTCGATG GCTCAGGCCA ACTAGCCGAT
CGAATTGAGA TCTATCGCTG TGGGAATTGT CAGGCCGTAT TTTGGGAGAA TAGTATGGCT
ACGTGGCGCT CACGTTTGCA GCCATGGTCA AGTTTTGTGC CGGATTAA
 
Protein sequence
MNLNSLKKLL KHQAYHQILE HLQTGPTWKQ KSLDHALYQL ASQPAETLIA TLLSAGANPN 
QSLQDNRYIS LHRAAIYNRY AIAELLLSHG ANCHAQSAKQ QTPLHLACLN GHFELAQLLW
HNGADLHAQA DSNFTPWLYA ASSGNLQLLR WLLDQAVDID QTTANGISAL TLAAWNGHQA
AVEWLLANDA AIEGPPTRTR TPLHAALSKG HMAIANLLLD HGAAANAITS EGNSCVCLAA
WHNATDLVDR LLQLGSPINC AMAKPHATAL HAAAQLNNPA MAQQLLANGA KINALNPQGL
MPFHTAISTI CATKSADLAL IEVFLRMGAD PNQPSYAIKQ LGKETQVREH WRPLGYVLAQ
KRRDLAECLL AYGAELNLPS YGKWAMEQTP LEVVASAEVI DQEAGELFSW LLSLKPQIPP
KLLPSMLITQ KFGFAQQLIA AGADLHAPDV LGAAITAKNE HYVHDFLARG VNVDSPYRNY
TTVLQLALSS YPQFALRLIE AGADLTHLVD ENPSLNYPML IRQQRSNRPA ISNLMLLDLI
AEAMLAQLEP ANPAYQALLD QEFAERVCTA NETTWQIWLA RGANIHALNH AGLLGFSQLC
VHGDLAGAQT LYANGANINQ TDHFGRTALH WAVERQQLAS IQQLLAWAAD MHSATPYGYT
ALHYAALANR LDLVELLLQA KADPTVQLTT GRLQGWTALH CAYAVDNQSL IKLLHPLTPT
ITPPEPGSQH IQGTYDVTMA HNGWHKPRPI SQQTQRCPAC AEHMLYNTAH SFDGSGQLAD
RIEIYRCGNC QAVFWENSMA TWRSRLQPWS SFVPD
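
The sequences above are laid out in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares. The sketch assumes the sequence starts with ATG, as this one does, since table 11 also permits alternative start codons that this code does not handle.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence by dropping spaces and line breaks, and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Standard codon assignments (also used by translation table 11), built in the
# conventional T, C, A, G ordering of first, second and third base.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAATCTCA ATTCATTAAA AAAACTGCTC"
print(translate(clean_sequence(first_blocks)))  # MNLNSLKKLL
```

Running `translate` on the full cleaned gene sequence and comparing the result with the cleaned protein sequence is a quick consistency check, and `gc_content` on the same input can be compared with the GC content field in the record.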