Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1008 |
Symbol | |
ID | 5732912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1152262 |
End bp | 1153950 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641278143 |
Product | CHAP domain-containing protein |
Protein accession | YP_001543784 |
Protein GI | 159897537 |
COG category | [R] General function prediction only |
COG ID | [COG3942] Surface antigen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00120354 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAACC ACCACTGGCG GATCTGCTTA ATTGTAGGAC TTTTAGTAAG TCTTGGTCTA TGGATGCAGC CTGCTAATGC GAACAAATCG TCAGATCCAA ATCGCTTGAT TGAGCCATCG ATGGGTTTCG AGTTTGCTCA GCTCGATCAG TGGATACCAA GTATTTTTGC TGAGAATGGC CCGCAAACAA ACTCACAAGT TGCCCAACGC AGCATTGGAT TTGTCCATCG CACAACCCCT TCATTATTGG CAATTGTCAG TAGTTTCGCA AACCCGAAAG CGTTAAGTAG TAGCGGCTGG ATTAAACGCT ATGATTATCG ACGAACTAAT GCTGCATATC AGATTGAAAC TATCGAATGG CAAAAACGCA GTGTGCAGTT AATTAGTGAA CATCAGTTGA GTCGTCAAGC CCATGGAACT CCCGATCATT TACGCTTGAT CATCGCGATC AATCAGCAGA TTATGGTTTT TGAATATATT GGCTATAGCA TCAATCGGGC TGAATTCCTC GCTTGGCTTG AGCAGATCAC CCTAATTCCA GCCCAAAAAT TCCAAGCATC ACCGTTGAGC AACGATGTTA AAGAGGCATT TGCTCAAGCT AACCAGCCTT TAGCTGTATC AATCCAAAAT TGCTGTGGGG TTAGTGACCC TGAATTCAAT CCTTTTCCTT GTAATAGCAG CGTTGGCAAT GGCAATTGTA CGTGGTGGGT CCGCTACCGC CGAACAGGCA ACAATATTGC CAATTTATCT AATTGTACTG GCAACGCGGA TACATGGGAT GAATGTGCTG CCAGCTCTTA TCCACAATTA CTCAGTGATA CGCCCAGCGT CAAAAGTGCC GTGGTTTGGA CAAACATAAA TCATGTAGCA TTTTTGGAAC AAGTTAATAG CCCAACCAGC ATTACGATGT CGCAAATGAA TTGGTATAGT CCATGTCCGC AATCAACTAT CACCCAAGGG ATTACAAATA AGAAATTTAT TCGCCACCCT GATGCCATTC AACCCGAACC CGCAAAGCGT TGGCATCTGA GCTACAATTT ATCAAGCGGC AATGCCGAGT TATCCTTTAA TTATGGCTTG AAATCGGATA AAGCGGTAAT TGGCGATTGG AATGGTGATG GTATTGATAC GCCAGGGGTT GTACGTGGCA ATACTTGGTA TCTTTCAAAC ACCTATGGCG AACCACATAC CATCAGTTTT GAATTCGGTG ATCCCAATGA TATTCCGGTG GTTGGCGATT GGAACGGCGA TGGCAAAGAT ACCCCTGGCC TTGTTCGCGG AACGACTTGG TATATCTCAA ACAACCTGAA TGGCGGCTGG GCCGAACGAT CCTTCGGCTT TGGTGAGGCT GGCGACAAAC CCGTGGTTGG CGATTGGAAC GGCGATGGCA AAGATAGCCC TGGGGTTGTG CGCGGCATAA CATGGTATCT TTCCAATAAT CTTAATGGCG GCTGGGCTGA TATTTCGCTT GGCTTTGGTG AGCTAGGCGA TACATTCATC GTGGGCGATT GGGATGGCGA TGGTGATGAT ACCCCTGGGG TTGTGCGTGG CAATATGTGG TATCTCTCCA ACAACCTCAA TGGTGGTTGG GCCAATCTCT CCTTCATGTA TGGCGATCCC GGTAACTATC CAATTGTTGG TAATTGGGGT GATAGCGATC GGAATAGTGA GATTGGCGTA ATTCCCTAA
|
Protein sequence | MRNHHWRICL IVGLLVSLGL WMQPANANKS SDPNRLIEPS MGFEFAQLDQ WIPSIFAENG PQTNSQVAQR SIGFVHRTTP SLLAIVSSFA NPKALSSSGW IKRYDYRRTN AAYQIETIEW QKRSVQLISE HQLSRQAHGT PDHLRLIIAI NQQIMVFEYI GYSINRAEFL AWLEQITLIP AQKFQASPLS NDVKEAFAQA NQPLAVSIQN CCGVSDPEFN PFPCNSSVGN GNCTWWVRYR RTGNNIANLS NCTGNADTWD ECAASSYPQL LSDTPSVKSA VVWTNINHVA FLEQVNSPTS ITMSQMNWYS PCPQSTITQG ITNKKFIRHP DAIQPEPAKR WHLSYNLSSG NAELSFNYGL KSDKAVIGDW NGDGIDTPGV VRGNTWYLSN TYGEPHTISF EFGDPNDIPV VGDWNGDGKD TPGLVRGTTW YISNNLNGGW AERSFGFGEA GDKPVVGDWN GDGKDSPGVV RGITWYLSNN LNGGWADISL GFGELGDTFI VGDWDGDGDD TPGVVRGNMW YLSNNLNGGW ANLSFMYGDP GNYPIVGNWG DSDRNSEIGV IP
|
| |