Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3848 |
Symbol | |
ID | 5735713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4830315 |
End bp | 4832750 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281001 |
Product | NB-ARC domain-containing protein |
Protein accession | YP_001546612 |
Protein GI | 159900365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATGT GGAATGAACG CACCGTGCAA GATGTGTTAC AACGCCCAGA ACGCTTGCTT AGCCAAACTG ATTGGCGGCA CATAATTCAG CAACAGGGTG GATTAACTGC GTTTTACCAA CAATTGCAAC AACTGCCACT TGAAGCCAAT CAACAAGCGG TGCTGAACGT TTTGACGACC TATCCAGGTG CGCCAGTTGA AACCTATTGT TCATTGCTGA ATGTGCACAA AGCGACCTAT CATCGCTATC AAAAAGCCTT GATTCAACAA CTAACCAGCT TGCTCAATAA CGAGCAACCG CAAGCACACA CGCCAACCCA AGCGCCGTCG TTGCACCAAC TGCGCCCGAT TTTGGCCGAT TTTGTGGGCC GTACAGCCGA ACTCAAACAA GCTCACTATG CGATCGATAT CGCCCACAAT GCCGCACAGG GTGCGGTAAT TAACGGAATT CAAGGCATGG GCGGGGTTGG AAAAACTGAG CTAGCAATCT ATTTGGCGCA TCAGTTGATT CCGCATTTTC CTGATGCCCA AATTGTGCTC AATTTGTATG GCTCGCGCGA GCAACCGCTG ACGATTGAGC AAGCACTGGG CACGGTGATT GCCCTGTTTA AGCCGAATGC CAAATTGCCT GAACAACGCG AAAAACTGCT TGAAATCTAT CATGAGGTGT TGGCTGATAA GCGGGTATTG ATTTTGGCCG ACGATGCGCG AGATTTGGCC CATGTCCAAG ATTTAACCCC GCCAGTTGGT AGTTGTTTGT TGGTCACTAG CCGCTTGCGG TTTGCGATGC CGCTGATGGC GCAACTGCAT CTGACCGAGT TTCAGGAGCC TGAGGCCATC GCTTTGCTCC AGCAAATTTG CCCACGACTT GAGGCTGAAA CTGCCCAGCA ATTGGCCGTT GCCTGTGGTT ATCTACCCTT AGCTTTGCGC ATCAGTGCCA GTATTTTGGC CCAAAACCCT GAGTTAGCGG TTGCCGAATA TCTGATCCAA CTGCGCGATC AACAGCAACA ACTCGCCGCC TTGGAATACC CCGATGATCC GCAGGCCAGC GTGGCAGCAT CGTTGGCCTT GAGTTACGCC CGTTTGCCCA GCGAATTGCA AGCCTTGGCG CGTCAACTCA GCCTGATTGT GGCCGACTTT AGTAGTGCCA TGGGTTTAGC AACAGCAGGG CTGGATTTCA ACATGGCCAA CGAAAATTTG CTGTATAAAT TGGCCTTGCA CAACTTGATT CAATTTGAGC ATCGCCAAGA GCGCTGGCGC ATGCACGACC TCGTTCGCAG CGTGCTACGC CGTTATTTGG ACGAAGCAGA ACAAACTCAA ACTCTATTGA ATTATGCCCA AGCCAGTGTT GAAACTCTCA AAATTATTTA TCAGGATTTT CGAGCAGGCG GAGCTACTCA AACTAAAAGC ATTGATAATT TTGATCGCGA ATATGCTCAT ATTGTAGCAA TTTGGCAATG GGCGCAACAA CAACCGATCT CGCCAGTAAT AGATAAAATC GTGGTTGAAT TAGGATTTTC CAGTGGTGGA GTTAGTCGGA TACGAGTTGG TCGTCGCTAT AGCACTTTAG CCGAACATGA GTTTGGGTTT GAGGCAGCTC TGCGTATTCA AGAGCTTTAT AAAGCAGCAA TATTTGCTGG AGCACTTGCT AATAGATATC TCGCTCGTGC AGAGTATAAA ATATCACTCG GGTGGCATGA ACGAGCTTAT GCGCTTGCTC TTGAGATCAA CGATCTTTAT TTACAATCGT TATTTTTAGG TGATATGGCC ACATGTTATA GCCAAATGGG TGGTGATAAA CATCTGGATA AAGCCCTGGA CTTAGAGCGC GAGGCACTAC GATTGTTTAG GTTAAGCGGG TATGAGGGTT CAGGTGAAAG CCTCAGAGTG AATAATCTCG CAACAAACCT TGCATTGCTT GGATACCATG AAGAAGCTGC TGAATATTTT ATTGAAGCTG TTACTATCGC TCAAAAATCT GAGAACCAAG CTGATGAGTG TCGTGCACTC TATAATCTCG GTGAAACGTA TCTCAAATTG AACCAACTTG ACCAAGCCCA AATAGCTTTT GATCAAGCAC TTACTATTGT TGAGCGCATG AATTTTGATG AAGGCCGCGC TTACATGTTG CAAGGCCAAG CAAATGTAGC GATGTTGCAA AAAAACTATC ACCAAGCAAT CCAAAGATTC AATCAAGCAT ATGCACTCAT GCAGCATTAT AATCGGACTA TTGCGTTAAA CATTCAATGG AAAATCGGCC TCCTGTATTG GAAGCTTAGT GATGTGTTAG CAGCTGAAGC CCAAATGCAA GCAGTGCTTG AACAAGAACG CCCATTGGGG ATTGATCGGG TGCAAGATCA TGAACTACAA CTGAGCAATT TGCGCAATCG CCAACCCTTT GATGATAGCC TGTTGGTATC GATGCTCAAA GAATAG
|
Protein sequence | MTMWNERTVQ DVLQRPERLL SQTDWRHIIQ QQGGLTAFYQ QLQQLPLEAN QQAVLNVLTT YPGAPVETYC SLLNVHKATY HRYQKALIQQ LTSLLNNEQP QAHTPTQAPS LHQLRPILAD FVGRTAELKQ AHYAIDIAHN AAQGAVINGI QGMGGVGKTE LAIYLAHQLI PHFPDAQIVL NLYGSREQPL TIEQALGTVI ALFKPNAKLP EQREKLLEIY HEVLADKRVL ILADDARDLA HVQDLTPPVG SCLLVTSRLR FAMPLMAQLH LTEFQEPEAI ALLQQICPRL EAETAQQLAV ACGYLPLALR ISASILAQNP ELAVAEYLIQ LRDQQQQLAA LEYPDDPQAS VAASLALSYA RLPSELQALA RQLSLIVADF SSAMGLATAG LDFNMANENL LYKLALHNLI QFEHRQERWR MHDLVRSVLR RYLDEAEQTQ TLLNYAQASV ETLKIIYQDF RAGGATQTKS IDNFDREYAH IVAIWQWAQQ QPISPVIDKI VVELGFSSGG VSRIRVGRRY STLAEHEFGF EAALRIQELY KAAIFAGALA NRYLARAEYK ISLGWHERAY ALALEINDLY LQSLFLGDMA TCYSQMGGDK HLDKALDLER EALRLFRLSG YEGSGESLRV NNLATNLALL GYHEEAAEYF IEAVTIAQKS ENQADECRAL YNLGETYLKL NQLDQAQIAF DQALTIVERM NFDEGRAYML QGQANVAMLQ KNYHQAIQRF NQAYALMQHY NRTIALNIQW KIGLLYWKLS DVLAAEAQMQ AVLEQERPLG IDRVQDHELQ LSNLRNRQPF DDSLLVSMLK E
|
| |