Gene Information |
Locus tag | Haur_3145 |
Symbol | |
ID | 5735017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3972667 |
End bp | 3974553 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280288 |
Product | hypothetical protein |
Protein accession | YP_001545910 |
Protein GI | 159899663 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
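The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table for Haur_3145.
length = gene_length_bp(3972667, 3974553)
print(length)                     # 1887
print(protein_length_aa(length))  # 628
```

Both results agree with the listed Gene Length (1887 bp) and Protein Length (628 aa).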
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGT CAGCCCGGTG GCGTGGGTGG TTTTGGCTTG GTTTAGGCCT AATCTGCCTA CTTTGGCCCA ACCTAAGTCA TGCCCAAAAT GCCGATGCCA TCACCATCAC TATCGATCAA GTTGGCTTTG ATGCCCAAGG TCATGTAGCG AATGGTGGTT GGTATCCAAT TATTACAACG ATCGAAAATA GCGGAGCCGA CCTTCAAGCC CAAGTTGTGG TTACCACAGG CTTTGGTCAG GCCGATTTGC TGCAAAATGT CGATTTACCT GGTGGTTCGC GCAAACAAGT GCGTTTACTC ATGCGGGCCA ATCTCAATCA AACCGTTGTC GAAATCAAAG TTATTGATGC TCAAGGCAAG CAGTTAGCCC GCAATCGCAG TAATGTGCGG GTACACGATA GCCAAGAAAT TTTAGTAGGC GTATTTGGCG GGCCAGCCTC AAGTTTAGCT GGCGCTTCAG TTCCTGGCCG CCTCACCACG GTGATGCCGC TTGACCCAGC TCAACTGCCA AGCACCGATG GCGAATTGTT TAACTTTGCA GCAATCGTGC TCCAAGATGT TCAACCGACT GCCGAGCAGG CCGCCGCTCT CGAACGTTGG GTCGCAACTG GTGGAACATT GATCGTCAGC GGTGGCCCGA ATAGCGCCGA ACTGCCCAAA GAACTAGCCT CACTCTTGCC TGCGAGTGTT AGCCGCAGCA ATAGTTCGGC GGTGTTAACC ACGCTCAATG GGCGCAATAC TCCCGCTTTT GCCCAAATCA ACCTACGGGT TAACCAACTT CAACCAACCG CCGATGCTAG CATATTCGGC GTTGGGGCCA ATAACGAAGC CTTAGTGATC AGTCGCAAGC TGGGCATGGG TCAAATCTTA GTCACAGCCT TCAATCCCAG TGACTTACCC GCCGAAGTTA ATGATCGGTT GGTTTGGCCA GTGCTCTTAC AACCCCAACT TTACCGCGAT TGGAATGTAG CGCTCTCGCC ATGGTCAATC CAGATTCGCG GCACTGATCA AAATTTGCCT TCAGTTTTAG GCTTGATGGG AATTTTGTTT GGCTACATTC TGCTGATTGG CCCGATCAAT TACTTTATTT TACGACGTTT GGATCGGCGG GAATGGGCTT GGTTTAGTAT TCCATTGGTT GTACTTGGCT TTGTGGGTAT TATGTATTTA GCTGGTGGCG ATTTACGTAC TGGCAATATC AATGTCACAA CGATTAACAT TATCGATAGC CAACTGGGAG CCGATCAAGG CCGTCTAAGC GTCAATTATG GCTTCAATGC AGGGCGACGC GGCGCATGGA ATGGCAGTAT TGATGCCAAT TTAATTGCTG GCAACCAACC AATGCAAGGC TTTGGCGATG AGGGTTCGGG CACAATCGAG CAAACCAACG ATGGCAAAAC CCGCTTGCCC AATTGGCAAA GCAACATCGG CCAAATGCAA ACCCTCGCCG CGATTGGCTC AAGCGCAGTT CCCTACAATT TTGAGGTTAA AGTAGCCAAA GCCAATTCGT GGGAAGGTGC AACCATTACC AACCGGAGCG AACGCAAGGT TGAATATGCG ATTCTGTTCA ATGGCGAAGA AAGTATTATT TTGCCAGCAC TCGAACCAGG TGCTTCGATT ACGATCGATA ACAGCCTTGA TCGCGTTATG CAAAGCAGCC CCTATCTCAA CAACAACGAC CAATTAACCC AAGCCCTACA ATTGCTCTGG AATGCTGGCA ACGAACGCAC CAACTACAGT GGACTGCCCA AAAATTCGCT GTATACCAAG CCGCATATCA CGGTGCTTGA TACCGAAGTG 
CTCAACGAAA TTATGGTCGA TGGGGTCGCC GCTCCGCAAA AGAGCAGCAA TATCTATAAT TTGTATGTTG ATTTGGAGCA ACGCTGA
|
Protein sequence | MAQSARWRGW FWLGLGLICL LWPNLSHAQN ADAITITIDQ VGFDAQGHVA NGGWYPIITT IENSGADLQA QVVVTTGFGQ ADLLQNVDLP GGSRKQVRLL MRANLNQTVV EIKVIDAQGK QLARNRSNVR VHDSQEILVG VFGGPASSLA GASVPGRLTT VMPLDPAQLP STDGELFNFA AIVLQDVQPT AEQAAALERW VATGGTLIVS GGPNSAELPK ELASLLPASV SRSNSSAVLT TLNGRNTPAF AQINLRVNQL QPTADASIFG VGANNEALVI SRKLGMGQIL VTAFNPSDLP AEVNDRLVWP VLLQPQLYRD WNVALSPWSI QIRGTDQNLP SVLGLMGILF GYILLIGPIN YFILRRLDRR EWAWFSIPLV VLGFVGIMYL AGGDLRTGNI NVTTINIIDS QLGADQGRLS VNYGFNAGRR GAWNGSIDAN LIAGNQPMQG FGDEGSGTIE QTNDGKTRLP NWQSNIGQMQ TLAAIGSSAV PYNFEVKVAK ANSWEGATIT NRSERKVEYA ILFNGEESII LPALEPGASI TIDNSLDRVM QSSPYLNNND QLTQALQLLW NAGNERTNYS GLPKNSLYTK PHITVLDTEV LNEIMVDGVA APQKSSNIYN LYVDLEQR
|
| |
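The gene sequence is shown in space-separated blocks of ten bases, so it has to be normalized before any downstream check. The following is a minimal sketch, assuming only the Python standard library, of how the block formatting might be stripped and how the GC fraction and terminal stop codon could be checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove whitespace between the 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the normalized sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    s = normalize(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Final blocks of the gene sequence, copied from the record above.
tail = ("CTCAACGAAA TTATGGTCGA TGGGGTCGCC GCTCCGCAAA AGAGCAGCAA "
        "TATCTATAAT TTGTATGTTG ATTTGGAGCA ACGCTGA")
print(len(normalize(tail)))  # 87
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content; the result is not stated here because it has not been computed.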