Gene Information |
Locus tag | Haur_3716 |
Symbol | |
ID | 5735580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4674206 |
End bp | 4675333 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280868 |
Product | hypothetical protein |
Protein accession | YP_001546480 |
Protein GI | 159900233 |
COG category | |
COG ID | |
TIGRFAM ID | |
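The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the Gene Length counts the stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4674206, 4675333)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results agree with the Gene Length (1128 bp) and Protein Length (375 aa) rows.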
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000943682 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATTG TTGAGGGATT AATCAATCTT GGGGTGTTTT TGATCGGGCT GGTGATTGTG GCATTCACGC TATTTAGTGC GATTCGCATG GTCGTGTTGC CACGCGGCGA TAATGTTTGG CTGACGCGCA CCACCTTTCG GCTGGTCTAC GGCTGCTTTT TGGTGCGTTT GCGCCGAATC AACAACTACG AACAAGCCGA TCGGGTGCTG GCATTTTATG CGCCAGTGGC CTTGCTGATT ATTCCAGTGG TTTGGATGGT GTTGGCAATT GCCGGCTTTA CCTGTATGTT TTGGGCGGTT GGCGAGCATC CTTGGAGTAA AGCATTTTTG CTGAGTGGCT CGTCGATGTT GACGCTGGGG TTTGCCGCAG TTGATTCAAC GGTTGAAACA ATTTTGGCCT TTTTGGCGGC AACGGTTGGC TTAGGCATTG TGGCTTTGTT AATTGCCTAT CTGCCAACGA TGTACGGGGC TTTTTCCAAG CGCGAAGAAG CCGTGACCTT GCTGGAAGTG CGGGCAGGCA CGCCGCCATC GGCAATTACC ATGATTCAGC GCTATCAGCG GATTCATAAT TTGGAGCGTA TGCCTGAGCA ATGGGCAATT TGGGAGCAAT GGTTTGCCGA ATTAGAGGAG AGCCATACCA CCTTTGCGGC CTTGGCGTTT TTTCGTTCGC CCCAGCCGTA TCGCTCGTGG ATCACCGCTT CAGCGGCGGT GTTGGATTCC GCAGCGTTGG CAGTTTCGAC CCTTGATATC CCGCGCCAGC CAGATGCCGA TTTGTGCATT CGGGCGGGCT ATTTGGCGCT GCGCCGCATC GCCGATTTGT ATATGATTCC CTACAATGCC ACGCCGCATG CCGATGATCC AATTAGCATC ACGCGGGCTG ATTTTGATCA GGCTTGTGCT GAATTGATCG CTAGTGGCGT GCCACTCAAA GCTGATCGCG ACCAAGCATG GCGTGATTTT GCTGGTTGGC GGGTCAATTA CGATCAAGTG TTGATCGAAT TAGCCAAATT GGTAACCGCC CCGCCAGCCC CTTGGACAGG CGAACGTCCG CTGCCATTAT GTTTCAACAT GCCACGTTTT GGCCATAAAC GCCGCCAACT AGCCAAAGAG CCAACTGGCT CAATGTAA
|
Protein sequence | MTIVEGLINL GVFLIGLVIV AFTLFSAIRM VVLPRGDNVW LTRTTFRLVY GCFLVRLRRI NNYEQADRVL AFYAPVALLI IPVVWMVLAI AGFTCMFWAV GEHPWSKAFL LSGSSMLTLG FAAVDSTVET ILAFLAATVG LGIVALLIAY LPTMYGAFSK REEAVTLLEV RAGTPPSAIT MIQRYQRIHN LERMPEQWAI WEQWFAELEE SHTTFAALAF FRSPQPYRSW ITASAAVLDS AALAVSTLDI PRQPDADLCI RAGYLALRRI ADLYMIPYNA TPHADDPISI TRADFDQACA ELIASGVPLK ADRDQAWRDF AGWRVNYDQV LIELAKLVTA PPAPWTGERP LPLCFNMPRF GHKRRQLAKE PTGSM
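The gene and protein sequences can be spot-checked against each other. The sketch below is illustrative only: it assumes the gene sequence is already shown on the coding strand (it begins with ATG even though the Strand field is "-"), and it uses the standard codon assignments, which translation table 11 shares for elongation. The helper `translate` and the variable names are hypothetical. Only the first three and the last two blocks of the gene sequence are used; the last two blocks are in frame provided the full sequence is 1128 bp as stated.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate DNA written in space-separated blocks; '*' marks a stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_blocks = "ATGACAATTG TTGAGGGATT AATCAATCTT"  # first 30 bases of the gene
last_blocks = "CCAACTGGCT CAATGTAA"                # last 18 bases of the gene

print(translate(first_blocks))  # MTIVEGLINL
print(translate(last_blocks))   # PTGSM*
```

The outputs match the first block (MTIVEGLINL) and the final residues (PTGSM) of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.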