Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0003 |
Symbol | |
ID | 5736837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3605 |
End bp | 4564 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277124 |
Product | peptidase M28 |
Protein accession | YP_001542783 |
Protein GI | 159896536 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00570901 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCAAC GCCTTTGGTT CCAACGCTTG CTGCAACTTG GCCTGATCAT CAGCCTAACT GGCTGTGGTG CAACGACTGG AACACCTGTT GCCAATCCAG TTCAAACAGT TGCCGCCACG AGTGCGCCGA CGATTGTGCC CTTGCCCAAG CCTAGCCTAA CCGAAAGCCC CCAATTCAAC GGGGCCAACG CCATGGAATT TGCCAAAGTT CAAATGCAAT GGATTCCGCG CGATACGGGC ACACCAGGCT GGCAAGCTAA CGGCGATTGG ATTGTACAGA CTTTGTTGGA ATATGGCTGG GATGTTGAAG AACAATTTTT CAACGTGCCC GAAAATAAAA AAGGCCGCAA TATTATCGCC CGCCGTGGCA CTGGACCATT GGTCTTGCTT GGGGCGCACT ACGATGCTCG CCGTTACGCA GATAATGATC CCGACCCAAC CAAGCGGATG CAGCCCGTAC CAGCTGCCAA CGATGGAGCC AGCGGCGTTG CGGTGCTGCT GGAATTGGCG CGGGTACTCA AGCCCGAACG GCTCAACGAG GAAGTTTGGT TGGTGTTTTT CGATGCCGAG GATAATGGTG GGATTGGCAC ATGGAACTGG ACGCTTGGCT CAATCGACCT AGCTCCCAAG CTCGAAACCA CGCCCAAAGC TGTGGTGATC GTCGATATGA TTGGCGATGC TGATCAACAG GTCTACCTGG AGCAAAGCTC AACGCCGGCT TTACGGGCCG AAATTTGGCA ACAAGCCGCC GATTTGGGTT ATACCACCTT CATTTCGGAA ACCAAACACC ATATTCTTGA TGATCATACC GCCTTCTTGC AACGCGGCTT TAGCGCCGCC GACCTAATCG ACTTCGATTA CCCGCATTGG CACACCACCA GCGATACGAT CGATAAGCTC AGTGCCTCGG CACTAGAAGC AGTCGGTCGC ACGCTCGAAG AGTGGCTCAC CAAGCAGTAA
|
Protein sequence | MIQRLWFQRL LQLGLIISLT GCGATTGTPV ANPVQTVAAT SAPTIVPLPK PSLTESPQFN GANAMEFAKV QMQWIPRDTG TPGWQANGDW IVQTLLEYGW DVEEQFFNVP ENKKGRNIIA RRGTGPLVLL GAHYDARRYA DNDPDPTKRM QPVPAANDGA SGVAVLLELA RVLKPERLNE EVWLVFFDAE DNGGIGTWNW TLGSIDLAPK LETTPKAVVI VDMIGDADQQ VYLEQSSTPA LRAEIWQQAA DLGYTTFISE TKHHILDDHT AFLQRGFSAA DLIDFDYPHW HTTSDTIDKL SASALEAVGR TLEEWLTKQ
|
| |