Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0917 |
Symbol | |
ID | 5732686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1048558 |
End bp | 1049871 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278049 |
Product | hypothetical protein |
Protein accession | YP_001543693 |
Protein GI | 159897446 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAGA CGTTCTTATC AATTGATGAC ATTTTGACGC GGGTTGCGCC GTTGGTGAAT CCGCGCACCG GAATCTTGGG CACTGCGACC GAACTACCGC GCACGCCTGG CAATCCTGCA ATTCCAGTCT ATAGTACCAA CATTCATTGT GCCGAACTGG CTATGGCCGA GTCTCCAGCT GGCACGGGTG TATTTTTGGA ACGTGAATGG GCTAAGGCTA AATCCTATTG TGAAGCACTC GAACGTTATT GCAATGTCAA GCATGCCCAT CAAAACTTTA TTGTGGCGAC CCGCCAAGAG TTGGGTATTG AAGCAGTTGA TTTAGAGTTA TTTCCCCGCT GTTCTGATGC TGAATATCGC AACCCCGCCA ACCCAACCAC GCCGCCGAGC AACCGCCAAC CCATGCGCTG GGTCGAAGGC TATTCGCTGA TTTCAGGCCA GCCGATGTAT GTTCCGGCGA TTGGGGTCTA TGTTGGGATG CATCCTGAAT ATGCTGGCGA AACCTTTACC ACCTCGATTT CAACTGGCAC GGCGCTGGCA GCCAGCTACG AACAAGCAAT TGTAACTGGG ATTGGCGAGG CGATCGAACG CGATGCCTTG AGTATTGCCT GGTGGCAAAA GTTGGCATTG CCCCAAGTTG ATTTGCGCGA ATTCCCTGAT CCAGCCTTCC AAGAACGTTT GAGTCGGGTC GAAGCGGCTC AAATTCAGAG TTATTTTTTT GATGCAACCA CCGATCTAGG GGTTGCGACG ATCTATGCAG TCCAAGTTGC GCCCCATGGT CGGCTACGCA CAATGGTGAT GTCGGCCACC CGCACCAATC CATTGGCCTT GCCCAGCAAA GTGCTCGACG AATCGGCAGC CTCGCGGATT GGTATCGAGC ATTCGTTGAG CCAACCGCTG CCCTTTGATC CTGCCGATTA TCGCACCTTT ATGCGCTTGA GTGATGGCGC TGCCTACTAT GCCGATGCGG CCACCGCTCC AGCCTTTGAT TTCTTGTTTG AGCAAACTCG CTGGCGCACG CTCGACCAAC TGCCACGGCT CGATCATCCT GATCCTGCGG TCGAAGCCCA ACGCTTGATC GATATTTTTC GGCGGGCTGG GCTGGAACTG ATTGTGGTCG ATTTGACCTT GCCAGCCTTG CGCGAAGTTG GTTTGTATAC GATCAAAGTG GTTGCGCCGC AATTAATGCC CTTCAGTTGT AACTATAATG CCCGCTTTTT GGCTACGCCG CGCTTATATA GCGTGCCGGA GCGCATGGGC TACCCAGTTT TGGCCGAAGC TGAATTAAAC CATTGGCCTC AGCCATTTGC CTAG
|
Protein sequence | MSKTFLSIDD ILTRVAPLVN PRTGILGTAT ELPRTPGNPA IPVYSTNIHC AELAMAESPA GTGVFLEREW AKAKSYCEAL ERYCNVKHAH QNFIVATRQE LGIEAVDLEL FPRCSDAEYR NPANPTTPPS NRQPMRWVEG YSLISGQPMY VPAIGVYVGM HPEYAGETFT TSISTGTALA ASYEQAIVTG IGEAIERDAL SIAWWQKLAL PQVDLREFPD PAFQERLSRV EAAQIQSYFF DATTDLGVAT IYAVQVAPHG RLRTMVMSAT RTNPLALPSK VLDESAASRI GIEHSLSQPL PFDPADYRTF MRLSDGAAYY ADAATAPAFD FLFEQTRWRT LDQLPRLDHP DPAVEAQRLI DIFRRAGLEL IVVDLTLPAL REVGLYTIKV VAPQLMPFSC NYNARFLATP RLYSVPERMG YPVLAEAELN HWPQPFA
|
| |