Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1604 |
Symbol | |
ID | 5733491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1861123 |
End bp | 1862454 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278743 |
Product | hypothetical protein |
Protein accession | YP_001544375 |
Protein GI | 159898128 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACAA CGCTTGATGT GCAAACTGTT CCCACGCCCG CTCCCGCCGC CAGCGACGAC ATCCTATTTC AGCCGATTGA AGCTAGCTCA AGTGGCTTTG GCAGCCTATG GGAAGTGGTG CGCATTTCGT TTGGCAGCCT GTTGGCTAAT AAAGTGCGCT CGTTGCTGAC GATGTTGGGG GTGATTATTG GGGTGGCCTC GGTGGTTTCG TTGCTGGCGC TTGGCAACGG GGCAAGTGCT TCGATTACTG GCGAAATTGA GGCAATTGGC ACGAACGTCC TGACAATTTC GGCTGGCTCA CAAAATCGTG GGCCTGGCAA TGCGACTGCT GCCCAAAACT TGACGATGGA CGATGCGCGG GCGATTGAAG CCTTGCAACT GCCAGTGATT GGCGTGGCTC CCCAACTTAA CTCAAATGCC CAAATTGTCG CCAAATCGGC GGATAAAAGC GCCCAAATTG TTGGCATAAC TCCGTCGTTT CAAGTGGTGA ACAACTTGGC CATGCAACAG GGCAGTTTTA TCACTGAGGA GCATTTACAA GGCGCAAATA CCGTGATCGT CTTGGGTTCG ACCTTGGCCA AAGATTTATT TGGCAATGGT CAAGCGGTTG GCCAAACCGT GCGGATCAAC AACCAAAGTT TGCGCGTGGT GGGGGTGCTT ACGCCGCAGG GTGGCAGTGC CTTTGGCTCG GAAGATGATC GGGCTTATAT TCCGATTACC ACGGCCCAAA AACGCCTGTT TAACGCCCGC ACTCCTGATG GCAATGGCTA TCGAGTTGGC TCGATTACGC TTTCGGCGAT TAATGCTAGC GATCTTGATG CGCTGCAATC GCGGGTAAGC ATGTTGCTGC GTGAGCGCCA TCATCTCAAA CTTGATGGTA GCGCCGATGA TTTTAATGTA ATCAACCAAG CCGAAATTTT GGGCACGCTG ACCACGATCA CCTCAATGAT GACCCTCTTT CTGGCGGCGG TAGCGGGGAT TTCGCTGCTG GTTGGCGGAA TTGGGATTAT GAATATTATG CTGGTGAGTG TGACCGAGCG TACTCGAGAA ATTGGTTTGC GCAAGGCTGT CGGCGCACGT AGCCATCATA TTTTGATGCA ATTTGTGGTT GAGGCGGCGG TGCTGAGTAT GACCGGCGGT ATGATTGGCC TGATGCTTGG CAGCATTATT CCAATTGTCG TAACCCAAAT GGGCTTGCTC GATGCGCCGA TTGATCTGCA AACTGTCGGA GTCGCGATCG GCTTCTCGCT GGGGGTTGGC TTGTTCTTTG GAATTTACCC TGCTCAACGG GCCGCCAAAT TGAACCCAAT TGATGCCTTG CGCCACGAAT AA
|
Protein sequence | MMTTLDVQTV PTPAPAASDD ILFQPIEASS SGFGSLWEVV RISFGSLLAN KVRSLLTMLG VIIGVASVVS LLALGNGASA SITGEIEAIG TNVLTISAGS QNRGPGNATA AQNLTMDDAR AIEALQLPVI GVAPQLNSNA QIVAKSADKS AQIVGITPSF QVVNNLAMQQ GSFITEEHLQ GANTVIVLGS TLAKDLFGNG QAVGQTVRIN NQSLRVVGVL TPQGGSAFGS EDDRAYIPIT TAQKRLFNAR TPDGNGYRVG SITLSAINAS DLDALQSRVS MLLRERHHLK LDGSADDFNV INQAEILGTL TTITSMMTLF LAAVAGISLL VGGIGIMNIM LVSVTERTRE IGLRKAVGAR SHHILMQFVV EAAVLSMTGG MIGLMLGSII PIVVTQMGLL DAPIDLQTVG VAIGFSLGVG LFFGIYPAQR AAKLNPIDAL RHE
|
| |