Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3813 |
Symbol | |
ID | 5735677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4786075 |
End bp | 4787157 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280965 |
Product | peptidase T-like protein |
Protein accession | YP_001546577 |
Protein GI | 159900330 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01883] peptidase T-like protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.119165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATCA ATCAAGAGCG TCTGCTCGAT ACGTTTTGTA CTTTGGTTCG GATCGATAAT CCATCTGGTG AAGAAGCAGC CATGGCTGCT CACTTAATCG AGCGCTGCCA AGCTTTGGGC CTTGAATGCG AGCAGGATGC GATTGGCAAT GTGATTGCCC GTTTGGCTGG CGTAGGTTCA CCCTTGTTGC TGAATGCCCA TATGGATAGC GTTGCACCAT GTCATGGCAA GCAGCCAGTC ATCCGCGATG GCAATATCTA CAGCGCTGGC GATACCGTTT TGGGTGCTGA TGATTTGGCG GGGGTTACGG CATTTCTCGA GGGCATTCAA GCAGTTTTAG AATCAGGCCA GCCGCATCGG GCAATTGAAT TGGTTTTTAC GGTGCAAGAA GAGACAGGGT TGTATGGCGC TCGTGCCCTC GATTATAGCA AGCTGCAAGC CAAACAAGGC CTCGCTTTCG ATTTAAATGG CGATGTTGGG GCGATTTGTA TTGGTTCACC CGCCCACGAT TCGTTCACCG CGACGATTAC TGGGGTTTCG GCCCACGCTG GAGTTGCCCC CGAAAAAGGC ATCAGCGCGA TTGAAGTTGC CGCGCATGCG ATTGCTGCCA TGCCTTTGGG GCGGCTCGAT GACGAAACCA CCGCCAATAT TGGCTCGATC CATGGTGGCA AGGCCAACAA TATTGTGCCC GATTCAGTTG TGGTCAAAGG CGAGGCTCGT TCGCGCAACC AAGCCAAACT CGATGCCCAA TGGCACATTA TGCGCAATGC TTTTGAGCAA GCTGCCGCTA AATTTGGCGC GACAGTTGAA ATTGAACACA AACAACATTA TGGCCCCAGC GTGCTAGCCC CCGATGCGGA AATTGTGCAG TTGCTCAATC AAGCGATTCG GGCGATTGGG CTTGAGCCTT CGTTGGTGGT CACGGGCGGC GGCAGCGATG TCAGCATTAT CAGCAATAAT GGCATCGAAA CTGCCAACTT GGCGATTGGC TACGAAAATA TTCACTCGGT CGATGAGTTT ATTCCGATTG TGCAACTGCA ACGAGCTGCC CAAATCGTCG AACAAATGCT CTTGATGGTT TAA
|
Protein sequence | MAINQERLLD TFCTLVRIDN PSGEEAAMAA HLIERCQALG LECEQDAIGN VIARLAGVGS PLLLNAHMDS VAPCHGKQPV IRDGNIYSAG DTVLGADDLA GVTAFLEGIQ AVLESGQPHR AIELVFTVQE ETGLYGARAL DYSKLQAKQG LAFDLNGDVG AICIGSPAHD SFTATITGVS AHAGVAPEKG ISAIEVAAHA IAAMPLGRLD DETTANIGSI HGGKANNIVP DSVVVKGEAR SRNQAKLDAQ WHIMRNAFEQ AAAKFGATVE IEHKQHYGPS VLAPDAEIVQ LLNQAIRAIG LEPSLVVTGG GSDVSIISNN GIETANLAIG YENIHSVDEF IPIVQLQRAA QIVEQMLLMV
|
| |