Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0315 |
Symbol | |
ID | 5732210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 376385 |
End bp | 377740 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277439 |
Product | hypothetical protein |
Protein accession | YP_001543095 |
Protein GI | 159896848 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAC AAACTCGCCA ACTTTGGCGC TCGACGCTGA GCCTGTTAGG AACGGTCGGA TTAATAGCCA GTAGTTTTTC AATCACCACT AGTTCAATTT CGGCCCAGAC GAGCGGCAGT GCCCAACCAT GTTTGAGTGG CGATCCATTC CAACTGGCCG CGCTCAAAAC CAAACTTAGC CAGGTCAATC CACCCAGCGG CGTTCAACGA GTCAATCGCA CATTGACGGT CGAATTTATC ACCACTGGCG TGTTGACTGA TGAAGCGCCA ACTATGCCAG TGCTCCCTAC CGAAGTTGGC GACGATGAGC CATACGAACG CGAGCCAGTG CCTGCTTCAA CTGCCACCTT CCGCGTGTTC AACACATTAA CTTTGAACGA ATTTCGGGTG GTGATGCAAG CCAGCACAGT CGGCACAATT CGCGATTGCT ACGAGCGCAA CGATTTGCTC AATGGCAAAA TACCCACCGA TTACACTGGC GATATTGATG CACCACCGCC ACCACAACGC AGCAAGCAAA CCGAAGTGTT CACACCGTTT GGCTGGAGCA ATGGCGACGA TAATCGTGAA TTAAAAACCA ACCATACCCA ATTTCCATTA CGCACAATCA GCCAATTTTC ACGGGTGAGC GGCAACCAAG ATTCCAACTG TACCGGGACT TTTGTAGGGC CACGTCACTT GATTACCGCC GCCCACTGTA TCAATCGTGA AGCAACCAAT GTTTGGTTTA CCACCAAAGT TACGCCTGGC CGGAATGGCA CGGGCACAGG CTCAGCACCG TATAACTCAA CCGTGATCAT GCCCAATCCA CAGCCACCAG TGGAATCATG GTATTGGACC TTTGAAGAAT GGCGCGATCC AAACCAAAAC AATCGCACTC GCTGGGACAT TGGGATGATC GTTGTACCTG ATCGTTTGGG CGATACCACT TCATGGATGG GCGTTGCTCC ACGTACAGCG ACCTATTTGA AGAATACAAC CAGCTATAAT CGTGGTTACC CTAACTGTAA TGGCGACGGA GCGACTCGTG GCAATGCTCC AGCTGGCTGT CAAGTTGCTC GAATGTATGG TGATCCTGGC AATTGTGGGG CACGTTGGTT CAAGAACCTT GATGGTGATG GTTGGTCACG GCGCTACGAT GTGAAATGTG ATGCCAGCGC TGGTCATAGT GGCAGCCCAG TTTATCATTA TGAATACAGC GCCCACCACG GCAAAGATAT TCCGGTTGTG TCAGCAGTGA TTATCACCGA AGAATGTTTC ACCTGTTCGA ATCTGAATTC ATATGTCAAT ACGGTGCGCC GCGTAACGCC TTCAGTCATC GACAACTATG TCGCCTTACG CGAAATCTTC AACTAG
|
Protein sequence | MQRQTRQLWR STLSLLGTVG LIASSFSITT SSISAQTSGS AQPCLSGDPF QLAALKTKLS QVNPPSGVQR VNRTLTVEFI TTGVLTDEAP TMPVLPTEVG DDEPYEREPV PASTATFRVF NTLTLNEFRV VMQASTVGTI RDCYERNDLL NGKIPTDYTG DIDAPPPPQR SKQTEVFTPF GWSNGDDNRE LKTNHTQFPL RTISQFSRVS GNQDSNCTGT FVGPRHLITA AHCINREATN VWFTTKVTPG RNGTGTGSAP YNSTVIMPNP QPPVESWYWT FEEWRDPNQN NRTRWDIGMI VVPDRLGDTT SWMGVAPRTA TYLKNTTSYN RGYPNCNGDG ATRGNAPAGC QVARMYGDPG NCGARWFKNL DGDGWSRRYD VKCDASAGHS GSPVYHYEYS AHHGKDIPVV SAVIITEECF TCSNLNSYVN TVRRVTPSVI DNYVALREIF N
|
| |