Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2217 |
Symbol | |
ID | 5734104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2816608 |
End bp | 2817996 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279358 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001544985 |
Protein GI | 159898738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTTC CACCGACCAA TCAGCATGGG CCGCTAGCCC GCTGTATTGT GGTCAAATTC CACGATCAGC AGGCTCCGCT AATCTATACC GATCAAAATT TTCAAGGCTA TGTTGTTCAA CCGCTCTTTA ACAGCGTCGA TCCTGATCAA TTAGTTGGCT TAGTTCAGCA AGCCCAGCAA ACTAACCCAG AATATCAAGC ACCAAATTTT TTGGCCTTCT TTGCGATTGA TGTTGAGCCA GACCAAGATC CCTTTGCAAT CATTGAGGAA ATTAACCAAT GGGCAGAACT TGAGTATGCC TATGTTGAAT CGCCGCCAGC ACCTGTGCCC ACTGATGCTA ACCCTCGCCG CGCACGCCAA ACCTATCTCA ACCCACCAAG CAGCACTGGC CCAACCATTG GCGGCATCGA TGCTGAAGCG GCTTGGAAGG TTTTAGAGCA TGCAGGCAAA GCCATCACGA TTGTGGATAT TGAGAAATCA TGGCAATTAG AACACCCCGA TTTGCTGCAA CATGGCAGTT CGCCGATTAC GATCTTGCCC TCACTGCTGC ATTGTGATAT GCATGATCCC GCTTGTGCTG ATCATGGCAC GAATGTTTTA GGTGTGCTGG TGGCCCAAAA TAATACTGAG GGTGGAGTTG GAATCGCTCA CGACGCAGCA GCAGCAGTCA TTTCGCCCTG GCAAAAACCA AGCAATGGCA CCAACCAACC AAGCTGGAAT ATCGCCAATG CGATTGTCGC CGCAAGCAAC TACTTAACAA CACTTGAATT ATCAGGTAAT CTGATCCTGC TAGAATTGCA AATCTACCAA GATCTGGCTG GCGGACCCTA CACCAACACG CCCAATCAAC CAGGCAGATT GCTACCAGTC GAATTAGAAC CAGCCAATTT CGAGGCCATT CGGTTGGCGA GCGAACTGGG CATTATTGTG ATCGAGGCTG CTGGAAATGG CGCTAGCGAT TTGGCAACGT GTTGGGATAC CGTGGGAACA TATCAAATTG AGCCAGAAAC AGCCCGTTAT CGTGATTCTG GGGCAATTCT AGTTGGAGCG GTTTACAGTC GCGACCCCAA TAAAGCAACG CGAACCGCTA GCTCCAACTA TGGACAACGG GTCAATTGCT TCGCTTGGGG CAATGGTGTA TTTACCACCA ATGCTTCGGG CTATAGCTTA AGCTTTGGCG GAACATCAGC AGCAGCAGCG ATTATTGCCG GAGCCGCCAT TTTAGCGCAA GCAATTGGCG AACAGCTGCG CCAAGCACGA TTCAGCCCCG AGGAATTACG CAGGTTGCTT ACCCACCCAG ATGCCTGTAC CTATTCAGCC CAGCCCCAAC ATGATCGGGT TGGCGTTATG CCAGATCTAG GGCGCATTAT TGGCTTGTTG CAGGTTTAA
|
Protein sequence | MLFPPTNQHG PLARCIVVKF HDQQAPLIYT DQNFQGYVVQ PLFNSVDPDQ LVGLVQQAQQ TNPEYQAPNF LAFFAIDVEP DQDPFAIIEE INQWAELEYA YVESPPAPVP TDANPRRARQ TYLNPPSSTG PTIGGIDAEA AWKVLEHAGK AITIVDIEKS WQLEHPDLLQ HGSSPITILP SLLHCDMHDP ACADHGTNVL GVLVAQNNTE GGVGIAHDAA AAVISPWQKP SNGTNQPSWN IANAIVAASN YLTTLELSGN LILLELQIYQ DLAGGPYTNT PNQPGRLLPV ELEPANFEAI RLASELGIIV IEAAGNGASD LATCWDTVGT YQIEPETARY RDSGAILVGA VYSRDPNKAT RTASSNYGQR VNCFAWGNGV FTTNASGYSL SFGGTSAAAA IIAGAAILAQ AIGEQLRQAR FSPEELRRLL THPDACTYSA QPQHDRVGVM PDLGRIIGLL QV
|
| |