Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4367 |
Symbol | |
ID | 5736227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5578890 |
End bp | 5580641 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281528 |
Product | protease domain-containing protein |
Protein accession | YP_001547127 |
Protein GI | 159900880 |
COG category | [S] Function unknown |
COG ID | [COG5276] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000129346 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTTC AACGTTGGTC ACGTTATTTT AGCGTGGCAA TTTGCCTAAT CCTAATCGCT GCCTGTGCCA ATCAAAATGC GACAGCGATT CCTGCGACGG CAACAATCAA CAACAATGCT GGCGAGCAAG CTCGCTCTGA TAAAGATGGC GTGAATCAAA GACCAACCAA AACGCCGCGA CTCGCTGCCA CGGCGACTCC AACCATCACA GGGCCGCATT TCGAGCAAGT TGGCAGCTTG CGACTTAAGC CACCAACCAG CGGTCGCCAT GCTGATCTCA CGTTGTATAA CGACTTGGTG TTGCTTGGTA CGCAGCCTGG CTCATGCCCC GCCGAAAATC AAATTACATT AATTGATGTG AGCGATCCGG CTAATCCAGA GTTGGCGGGC TATTCACCAA GCGTCAAAAA TGCCTCACTT GAAGATATGG ATGTGGTGAG GATTGGCGAG CAAGATATTG CAGTTTTGGG GATTCAGCCC TGCCGTGCAG CAACCAAACC TGGCATTCAA ATTGTGGATA TCACAGATCC AACTGAGCCG CTCGAATTGG CTCGGTTTGA AACCAACCTT GGCGTACATG AACTCGATGT AACAATTACG GCTAGTGGAC AAGCCTTGGC TTTGCTTGCT GCGCCCACCA ATAACGCTTT TGGTACAACG CCCAAGCCCG AAGATCGTGG CGAATTGTGG ATTGTTGATA TTAGCGACCC TAGCCAACCA ACCATGCTCA GCCGTTGGGG CATCGATCAA AAGCCCGATT GGCAAGCACT TGATTATGCA GATCGGACTC GTGGCAATTT CCCAGGGATC TTTTTGCATA GCGTGCGGGC CAGCAGCAAC AGCCAACGCG CCTATCTCTC GTACTGGGAT GGCGGCGTGA TTATCTTGGA TATTCAAGAC CCCAATCAAC CAATCTACCT TGGCCAAACG CCCTATCCCG CGCTAGCCGA GGGCGATGCA CACTCTGTCG TCGATTGGAA TGATGGGCAA ATGTTGGCCT TGAACAACGA AGATTTTAGC AACGATCAGG CAAAAATTAG TCACCCAGCC TTGGCCGAGC CAGATTACGC CCACGAATTA CCATTTGGCG GCAAACTTGA TGCCCCGCTG AGTGCCAAGG TTTTAGCGCT TGGGCAGGCT TGCGATGCTG AAGCTGAATA TCCTGATTTC AAAGGTTTTT TTGTGTTGGC CGAGATGGCG GGCTGTTCGA TTGAGCAAAA GCTGCAAATT GCCCAAAACG GCGAGGCCGC AGCATTATTG ATTTATGCTA ATACACCATT TGAGGAGCTG CAATTTGGCG ATGATGTTGA TTTGATCGAT GATTTTGATC TGCCGATGTT TACCATTAGC AGCACAACCG CCGCTGCCTT GCTCAGCCAA CCTGAGGCTG AAGCAACGAT CGAAAGCTAT TTTGATGGTG GCGGCGCAAT TCAATTCTTC GATCTGAGCA ATCCCAGCCA GCCAGTTGAA GTTGGGCGCT ACAATACGCC AAATTCGATT AATGAGACGT TGCGATCCAA CCCAACTGTG CATAATTCTG AGGTGCAAGG CCAATATTTG TATGCCTCGT GGTATCAAGA TGGCCTGCGC ATGCTCGATA TCAGCGATCC CAGCCAGCCC AAATCAGTGG CAAGCTGGCC GCTGAACAAT TCGCCAAAAG TGGCCTTGTG GGGCGTACAA GTGCGCGATC AATTTGTCTA TGTCAGCGAT TTCAGTTATG GCTTGTATAT TTTGGAGTTT AAGGCCGAGT AA
|
Protein sequence | MLFQRWSRYF SVAICLILIA ACANQNATAI PATATINNNA GEQARSDKDG VNQRPTKTPR LAATATPTIT GPHFEQVGSL RLKPPTSGRH ADLTLYNDLV LLGTQPGSCP AENQITLIDV SDPANPELAG YSPSVKNASL EDMDVVRIGE QDIAVLGIQP CRAATKPGIQ IVDITDPTEP LELARFETNL GVHELDVTIT ASGQALALLA APTNNAFGTT PKPEDRGELW IVDISDPSQP TMLSRWGIDQ KPDWQALDYA DRTRGNFPGI FLHSVRASSN SQRAYLSYWD GGVIILDIQD PNQPIYLGQT PYPALAEGDA HSVVDWNDGQ MLALNNEDFS NDQAKISHPA LAEPDYAHEL PFGGKLDAPL SAKVLALGQA CDAEAEYPDF KGFFVLAEMA GCSIEQKLQI AQNGEAAALL IYANTPFEEL QFGDDVDLID DFDLPMFTIS STTAAALLSQ PEAEATIESY FDGGGAIQFF DLSNPSQPVE VGRYNTPNSI NETLRSNPTV HNSEVQGQYL YASWYQDGLR MLDISDPSQP KSVASWPLNN SPKVALWGVQ VRDQFVYVSD FSYGLYILEF KAE
|
| |