Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0968 |
Symbol | |
ID | 5732854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1110098 |
End bp | 1111486 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278100 |
Product | stage II sporulation D domain-containing protein |
Protein accession | YP_001543744 |
Protein GI | 159897497 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3409] Putative peptidoglycan-binding domain-containing protein |
TIGRFAM ID | [TIGR02669] SpoIID/LytB domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00176967 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGCA GGTTAACATG GCGTTGGTTG GGTTTGCTCA GTTTGGCCTT AGGTTTGCTG ATTCAAGCAC CTGTTTCGGC GCAAACTCGC AATCAGCAAC GAGCTAATTT AACGGTAACG GTCACCAATA TTGATGGCAA GCCCTTGAGT AATGCCAGCG TTTCGGTAGC GAGCTTAGGC CTAACAACGA CTACCAACGA AGCAGGCATC GCGCATTTTA CAGTATCAAC CAACGATAGC CAAGTGATTG AGGTGGCGGT TGAGTCACAG GGCCATCGTG CTTGGCGCTT ACGTAATGCC CAGTTAATTC CTAATGATAC GCTGTTGGTT GATGCGCCCC TTTCGCTTGC CGCCAAAGCT GTTGATCAAC AACCTGAAAT TATTGAGGTT CAGGCCCATC GGTTAACCAC CAACGCGCCC CAAGAAGCTA ACCTGAATGA CGCTGAATTG AATCAAATAA TGGCTGGCAC AAATACCACG CCGCCAAGCA CGATTCGGGT CTATCGCGTC AGCCTTGGCC GAATTGATAC AGTTAATTTC AAAACCTATG TTAAGCACGT ACTGCCCAAT GAATGGGTGG CGGGCTGGCG CACTGAATCG TTGCGTTCGG GCGCGATGGC TTCGAAAACC TATGCCTGGT ATCGCACGAT GTACCCCAAA TATCCGGGCA AAGGCTATGA TACCAAGGAT ACGACTGCCG ACCAAGTGTA CAATCCCAAT GTCTCATATG CTAGCACCAA CGCCGCAGTT GATGATACCT GGAACTATCG CTTGATCAAA AATAATGCGA TTTTCCAAAG CCAATATTGT GCTGGCTCCT ACAATGGCAG CCGCACGTCG GGCCAGTGCA GCCAAAATCA TGGTTGGACG GTTGGCTTGT ATATGAGCCA ATGGGGCTCG AAATATCTGG CCGATAATGG CTCAAGCTGG CGTTCAATCC TAACGTTCTA CTACGATAAC GTCACAATTG GCACGATCTC TGGTGGCACA ACGCCTAGTT TGCCAGCATG GCCAAGCTTG CGCAACGGCA GTTCGGGCAA CGATGTTAAG GCTGCTCAGT ATTTATTGCG TTCGCATGGC TATTCGCTGA CTGCTGACGG CGCATTTGGC GCAGGCACCG AACAAGCAGT GCGCAGCTTC CAAAGTGCCA ATGGCTTAAC CGCTGATGGA ATTATTGGCC CGCAAACCTA TGCCAAACTG ATCAAAACCG TGCAAAATGG TAGCAGCGGC GATGCGGTTC GGGCAATTCA GACCCTGCTT GGTGTAACGG TTGATGGGGC ATTTGGCGCA GGCACCGAAC AAGCAGTGCT CAATCTGCAA GCAACTTATG ACCTGACCCG CGATGGAATT GTTGGGCCAG TTACTTGGCA AGCAGCTTTT GGGAAGTAA
|
Protein sequence | MLRRLTWRWL GLLSLALGLL IQAPVSAQTR NQQRANLTVT VTNIDGKPLS NASVSVASLG LTTTTNEAGI AHFTVSTNDS QVIEVAVESQ GHRAWRLRNA QLIPNDTLLV DAPLSLAAKA VDQQPEIIEV QAHRLTTNAP QEANLNDAEL NQIMAGTNTT PPSTIRVYRV SLGRIDTVNF KTYVKHVLPN EWVAGWRTES LRSGAMASKT YAWYRTMYPK YPGKGYDTKD TTADQVYNPN VSYASTNAAV DDTWNYRLIK NNAIFQSQYC AGSYNGSRTS GQCSQNHGWT VGLYMSQWGS KYLADNGSSW RSILTFYYDN VTIGTISGGT TPSLPAWPSL RNGSSGNDVK AAQYLLRSHG YSLTADGAFG AGTEQAVRSF QSANGLTADG IIGPQTYAKL IKTVQNGSSG DAVRAIQTLL GVTVDGAFGA GTEQAVLNLQ ATYDLTRDGI VGPVTWQAAF GK
|
| |