Gene Information |
Locus tag | Haur_4509 |
Symbol | |
ID | 5736360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5773696 |
End bp | 5774787 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281672 |
Product | hypothetical protein |
Protein accession | YP_001547269 |
Protein GI | 159901022 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
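
The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5773696, 5774787)
print(length)                  # 1092
print(protein_length(length))  # 363
```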
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0202625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCAAA CCATTGTTCG CTATCGTTGG CTGTTGGTCT TCGTTGGTTT TTTGGTAGCG GTCCCAATCG CTGCTCAAAC GGCGGAGAAA AATAAAGGGG CTGATTGGTT GCTGACCCAA CAGCAGTCGG ATGGCAGTTT TGATTCGTAT TATCACTCTC CACTTGATCC AACGGCATGT TCGGTGTATG CCCTCCATGC GGCGGGCTAT CAAATTGATC CAGCCACGCA ACGCTTTATT GAGCAACAGG CTCAGAGCTA TATTGGGCAT CCGGCTAAAG CTTCGGCCAT TGTCATGGCT CAATTGCTAA CTGGTCATGA TCCCCGTTCG GTTGGTGGGG TCGATTTGGT TGAAGCAATC ATCAAGAGCT ACGATCCTGC AACGGGCATG TATGGTGAAA ACTTGTATGA GAATTCCTTG GTAATGATGG CCTTGAAAGC TGCTGGCGAG GCGATTGAGC CACAGGCAAT TCAGACGATT CTTGATCAAC AGTTGGCTGA TGGGTCGTGG AGTACTAGTA CGCAAAACAC CGCCTTACAA ATTCAGGCCT TAGTCGCGGC TGATCAAAAG CAATCTGCGG CAATTCCGGC AGCTTTGGCC TTCTTGCAAA CTCAGCAAGA TTTTGATCGC GGGTTTATGA ATAATCGCGA CTTTGAGCCG ACAGCGTTTA AGGATGCAAT TTCAACGTCC TTGGCCATCC AAGCTATCTT AGCAACTGGT GGCGACCCCA AGGCTGAACC ATGGGGCGAT AATATCAATA ATCCTATCAA TGCTTTGCGG CGTTTGCAGC TTGCTGATGG TGGCTTTCGG CTCGATTCAT CGACCCCACA GCCTGATACG ATGTCAACTT GTTCGTCTGT TCAGGCTGTG CTGCAAAAAA CCTATCCATT TATTGAACTT GCCAACATTG GCATTACGCT TACCCCGACC TTAGCTCCTG CCGATGGCTC GACAGCCACT CCCGTTCAGC AACCTGGCCT ACCTGGGGTG TTACCAGACA CTAGCTCAAG CTCAAATCTG GCCTTGCCAA TTTTGCTTGG CGTATTGGCT GCCTGTGTAT TGGTTGGCCT ACGTTTGCGC AAATTGGCCT AA
|
Protein sequence | MLQTIVRYRW LLVFVGFLVA VPIAAQTAEK NKGADWLLTQ QQSDGSFDSY YHSPLDPTAC SVYALHAAGY QIDPATQRFI EQQAQSYIGH PAKASAIVMA QLLTGHDPRS VGGVDLVEAI IKSYDPATGM YGENLYENSL VMMALKAAGE AIEPQAIQTI LDQQLADGSW STSTQNTALQ IQALVAADQK QSAAIPAALA FLQTQQDFDR GFMNNRDFEP TAFKDAISTS LAIQAILATG GDPKAEPWGD NINNPINALR RLQLADGGFR LDSSTPQPDT MSTCSSVQAV LQKTYPFIEL ANIGITLTPT LAPADGSTAT PVQQPGLPGV LPDTSSSSNL ALPILLGVLA ACVLVGLRLR KLA
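
The relationship between the two sequences above can be checked with a few lines of plain Python. This is a minimal sketch, not the database's own method: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs mainly in which codons may act as start codons), and it handles only the ATG start seen in this record. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, copied from the record.
prefix = "ATGCTTCAAA CCATTGTTCG C"
print(translate(prefix))  # MLQTIVR
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be compared against the nucleotide sequence.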