Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3484 |
Symbol | |
ID | 5735345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4386641 |
End bp | 4387996 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280631 |
Product | leucine-rich repeat-containing protein |
Protein accession | YP_001546248 |
Protein GI | 159900001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000424211 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGC TCGATCTGAA CCAAGTTGAT CAAGCCGCCA TGCTCAGCCT TGCCCGCAAC CCCCAAACCG ACCCACAACA ATTAATTGCG CTTACCGAGT GGCTTAAACT CCAACAAGGC GCTGATTCGG CTCAGCCTAG CACGAGTTTC GCTTACCTCA AGAGCCAAGC ACCGCAAGGC CCACTGGCAG TTATCTTAGA ACGCCGAACC TCGCCGCTAC TCGAAGCCTT GATTGCCAAC CCCAATCTTC CGCCAATGCT GGCTTTAGAA TTTGCCGCCG ATGTGCCAGC TGCATTTTTT GCCAATCCGG CATTGCCCAT ATGGTTGCAA CACGATCCTG CGTTGTTTAA GCGTATGGAA CCACTACGCT GTATGCAGTT TTTAAGCTAT CCAGCTATTC CCCAAGTTAT TTTAGCGTCG ATTCAAAGCA TTAGCCCTGA AATTGCTCAA ACCGTGCACC TGCATAGTGC TTCAAATCCT CAGCTTGATG CCGATTGGTA TGCCGACTAT CAGCATTACA GAGAGCAAGT GGCCTTGCCC GATGCCACAG CAACAACATT ACTGCAAGAA TTAATTGGCT TGAATGCGAT TAATCAGCCG ATGTTGAGCT GGCTACGCCA ATCACCAGCC GAGCAGCATC AAGCCTTATT TAATCGAGCG CCAGCTACGC CACAGCCTGT GATTGAGCCA CAAACGCTTA ACTTTACCCC AAACTATCCA CGATTGCTCG AATCTCCGTT GGCCGAACGA ATTCAAGTAG CGCATTCCAA TGATATTAAG GGCTTGGCAA TTTTGGCTGA AGACGACGAT CTTAGCATTC GACTGCTAGT GGCCCAAAAT CCGGCAACTC CGCGGACTGT TCATCAACTA CTGGAGCTTG ATGATTCGCA GCATGTGCGA GCGGCACTAG CGCGGAATCC CAACATTAGC CCAAAATTAC TGCTGACACT AGCGCGTGAT TACACGTGGT CGGCAGTGCC AATTCGTGTG GCAGCAGCCC TCAATCCAGT CGCAACTTCC GAAATTCTAG AGTTGCTGGC CCAAGATCAA GCCTCGTTGG TACGTCAAAC GGTTGGGCAA AACCCGCAGG CCTCAGCTGA AATACTTGAT CATGCCCGCC AGCGAGCACT AATCGAAGCC CTGTATGCGC TCGATCCCTG GCTGCATATG CTGGCATTGG GAAACCCCGC GACCCCAATT GAGCATTTGG CGAAGGGCGC TCGCTCCCCG TGGTGGTTGG GGCGAGCAGC TTTAGCCGAA AACCCGAGTT GCCCAAGCAA TGTGCTTGAG CAATTGACCA ATGACGGAAA TTGCTATGTG CAACGCTTGG CACAAACTCA ATTGAATGCT CGTTAA
|
Protein sequence | MNMLDLNQVD QAAMLSLARN PQTDPQQLIA LTEWLKLQQG ADSAQPSTSF AYLKSQAPQG PLAVILERRT SPLLEALIAN PNLPPMLALE FAADVPAAFF ANPALPIWLQ HDPALFKRME PLRCMQFLSY PAIPQVILAS IQSISPEIAQ TVHLHSASNP QLDADWYADY QHYREQVALP DATATTLLQE LIGLNAINQP MLSWLRQSPA EQHQALFNRA PATPQPVIEP QTLNFTPNYP RLLESPLAER IQVAHSNDIK GLAILAEDDD LSIRLLVAQN PATPRTVHQL LELDDSQHVR AALARNPNIS PKLLLTLARD YTWSAVPIRV AAALNPVATS EILELLAQDQ ASLVRQTVGQ NPQASAEILD HARQRALIEA LYALDPWLHM LALGNPATPI EHLAKGARSP WWLGRAALAE NPSCPSNVLE QLTNDGNCYV QRLAQTQLNA R
|
| |