Gene Information |
Locus tag | Haur_5261 |
Symbol | |
ID | 5737219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | + |
Start bp | 34321 |
End bp | 35997 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641282425 |
Product | leucine-rich repeat-containing protein |
Protein accession | YP_001548016 |
Protein GI | 159901771 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
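The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Haur_5261.
length = gene_length_bp(34321, 35997)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Both results agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields listed in the record.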
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCAGA TTCATGCTGG GTATATTATG CCCGAGGATG CGACGCAGAC CACCCTTGAT TTTAGCCGCT TAAGCCTCAC AATCCTTCCG ACGATGCAGG ATGTTTCTTG GAATATTACG GCAATCGATT TATCCTATAA TAGCTTAACG ATGCTCCCTT ATGCGCTTCC GCGTGCGGCA TCGCTGAAAC GCCTTCTGTT ACGTGTTAAT CCGCTCACTG CGCTTCCTGA ATGCATCCGC GAATGCCACA ACCTTGAAGA ACTCTATGTC TCAGGCTGTC CCTTGACCAT GTTGCCAGAT TGGTTGGATG AACTAACGGC CTTGCGAATA CTGGAGATCA GTGACACCGC CATTCCTGAT TGTCCCTCGG TATTGCGTCG ATTGCCGAAT CTTCGGATGC TTGGCATCGC AAATCTCCCA TGGACAACCC TACCGCCATG GTTTTGTGAC TTACCGCTGA CAACCCTCAC TATCGATGGC ATGCCTAGAT GTGATTGTTC TCCCCTCGTA GGATTAAAAC AGCTTCAGCA TCTTGGCCTC AGTGCCATGG ACTATACCAT TGTGCCTGAA TGGATTCGAC AATTGCCTTT GTTACACCTG CTTGATCTCA GTCATAATCC CCTTGAAATA CTCCCATCTT GGCTAGAAAC CATCCCTATA ACAACCCTCA TGCTTGCCCA TGTTCCATTA GCGGCGCTTC CCGATTGGCA TACGTGGGAT CGGTTAACCA CACTCGATTT AACGGCATGT CATTTGGCTG ATGCTGCCTT TCGAATGCCC TTGCCGCGCA ATTTAACCCA CCTGAATCTT GATCAAAATC CCATTACTCA GCTTCCATCA GAGCTTTACC AGTGTCATTC ACTCTACGCT TGTAGTCTTG CGAATACCGC TCTAACCACC CTTCCAGCAT GGTTTTTTGA AGATCTTCCA CTTGTATCAC TCGATATTTC AGGCACCAGC CTCACGTTTC CGGCGCTCTC CCAGCGATCG ATGCTGGAAT CATTCATCTT TGGCATGGGG AAAACGTCCG CTTGGCCATA CCTTCTCACA CACATGCCGA CATTACGCGT GCTTGATCTT TCTGATACGT GGTTGCAGTC TCAAAGTCCA TCTATGGGTC ACGCGCTCTT TCCGCAGCTT GAGACATTTC GTGGACCACG CGATCAAGAC CGTGTTCCTT TAATAGGAGC TATGCCAAAT CTGCGTGTGG CAGTGTTAAG TGGTGGATTA TCACGAGGTT CGCGTGAGTA TCTTTCTGCA CTCTTAGGAA AGAGTCCTCA GATCCAAGCA CTTGACCTTT CGCGTTGGCA TTGCAATCCG ATCCCTTCTA CCCTTGTCGA TCTTGCCGAG TTGCAGACCT TGAATATTGC GCATAAGCAC CTCGATCAGG TTCCTGGATG GGTGAACGAT ATGCCATACC TTAAATCACT GGATCTTTCT GATAACCGAT GTACTGACAT ACCACGATGG ATGAGGAACA TGACACACCT TGAATCGCTT GATCTTTCCG GGAATCCTTT ACAGACCTTT CCCTCATGGT TAAAGGATAT CCCAACGCTC AGAGATGTCG CATTCATGTT TCCATCAGTC AACCTTCAGT GTGATCACGT CCTCCCAGAA TTTCTGGCGG CAGGGATTCG CCTGGATGTC CAATATCCCC GTGATGACGC TGAATGA
|
Protein sequence | MYQIHAGYIM PEDATQTTLD FSRLSLTILP TMQDVSWNIT AIDLSYNSLT MLPYALPRAA SLKRLLLRVN PLTALPECIR ECHNLEELYV SGCPLTMLPD WLDELTALRI LEISDTAIPD CPSVLRRLPN LRMLGIANLP WTTLPPWFCD LPLTTLTIDG MPRCDCSPLV GLKQLQHLGL SAMDYTIVPE WIRQLPLLHL LDLSHNPLEI LPSWLETIPI TTLMLAHVPL AALPDWHTWD RLTTLDLTAC HLADAAFRMP LPRNLTHLNL DQNPITQLPS ELYQCHSLYA CSLANTALTT LPAWFFEDLP LVSLDISGTS LTFPALSQRS MLESFIFGMG KTSAWPYLLT HMPTLRVLDL SDTWLQSQSP SMGHALFPQL ETFRGPRDQD RVPLIGAMPN LRVAVLSGGL SRGSREYLSA LLGKSPQIQA LDLSRWHCNP IPSTLVDLAE LQTLNIAHKH LDQVPGWVND MPYLKSLDLS DNRCTDIPRW MRNMTHLESL DLSGNPLQTF PSWLKDIPTL RDVAFMFPSV NLQCDHVLPE FLAAGIRLDV QYPRDDAE
|
| |
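The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a short sketch. It assumes the sequences are pasted in the 10-character blocks shown above, so whitespace is stripped first. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons; since this gene starts with ATG, a plain codon lookup is enough here. The helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        c1, c2, c3 = dna[i:i + 3]
        # str.index raises ValueError on any base outside T, C, A, G.
        idx = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        aa = AMINO_ACIDS[idx]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGTATCAGA TTCATGCTGG GTATATTATG"
print(translate(prefix))  # MYQIHAGYIM
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared in the same way.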