Gene Information |
Locus tag | Haur_2056 |
Symbol | |
ID | 5733944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2566000 |
End bp | 2567679 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641279198 |
Product | hypothetical protein |
Protein accession | YP_001544825 |
Protein GI | 159898578 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
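The coordinate, gene length, and protein length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is only an illustrative check: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2566000, 2567679)
print(length_bp)                  # 1680, matching "Gene Length"
print(protein_length(length_bp))  # 559, matching "Protein Length"
```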
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.809448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCGT CTTCGTTTGA TCCGTTCGAT GGTCGCCTGT ATCCACGGCG TACCCCATCC CAGACCATGC TTGAGTATCA GCATTTAGAT CTTGCGGTTG TGCCATCGCC GCTGTGCCCG CAGCGTCCGA TTACCTCTCT TGATCTTTCC CATAATCCGC TGGTCACATG GCCGAGCGAA ACGGCTGCAT TGCCATCCCT CACCCGATTA AATCTTGCCC ATACCACGCT CACTCAGCTT CCCGACCATC TTCGCGCGTG TACGCAGCTG GAAGAACTCT ATCTGTCAGG GTGTCCGCTG GAATGTCTGC CGTCGTGGCT TAGCGAACTT CCCCACCTGC GCGTGCTTGA TCTGTCGCAT ACGCGCCTCA CCATGGTGCC TGATGTCGTG CGTTCGCTGC CATCCCTCCA GGTGCTCAGT CTTTCCGGGC TTCCCCTTGC AGCCCTGCCG CCGTGGCTCG ATGCGTGCGC CCTCCACATG CTGTTCCTGC GGTCATTGAC GGCATGTGAC TTAAGCAGGG TGCGGGCTTG TTCGACGCTC GAATACCTTG ATCTCGGACA CCTGGACTTG ACCCAGGTTC CCGACTGGAT TCAGGGATTA CCCCGATTAC AGCAGCTGGA TCTCTCGGAC AATCCGATCA CGGAGCTTCC CGCGTGGGTT GGCGATCTGC CGCTCACGAC GCTCCATCTT GCCCAGACGC GACTTCAGCA CCGTCCCGAT TGGGAGGCAT GGACGATGCT GCGTGACCTG AACCTCAGCG GAATGACCCA TGATCCTGCC GTCTTTGCGG GGGCATTTCC CGCATCCTTG ACAAGCCTGA AGCTGTACGA CACGGCCTTA ACCGCAATCC CTCCCTTCGT TCGCAACCTC CAGCATCTTG AAACGCTCCG GTTTGACAAC AATGCATCCT TGTCGCTTCC AGCATGGCTC CTGGAGGAGT GCCCATTAAA AACGCTCGAA CTGATCAACA CCCACATCAC CGAAATTGCC CCGGTCGCGC AGCCCATCGC CTTAGAACAC CTGATCATCA CGGCTGGCCG TCTGCCCACG TGGCCGACGC TCCTTGACTA TACGCCACAC CTGCGGACAC TCGATTTGTC GGAAACGCGG ATCGTCGATG CCACCTGTCC ATCGCCGTGT GTGCTTCCCC GATTAGTAAC GCTCGATCTT CAAGGCGATG CGATCGCGCA GCTGCTCCCG CAGCTGGTCG TTCCCATGCT GCAACGATTG ACCATCGCCA ACTGTTGGGA CGCAGACCTA ACCGCCGTGC TTCAGCAGGT CGGTCAGGTG AAGAATCTTG CCATCTTAAA CTGTTCAGGG ACGGTACCGG AGGGGCTGCG ATCATGGACC CACCTCCAAA CACTGAATAT GGGTCATAAT GGGTTGCGTG AGCTACCACG TTGGATCAGC GAATTGGAAC ACCTTGAATC GCTCAACCTC GCCTATAATG ATCTTGCACG ACTCCCGCTC GCCGTGCGGG AGCTTTCGCA GCTACATACG CTCGATATCA CGGCGAATCC GCTGCGGAGC TTTCCTGATT GGCTCCATAC CATGCCACAG CTGCATGCTA TCGAGTTTCA ATTTCCACCG GATGACCTCA CCCTGCATGA TCATCAATTG CAATTTCTGG CCGCTGGAGT GCGCTGCAAT GTCCGTTCAC CGCGACCACG GAAAGCTTAA
|
Protein sequence | MMPSSFDPFD GRLYPRRTPS QTMLEYQHLD LAVVPSPLCP QRPITSLDLS HNPLVTWPSE TAALPSLTRL NLAHTTLTQL PDHLRACTQL EELYLSGCPL ECLPSWLSEL PHLRVLDLSH TRLTMVPDVV RSLPSLQVLS LSGLPLAALP PWLDACALHM LFLRSLTACD LSRVRACSTL EYLDLGHLDL TQVPDWIQGL PRLQQLDLSD NPITELPAWV GDLPLTTLHL AQTRLQHRPD WEAWTMLRDL NLSGMTHDPA VFAGAFPASL TSLKLYDTAL TAIPPFVRNL QHLETLRFDN NASLSLPAWL LEECPLKTLE LINTHITEIA PVAQPIALEH LIITAGRLPT WPTLLDYTPH LRTLDLSETR IVDATCPSPC VLPRLVTLDL QGDAIAQLLP QLVVPMLQRL TIANCWDADL TAVLQQVGQV KNLAILNCSG TVPEGLRSWT HLQTLNMGHN GLRELPRWIS ELEHLESLNL AYNDLARLPL AVRELSQLHT LDITANPLRS FPDWLHTMPQ LHAIEFQFPP DDLTLHDHQL QFLAAGVRCN VRSPRPRKA
|
| |
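The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, translate the DNA, and compute GC content using only the Python standard library. It is a minimal illustration, not the database's own method. The amino-acid string is the standard NCBI codon ordering (bases in T, C, A, G order), which translation table 11 shares with table 1 for codon-to-residue assignments; table 11 differs in which codons may act as starts, and that is not modelled here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence shown above.
first_blocks = "ATGATGCCGT CTTCGTTTGA TCCGTTCGAT"
print(translate(first_blocks))  # MMPSSFDPFD, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would let the Protein sequence and GC content fields be cross-checked against the nucleotide data.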