Gene Information |
Locus tag | Haur_1722 |
Symbol | |
ID | 5733609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2004947 |
End bp | 2006086 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278864 |
Product | extracellular solute-binding protein |
Protein accession | YP_001544493 |
Protein GI | 159898246 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
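The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon while Protein Length does not. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Haur_1722.
length = gene_length(2004947, 2006086)
print(length)                           # 1140
print(expected_protein_length(length))  # 379
```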
Plasmid Coverage Information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000849376 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACTTG TGCGTTCTTT GATGCTGTTG CTCTGTCTGA GCATTGTGAT TGCAGCCTGT GGTGGGGAAG CAACTCCAAC GGCTGTAACA GCACCAACCA CCGCTGCAGC AACCCCAACG ACCGCTAGCG CAGCCGAAGC TACTGCTACC GTCGAAGTGA CTGCTATTGC TGAGGTTTCG CCAACCGTGC CAACCACCAC CACCACTGAG CCAGCTGGCG GCAGTTTGGT TGTATATTCA GGGCGTTCAG AAAGCTTAGT TGCTCCCTTG TTGGAGCTTT TTACCCAAGA AACTGGGATT GCCGTCGAAG TACGCTATGG CGATACTGCC GAGATGGCCG CGCAAATTCT TGAAGAAGGC GATAATAGCC CAGCCGATGT CTTCTTCGGC CAAGATGCTG GTGCATTAGG CGCATTGAGC GATCGCTTTG TGCAACTTGA TCGGTCATTG TTGAACTTGG TTGATCCACG TTTTGTTTCA AGCGAAGGCA CATGGGTTGG TGCTTCGGGT CGCGCTCGGG TTTTGGTTTA TAACACCGAT GTCTTGACCA GCACCGATTT GCCAGCTTCG ATTCTTGATT TGACCAAACC AGAGTGGAAA GGTCGGGTCG CTTGGGCTCC AACCAATGGT TCATTGCAAG CCTTTGTCAC CGCATTGCGG GTAAGCCAAG GTGAAGATAT GGCCAAACAA TGGCTCGAAG GCATGATCGC CAACGACGTT AAAGTCTATG AATCGAACTC AGCGATTGTC AAAGCCGTTG CGAGTGGCGA AGTTGATACC GGCTTGGTCA ATCACTACTA CTTGTTTGCC TTGAAAAAAG AGCAACCAGA TGCCAAGGCT GCTAACCACT ACTTCGGCAA TGGCGATATT GGCTCGTTGG TGAATGTTGC TGGCGTGGGT GTGCTCAAAA CTGCCAAGAA TCCTGAAGCC GCTCAAACCT TTGCTAATTA TTTGTTGAGC GACGGTGCCC AACTCTATTT CGCCGATAAA ACCAATGAAT ATCCCTTGGT AGCTGGAATT ATTACCAACC CAGCAATCAA GCCATTGGGC GAAATTATTA CCCCAAACAT CGATTTGAGC AATTTGAAAG ATTTGCAAGG AACCTTAGAA TTGTTGCAAG ATGTTGGGGC GATCCAATAA
|
Protein sequence | MKLVRSLMLL LCLSIVIAAC GGEATPTAVT APTTAAATPT TASAAEATAT VEVTAIAEVS PTVPTTTTTE PAGGSLVVYS GRSESLVAPL LELFTQETGI AVEVRYGDTA EMAAQILEEG DNSPADVFFG QDAGALGALS DRFVQLDRSL LNLVDPRFVS SEGTWVGASG RARVLVYNTD VLTSTDLPAS ILDLTKPEWK GRVAWAPTNG SLQAFVTALR VSQGEDMAKQ WLEGMIANDV KVYESNSAIV KAVASGEVDT GLVNHYYLFA LKKEQPDAKA ANHYFGNGDI GSLVNVAGVG VLKTAKNPEA AQTFANYLLS DGAQLYFADK TNEYPLVAGI ITNPAIKPLG EIITPNIDLS NLKDLQGTLE LLQDVGAIQ
|
| |
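Both sequences above are displayed in space-separated blocks of ten characters, and the gene begins with GTG although the protein begins with M. Under translation table 11, GTG can act as an initiator codon and is then read as methionine. The sketch below shows one way to strip the display spacing and check the start and stop codons. The helper names are hypothetical, and the start-codon set is an assumed subset of the initiators that table 11 allows. Only the first and last blocks of the gene sequence are used as sample input.

```python
# Assumed subset of table 11 initiator codons; the full table lists more.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove display whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def has_start_codon(dna: str) -> bool:
    """True if the sequence begins with one of the assumed initiator codons."""
    return dna[:3] in START_CODONS


def has_stop_codon(dna: str) -> bool:
    """True if the sequence ends with a table 11 stop codon."""
    return dna[-3:] in STOP_CODONS


first_block = ungroup("GTGAAACTTG TGCGTTCTTT")  # opening blocks of the gene sequence
last_block = ungroup("ATGTTGGGGC GATCCAATAA")   # closing blocks of the gene sequence
print(has_start_codon(first_block))  # True (GTG)
print(has_stop_codon(last_block))    # True (TAA)
```

Applied to the full gene sequence, `ungroup` should return a string whose length matches the Gene Length field.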