Gene Information |
Locus tag | Haur_3501 |
Symbol | |
ID | 5735362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4409247 |
End bp | 4411166 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280648 |
Product | extracellular solute-binding protein |
Protein accession | YP_001546265 |
Protein GI | 159900018 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
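The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4409247
END_BP = 4411166
STOP_CODON_LEN = 3  # assumption: one terminal stop codon, not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    return (gene_len - STOP_CODON_LEN) // 3


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1920 639
```

Under these assumptions the computed values agree with the Gene Length (1920 bp) and Protein Length (639 aa) rows.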
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000268642 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCGAT CAAAGCGCCA ATTGATGAGC TTTGCGCTCA TGTTGGTCCT TGTTGTTCCG ATCCTTGCTG CTTGTGGTGG CGAAACAACT CCAACCACAG CTCCAGCAAC GACTGCCCCA GCAACCGCAA CTACAGATAC CTCATCAGCA GCAACTGCTG AACCAACTGC TGCTGAAGCA ACCGTCGAAC CAACCGTTGC TGAAGCAACC GCCGAACCAA GCACCGGTAC TACTCCTGAT ATGGATAAAA CCATCATCAT CGGTATGACC CAATCGCCAG ATACCTTGTT CGGCATTGAA TCACAATCAA GCGCCACGAC CCAAGTGTTG TCAGCGATTC AACCAGCTTG TTTCACCACT TTGAGCTACG AATATCAACC AGTTTGTTTC ACCAAATTGC CAAGCTTCGA AGATGGCGAT GCAGTGACCC AAACCGTCTC AGTTGATAGC GCCTATGCTG GCAACATCGT GATCGATGAT GAGTTGATCA CCGATACCGC TAGCTTGACC GAAGCAATCG AATTGGAACA AGTTGTTGTG ACTTGGACCT TGATCGATGG CATGACTTGG GAAGATGGTA CGCCAATTAC CGCTGCCGAC TTCGTGTTTG CTGCTGAATT GTACCAAGAT CCAGGCATCA AGAACGCTAG CCGCTTCGTG CTTGATCGCA CCGAAAAATA CGAAGCCAAA GACGAAAAAA CCTTGGTATG GTACGCTGCA CCAGGTTACA CCGATGCAAC CTACTTCTTG AACACCTTTG GGCCTGAACC AAAGCACGTT TTGGAAGGCG AAGATCCAGC AACCATTGGT GGTAGCGACT ACGCTAGCAA GCCATTGGCA TATGGCCCAT ACAAGATTGC TGAAAACACC CCACAAGAAA GCACCAAGTT GGTTGCAAAC GAAACCTACT GGAAAAAAGG TTTCCCTCTC GTTGGTAATG TTACCTTCAA GTATCTGACC AGCGAAGATC AAGTGTTGCA ACAATTGGAA AGCGGCGAAA TCGACGTAGT TGGTTCAATT GGTTTGACCT TGGCTAACGC TCCTAAGCTC GACGAACTCG AAGCTGCTGG CGTGCTCAAG GGCCAATATG TTCCAGCAAC CGTGTGGGAA CACATGGACT TCGGTGTCGA GCGCAACGAC GGCCAACCAT CAGTATTCGC TGATGTCAAG TTGCGCCAAG CTGTTGCTTA CGCTGTCAAC CGCAAACAAA TCATCGATAA CGTCTTGTTC GGCAAGACCG TTGTGATGAA CACCTTCTTG CCAGCCGACC ACTGGGCTTA TCCACCAAAC GGCGAAGGCT TGGAAGCATA CGAATATGAT GTAGAAAAAG CTAAGGCTCT CTTGGCTGAA GCTGGTTGGG TTGCTGGCGC TGATGGCATT CTTGAAAAAG ATGGCACCAA GCTCACCATC CAATTCTACA CCACCGAAAA CAACCAAACC CGCGAAGCCG TTGCTCAGTT GATCCAAGAA GACCTGAAAG CTGTTGGTAT CGATGTTACC TTGAACTTCG TTCCAGCAAC CGATGTCTTG TTCAAGAACG GCTCAGAAGG TATCTTGTCA GGCCGCCGCT TCGACTTAGG TTTGTACGCT TGGGTCAGTG GCCCAGAGCC TTCGACCGCT CTGTACCTCT GTGAACAAGT GCCAACCGAA GAAAACAGCT TTGGTGGTCA AAACAACACT GGCTGGTGTA ACCCAGATTA CGACAAGCCA GCCTTGGCCT CACAATCAGA AACCGACCGC GCCAAGCGGA TTCCTTTGGT CATCGAAGCT CAAAAAGTCT TCAATGCCGA ATTGCCAACC 
TTCCCATTGT ACCAACGTGT CAATGTTGGT GCCTACAACG TCAAGGTTAG CGGCTTGGAA TTGAACCCAA CCAGCCAAGT TGACTTCTGG AACATCGAAA CCTGGGATGT TACTGAGTAA
|
Protein sequence | MLRSKRQLMS FALMLVLVVP ILAACGGETT PTTAPATTAP ATATTDTSSA ATAEPTAAEA TVEPTVAEAT AEPSTGTTPD MDKTIIIGMT QSPDTLFGIE SQSSATTQVL SAIQPACFTT LSYEYQPVCF TKLPSFEDGD AVTQTVSVDS AYAGNIVIDD ELITDTASLT EAIELEQVVV TWTLIDGMTW EDGTPITAAD FVFAAELYQD PGIKNASRFV LDRTEKYEAK DEKTLVWYAA PGYTDATYFL NTFGPEPKHV LEGEDPATIG GSDYASKPLA YGPYKIAENT PQESTKLVAN ETYWKKGFPL VGNVTFKYLT SEDQVLQQLE SGEIDVVGSI GLTLANAPKL DELEAAGVLK GQYVPATVWE HMDFGVERND GQPSVFADVK LRQAVAYAVN RKQIIDNVLF GKTVVMNTFL PADHWAYPPN GEGLEAYEYD VEKAKALLAE AGWVAGADGI LEKDGTKLTI QFYTTENNQT REAVAQLIQE DLKAVGIDVT LNFVPATDVL FKNGSEGILS GRRFDLGLYA WVSGPEPSTA LYLCEQVPTE ENSFGGQNNT GWCNPDYDKP ALASQSETDR AKRIPLVIEA QKVFNAELPT FPLYQRVNVG AYNVKVSGLE LNPTSQVDFW NIETWDVTE
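The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last codons. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the amino acid assignments of NCBI translation table 11 (the value in the Translation table row), and it assumes the gene sequence is listed in coding orientation (5' to 3'), which is consistent with it starting with ATG and ending with TAA even though the Strand field is "-". The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, with codons
# enumerated in T, C, A, G order at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First 30 and last 15 nucleotides of the gene sequence in this record.
    head = "ATGCTTCGAT CAAAGCGCCA ATTGATGAGC"
    tail = "GATGT TACTGAGTAA"
    print(translate(head))  # prints: MLRSKRQLMS
    print(translate(tail))  # prints: DVTE*
```

The two fragments reproduce the start (MLRSKRQLMS) and end (DVTE, followed by the stop codon) of the protein sequence listed in this record. Passing the full 1920 bp gene sequence to `translate` in the same way would allow a complete comparison.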