Gene Information |
Locus tag | Haur_0350 |
Symbol | |
ID | 5732260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 420701 |
End bp | 421642 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277473 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001543129 |
Protein GI | 159896882 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
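The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 420701
END_BP = 421642
GENE_LENGTH_BP = 942
PROTEIN_LENGTH_AA = 313


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the length is a multiple of three and, by default, that the
    last codon is a stop codon that does not contribute a residue.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the span covers 942 bp, which is 314 codons, one of which is the stop codon.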
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000721097 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGTT TCATCCAGCT AACCGTGTTG CTCACGCTCT TGATGACCAT TTTAGCCGCT TGTGGCGATG ATGCAGCCAG CAGCACGACA GCTCCCAGTA GCGACCCAAA TCCCGCCCCA AGTAGCAATC AAGTAATTCG GATTGCCTCG AAGAATTTTA CCGAACAGTT TATTGTGGGC GAAATGTATG CCCTCTTGTT GGAACAAGCA GGCTACAAAG TTGAGCGCAA AATTGGCTTA GGCGCAACCG ATGTAGCCCA AGCTGCCATG GAAAAAGGCG AGATCGATAT TTATCCTGAG TACACTGGCA CTGGCTTGTT GACTGTACTC AAATTGCCAA CCCAAACCGA TCCCAAAGCG GTCTATGAAA CGGTCAAAAA AGGCTATGCC GAGAAATATC AAATTGCATG GCTCGACCCT GCGCCAATGA ATAATACCCA AGCCTTGGCC ATGACCAAAG AAGGTGCTCA GAAATTTGGC ATCATCACAA TCTCCGACAT GGCCGCCAAA GCCGGCGAAC TGCGCATGAT CGGCCCACCC GAGTTCCAAG AACGAGAAGA TGGCTTGCCT GGCATTCAAG CCAAGTATGG CAACTTCGAA CTCAAGGAAT ATGTACCAGT CGATCCAGGC CTCAAATACA AGGGCTTGGT CGAAGGCAAT GCCGATGTGG TGGTGGCCTT TGGCACGGAC GGCGAAATTG CCAATTATGG CTTGGTCGTG CTCAAAGATG ATCGCAACAT GTTCCCGCCC TACCAAATCG CTCCGTTGGT GCGCCAAAGC GTTTTGGATG CTAACCCCAA ATTGGCTGAA TTGCTGAATG CCTTGGCTCC CAAACTTGAC GATGCGACCA TGCAACAGCT CAATTTTGAA GTCAGCGGCA ATAATCGCAA ATACGAAGAT GTGGCCAAAG AGTTCCTAAC CACCCAAGGT TTGTTGAAAT AG
|
Protein sequence | MRRFIQLTVL LTLLMTILAA CGDDAASSTT APSSDPNPAP SSNQVIRIAS KNFTEQFIVG EMYALLLEQA GYKVERKIGL GATDVAQAAM EKGEIDIYPE YTGTGLLTVL KLPTQTDPKA VYETVKKGYA EKYQIAWLDP APMNNTQALA MTKEGAQKFG IITISDMAAK AGELRMIGPP EFQEREDGLP GIQAKYGNFE LKEYVPVDPG LKYKGLVEGN ADVVVAFGTD GEIANYGLVV LKDDRNMFPP YQIAPLVRQS VLDANPKLAE LLNALAPKLD DATMQQLNFE VSGNNRKYED VAKEFLTTQG LLK
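The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative example, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The sequences in the record are printed in space-separated blocks of ten, so whitespace is stripped first.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence from the record.
    print(translate("ATGCGACGTT TCATCCAGCT"))
```

Passing the full gene sequence from the record to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.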