Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3500 |
Symbol | |
ID | 5735361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4407864 |
End bp | 4408931 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280647 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001546264 |
Protein GI | 159900017 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0791674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTAT ATCTCATTCG GCGCTTGTTC CAAGCGATTC TCGTCCTTAT TTTATCGTCA GCAGTGATTT ACTCGCTGTT TGCCCTGGCT CCTGGTGGCC CACTTGAGGA GCTAACCCAA GTTACCGACC CCAAGAATCG GCCTAGCCCC GAAGATATTC AACGTCAAAT CAAGTTGCTA GGGGCCGATA AGCCTTGGTT CTTGTGGTAT CCAACTTGGC TCGCTGGTGA TACCTGGATG GATAAAATTG GGTTTGAAGA ATATCAGGGC GAACGCAAGG GGATCTTGCG CTGGGATTGG GGTACATCGT GGAAGTTTCA ACGTAATAAG CCAGTCTTGG AGATTATTGG CGATAAATTG CCCGATACCC TGTGGTTGAT GATTTCATCA ACAATTATTT CGTTGGTGCT GGGCATTCCG CTGGGGGTTT TCTCAGCAGT GCGCCAATAT TCTTTTTTTG ATTATGTGTT GACCACATTT AGCTTTATTG GCTTATCACT GCCGGCCTTC TGGTTTGGTT TGTTGATTAT CGCGGTATCG CTGTGGTTTA AACGCAATGG CTGGTTCTAC TTCCCCGCTG GCGATATTCT GGCCCTGCGT AATTACGAGG TTCCGATTCT TGGCACGGTC GTCGCTGGCT CGTTGCTTGA TCGGGTGATG CACTTGGTTA TGCCTGTTAC GGTGCTTTCA ATGCTCAACT TGGCCAACTG GAGCCGCTTT ATGCGGGCGA GTATGCTTGA AGTGTTGAGC CAAGATTATG TGCGGACTGC CCGCGCTAAA GGGGTCAAAG AACGCGTCGT GATCTACAAG CATGCCTTCC GCAATGCCTT GATTCCATTG ATCACGATCA TCGTCTTTGC GATTCCTGGG GTGTTTGGTG GCGCACTGTT TACCGAAACA GTCTTTAATT ATAAAGCGCT CGGCTTTACC TTTATTAGCG CTCTGAACCT CAAAGATTAT CCTTTGGCGA TGGCCTTCTT GCTGATTTCG TCGATCTTGT TGGTGTTTGC GACGTTGCTG GCGGATGTGC TCTATACCAT TGTTGACCCA CGAATTCGAC TTGACTAG
|
Protein sequence | MTVYLIRRLF QAILVLILSS AVIYSLFALA PGGPLEELTQ VTDPKNRPSP EDIQRQIKLL GADKPWFLWY PTWLAGDTWM DKIGFEEYQG ERKGILRWDW GTSWKFQRNK PVLEIIGDKL PDTLWLMISS TIISLVLGIP LGVFSAVRQY SFFDYVLTTF SFIGLSLPAF WFGLLIIAVS LWFKRNGWFY FPAGDILALR NYEVPILGTV VAGSLLDRVM HLVMPVTVLS MLNLANWSRF MRASMLEVLS QDYVRTARAK GVKERVVIYK HAFRNALIPL ITIIVFAIPG VFGGALFTET VFNYKALGFT FISALNLKDY PLAMAFLLIS SILLVFATLL ADVLYTIVDP RIRLD
|
| |