Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0235 |
Symbol | |
ID | 4711091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 270005 |
End bp | 270865 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854695 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001001831 |
Protein GI | 121997044 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00289978 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA TGACCACCGC TGCCGCCACC GCGGCGCTGA CCGTGGGCAT GGCCAGCGCG GCCAACGCCG ACGAGATCCG CATCGGCTGG ACGGCCTGGG CCGACGCCGA GTTCGTGACC CGCATGGCCG AGGACGTCAT CACCAGCGAG TTCAACACCG ACGTCGAGCT CGTCCAGACC GACATCGCCC CGCAGTACAC GGGCCTGGCC CAGGGCGACA TCGATCTGAT GCTGATGTCC TGGCAGCCGA CCACCCACGA GGACTACGTC GAGCAGTTCG GCGACGACAT CGTCGATCTC GGCGTCCTGT ACGGCGACGC CGCCCTGGGG TGGATCGTGC CGCAGGACCT CTACGACGAG GGCCTGACCT CCATCGAGGA TCTCAAGGAT CCGGAGTGGC ACGACCGGCT GGACGGCCAG ATCCAGGGCA TCGACCCGGG CGCCGGCCTC ACCCGCCTCT CCCACGACGT CATTGAGGAC TACGACCTCG ACTACAACCT GATCGAGGCC TCGGACTCCG CAATGACGGC GGCACTGAAC CGGGCTGCTC GCCGCGGGGA CGCCATCGTG GTCACCGGCT GGAGCCCGCA CTGGAAGTTC GGCGCCTGGA ACCTGGCCTA CCTGGAGGAC CCCAAGGGCA GCCTGGGCGG AGCCGAGTCG ATCCACGCCA TGGCCCGTCA GGGCTTCGAG GGTGACCACC CCGAGGTGGC CAGCTTCGCC AGCTGCATCG AGATCGACAT CGACCTGCTC AACGAGTACA TGTACCTCGG TCGCGAGGAA GGCACCGAGG AGGCTGTTGA GCAGTTCCTC GACGCGGAGG CCGACCTGGT CGATCAGTGG GTGGAGTGCG CCCGCAACTA A
|
Protein sequence | MKRMTTAAAT AALTVGMASA ANADEIRIGW TAWADAEFVT RMAEDVITSE FNTDVELVQT DIAPQYTGLA QGDIDLMLMS WQPTTHEDYV EQFGDDIVDL GVLYGDAALG WIVPQDLYDE GLTSIEDLKD PEWHDRLDGQ IQGIDPGAGL TRLSHDVIED YDLDYNLIEA SDSAMTAALN RAARRGDAIV VTGWSPHWKF GAWNLAYLED PKGSLGGAES IHAMARQGFE GDHPEVASFA SCIEIDIDLL NEYMYLGREE GTEEAVEQFL DAEADLVDQW VECARN
|
| |