Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0234 |
Symbol | |
ID | 4711090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 268798 |
End bp | 269928 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854694 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001001830 |
Protein GI | 121997043 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4176] ABC-type proline/glycine betaine transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.197638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATC AACCGGATCT CGAGGAACAC GAAACCATCG GCCCGCTGGA CCAGGCGGCC GAGTGGTTCA GTGAGAACAT CCTGGACAAC ATCACCATCG GCGACTGGAT CGAGGACGGC GTCGACTGGA TCAGCGACAA CCTGGAGCCC CTGCTCGACG GCATCGAGGG CGCCATCCGC GCTCTGGTCG ACAGCACCGA GTTCCTCCTC CTCTACCCGC TGTGGATCGC CGCCTTCTTC CTGGTGGTCG GCGCCTGGCG CACCTGGGGC CGCAAGGCCG GACTGATCAG CCTCGCCGTG GCGGTGGCGC TGTTCGGCAT GGGCCTGTTT TCCGAGACCG TGCAGGCGCT CTTGTGGTAT CCGCCGCCGT GGGTGCTGGC GATCCTGCTC ATCGCGGTGT CCTTCTGGCG GGTCGGGTGG CGCTTCGGCA TCTTCGCCAT CATCGCCCTG GCGCTGATCT TCAGCATGGA GCTGTGGCCG GAGACCATCC GCACCCTGTC GCTGGTGGTG GCCTCCTCCA TCGCGGCGCT GATCATCGGC CTGCCCATCG GCATCGCCAT GTCGCGCAAC GACCGGGTGG AGATGGTGGT CCGCCCCATC CTCGACCTGA TGCAGACCAT GCCGCCGTTC GTCTACCTGA TCCCGGCGGC GATCTTCTTC GGACTGGGCA CGGTGCCGGG GGCCATCGCC ACGCTGATCT TCGCCATGCC GCCGGCGGTG CGCCTGACCA ACCTGGGCAT CCGCCAGGTC AGCCAGGAGC ACGTGGAGGC CGGCCAGGCC TTCGGCTGCA CGCCGCGGCA GCTGCTGTTC AAGATCCAGC TGCCGCTGGC CACGCCATCG ATCATGGCCG GGATCAACCA GACCATCATG CTCGCCCTGT CCATGGTGGT GATCGCCTCC ATGATCGGCG CCGGTGGCCT GGGTGGCACG GTGCTCACCG GCATCCAGCG CCTGCAGGTC GGGCTCGGCT TCGAGGGCGG TCTGGCCGTG GTCTTCCTGG CCATCCTGCT CGACCGCATC AGCCAGAGCT TCGGCGAGCG TCAGCGCGGC AAGGGCCGCG ACTACGGCGC CCTGCTGCGC TGGTTCTTCG GTCAAAAGCG CGACCCGGCG CAACAGCCCC AGCAGGGCTG A
|
Protein sequence | MQDQPDLEEH ETIGPLDQAA EWFSENILDN ITIGDWIEDG VDWISDNLEP LLDGIEGAIR ALVDSTEFLL LYPLWIAAFF LVVGAWRTWG RKAGLISLAV AVALFGMGLF SETVQALLWY PPPWVLAILL IAVSFWRVGW RFGIFAIIAL ALIFSMELWP ETIRTLSLVV ASSIAALIIG LPIGIAMSRN DRVEMVVRPI LDLMQTMPPF VYLIPAAIFF GLGTVPGAIA TLIFAMPPAV RLTNLGIRQV SQEHVEAGQA FGCTPRQLLF KIQLPLATPS IMAGINQTIM LALSMVVIAS MIGAGGLGGT VLTGIQRLQV GLGFEGGLAV VFLAILLDRI SQSFGERQRG KGRDYGALLR WFFGQKRDPA QQPQQG
|
| |