Gene Information |
Locus tag | Hhal_1078 |
Symbol | |
ID | 4709870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1172807 |
End bp | 1173826 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639855549 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001002656 |
Protein GI | 121997869 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component; [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | |
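The length fields in the table above follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Hhal_1078
length = gene_length_bp(1172807, 1173826)
print(length)                     # 1020
print(protein_length_aa(length))  # 339
```

Both results agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields listed in the record.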
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.102622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCGC ACGCACCCAT GGCGCAACCG ACCGTCGGCG CCGACGAGGC CCTGCTCCGG GTCGACGGCC TCTACACCCA CTTCGAGCTG CGTCAGTGGG GCTTCCTGCG TACCGGCACG GTACACGCCG TCGACGGCGT CACCTTCACC CTGGACGCCG GGGAGGCCGT CGCTGTGGTG GGCGAGAGCG GCTGCGGCAA GAGTTCGCTG GCGCGCACCC TGCTCGCTCT CCACCGGCCC ACCGCGGGCT CGGTGCGTTT TGCCGGTACC GAGCTGGGTG CGCTCAACGG CGCGGCCCTC AAGGCGTACC GGGCCCAGGT CGGCTATGTC CAACAGGATC CCTACGGGGC CCTGCCACCG TTCATGGAGG TGAAGCGGAT TCTAGCCGAG CCGCTGATCA TCCACGGCGT TCGCCCGCGG GCCGAGCGCT GGCGGCGGAT CAAGGCGGCC CTGGAGGAGG TCGGCCTGAC ACCGGCGGCC GACGTTGCCG CGAAGTTTCC CCACCAGCTC AGCGGCGGGC AGCAGCAGCG TGTGGTCATC GCCCGCGCCC TGCTGCTGCG CCCGGCGATG ATCATCGCCG ACGAGCCCGT CTCCATGCTC GACGCTTCGG TGCGGGTGGA GATCCTCAAC CTGCTCCATC GCATCCAGCA GGAGCACCGG CTGGCCCTGC TCTACATCAC CCACGACCTC TCCACGGTCC GCCACTACGT GGACCGAGCC ATGGTGATGT ACGGCGGACG GATCATCGAA CAGGCGCCGG TGGGCGCGCT GCTCCAGCGC CCGCAGCACC CGTATACCGA GGCCCTGCTC AGCGCGCTCG GGGATCCCGA CGCCGCTAAC GCCGGACGGC CGCGCCCGGT CCCCGGGGGC GAGGCCCCGA GCCTGATCCA GCCACCCTCC GGCTGCCGCT ACCATCCACG CTGTCCGTAC GCCATGGTCA ACCGCTGCGA GCTGGACCCG CCGCCCCCGG ACTTCCGGCC CCACCCGGAG CACCGCGCCG CCTGCTGGCT GCGCGAGTAG
|
Protein sequence | MSSHAPMAQP TVGADEALLR VDGLYTHFEL RQWGFLRTGT VHAVDGVTFT LDAGEAVAVV GESGCGKSSL ARTLLALHRP TAGSVRFAGT ELGALNGAAL KAYRAQVGYV QQDPYGALPP FMEVKRILAE PLIIHGVRPR AERWRRIKAA LEEVGLTPAA DVAAKFPHQL SGGQQQRVVI ARALLLRPAM IIADEPVSML DASVRVEILN LLHRIQQEHR LALLYITHDL STVRHYVDRA MVMYGGRIIE QAPVGALLQR PQHPYTEALL SALGDPDAAN AGRPRPVPGG EAPSLIQPPS GCRYHPRCPY AMVNRCELDP PPPDFRPHPE HRAACWLRE
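The sequences above are printed in blocks of 10 characters separated by spaces. The sketch below shows one way to strip that spacing, compute GC content, and translate the nucleotide sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons (the two tables differ in the permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used here as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG ordering of the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces/newlines that group the sequence into blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
sample = clean_sequence("ATGAGCTCGC ACGCACCCAT GGCGCAACCG")
print(translate(sample))  # MSSHAPMAQP
```

The translated sample matches the first block of the protein sequence in the record. Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared against the GC content field.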