Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1900 |
Symbol | |
ID | 5733789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2292838 |
End bp | 2293881 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279044 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001544671 |
Protein GI | 159898424 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCGC CATTATTGGA AGTCAAGGAT CTCAAAGTCG AGTTCAAGCG GCCTGGTGGC GTGGTGCATG CAGTCAACGG GGTTAATTTT ACGCTTGAAG CTGGCCAAAG CCTCGGAATT GTCGGTGAAT CTGGCTCGGG CAAGTCGGTG ACGATGCTTT CGTTGCTTGG TTTGATTGGT CGGACTGGCC GCGTGGTTGG TGGTTCGGCG ATGTTCAACG GGGTTGATTT GGTCAAAATG GCTCCGCGTG AGCTGCAAGA TGTGCGCGGT CGCGATATTG CGGTAATCTT TCAAGACCCA ATGACCAGCC TCAACCCAAT TATGAAAATT GGGGCGCAGA TCACCGAAAG TATGCGCATG CGCAAAATTT ACTCGGCAGC CGAAGCCAAA GAACGGGCAA TTGAGTTGCT CGATCGGGTT GGGATTCCCC AGCCTGCCAA GCGGCTCAAC GATTATCCCT ATCAATTTTC TGGTGGCATG CGCCAACGGG TGATGATTGC GCTGGCCTTA GCACTCAAGC CCAAATTGCT GATTGCCGAT GAGCCAACCA CCGCACTTGA CGTAACGGTG CAGGCTCAGG TGCTTGATTT GCTCGAATCG TTGCAGGACG AAACGGGTAT GGCCATGATT ATTATTACCC ACGATTTGGG CGTGGCCACC AACTACTGCG ATAATTTGGC GGTGATGTAT GCTGGCGAAA TTGTCGAAAT GACGACTGTT GATCGCTTGG TTGAGCACAC ATCGCATCCC TATGCTTTGG GTTTGCTCAA CAGCACCATG GAGATTGGCC ACGGCAAAAC CGCCATCCAG CCAATTCCTG GTAACCCGCC AAGTGCCTTA AAAGTGCATA AAGCTTGCCC ATTTGCGCCG CGCTGCCGCT TCAAAAGCAG CGTTTGCCAA GAACGCAAGC CCGAATTAAC TACGGTCGAA CCCAATCACC TGGTGGCTTG TTTCCACGCC GATGCTGTGG TCAAGGCTGC TCAACAAGGT GATGATGCAG CAGCGGAGGT AATGCTCAAT GTCGCCCAAC CAATCCACGC CTAA
|
Protein sequence | MGAPLLEVKD LKVEFKRPGG VVHAVNGVNF TLEAGQSLGI VGESGSGKSV TMLSLLGLIG RTGRVVGGSA MFNGVDLVKM APRELQDVRG RDIAVIFQDP MTSLNPIMKI GAQITESMRM RKIYSAAEAK ERAIELLDRV GIPQPAKRLN DYPYQFSGGM RQRVMIALAL ALKPKLLIAD EPTTALDVTV QAQVLDLLES LQDETGMAMI IITHDLGVAT NYCDNLAVMY AGEIVEMTTV DRLVEHTSHP YALGLLNSTM EIGHGKTAIQ PIPGNPPSAL KVHKACPFAP RCRFKSSVCQ ERKPELTTVE PNHLVACFHA DAVVKAAQQG DDAAAEVMLN VAQPIHA
|
| |