Gene Information |
Locus tag | Haur_0368 |
Symbol | |
ID | 5732219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 440952 |
End bp | 441866 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277491 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001543147 |
Protein GI | 159896900 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
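The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed 915 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(440952, 441866)   # values from the record above
    print(length, protein_length(length))  # 915 304
```

Because the gene lies on the minus strand, these coordinates describe its position on the replicon's forward strand; the length calculation is the same for either strand.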
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0652465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATCGC TAACTGGTGC GCTGACCTCG GCATGGCAGC GCTTGACCCA CGGCTCCGCC AAAACGCCAA TAATTGAGCA GCCCGAGCAA CGTGCCAAGC CACAACTAAG CCGCAGCGGC AAATTTGCCG CTGGCGGATT GTTGTTTATG CTGATAGCGG TGATCGTTGG CCCATGGCTC TGGCAGGTTG ATCCGCTGGC GCAGAATATT GGGCAGCGTT TAGCCCCGCC ATCGTTGAGC CATCCCCTTG GCAGCGATCA ATTTGGCCGT GATGTGCTCG CCCGAATGTT GATTGGCGGG CGTTGGTCGT TGTTTGGAGC AGGCTTCGTC TGCATTGGCA CAAGCTGTTT GGGCTTAGTA TTGGGGGCAT GCTGTGCCGT GGGGCCGCGT TGGCTCGATT ATTTGCTGAG TCGGATTACC GAAACCTTCC AAGCAATTCC ACCAGTGTTA CTAGCCCTCG CCTTGAGCGC AATCCTGTCG CGTTCGTTTA GCAATTTATT GTTGGCGCTG ATTTTGACCA ACTGGACGTG GTATGCCCGC ATGTATCGCG CCTTGATTCT CAAAGAATTA GCAATGCCGT ACATCGAAGG TGCGCAGGCA ATTGGCGTAA AACCGCTGGC GATTTTGTTG CGTCATGTAC TGCCCAACCT GTTTGGGTCG ATGGTCGTGA TTGCCACCAC CAACTTTGGC AGCGTGATAT TGAATCTTTC GGCCTTGTCG TTTATTGGCT TTGGACTCAA CCCACCAACC CCTGAATGGG GCAATTTAAT CAACGAATCA CGGGCATTTT TTCAACGCGA ACCACGGCTA ATGATTATTC CTGGCCTGTG CATTGCCACA ACTGTGCTAT GGCTCAACTT GCTTGGTAAC GCCCTGCGGG ATCGGTTGGA TCACTATCAA GGGATTAGGG ATTAG
|
Protein sequence | MLSLTGALTS AWQRLTHGSA KTPIIEQPEQ RAKPQLSRSG KFAAGGLLFM LIAVIVGPWL WQVDPLAQNI GQRLAPPSLS HPLGSDQFGR DVLARMLIGG RWSLFGAGFV CIGTSCLGLV LGACCAVGPR WLDYLLSRIT ETFQAIPPVL LALALSAILS RSFSNLLLAL ILTNWTWYAR MYRALILKEL AMPYIEGAQA IGVKPLAILL RHVLPNLFGS MVVIATTNFG SVILNLSALS FIGFGLNPPT PEWGNLINES RAFFQREPRL MIIPGLCIAT TVLWLNLLGN ALRDRLDHYQ GIRD
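The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG, so no reverse complement is applied) and uses the amino acid assignments of translation table 11, which match the standard code. It does not special-case alternative start codons, and the helper names are hypothetical. A simple GC-fraction helper is included for comparison with the GC content field; its result on the full sequence is not asserted here.

```python
from itertools import product

# NCBI-style codon ordering: first base varies slowest, bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGTTATCGC TAACTGGTGC GCTGACCTCG"))  # MLSLTGALTS
```

Passing the full gene sequence to `translate` should yield the protein sequence with the spaces removed; the final codon TAG is a stop and contributes no residue.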
|
| |