Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3445 |
Symbol | |
ID | 5735306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4331857 |
End bp | 4333551 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280592 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001546209 |
Protein GI | 159899962 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.957532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAATA CTCATCAGCG GCGCATTCAA TGGCTGTATT GGATTTTGGC CTTGCCTGCG CTAGCTTTTT TTGTGCTCTT TTTTGGTGTG CCACTCTGGG CAATTATGCA ACGCGCGTTT GCCGAAAAAG GCCTTGGCGC GGCCATTCAA ATGGTATTCA GTAATCGCAG CCTGTTACAA GTTGTGGCTT GGAGTGCTTG GCAAGCAACG CTTTCAACAA TGTTGACCTT GCTTTTGGGC GTGCCAACCG CCTATATTTT GGCTCATTAT CGATTTCCGG GGCGAGCATT GCTGCGCTCA CTGATTGCCG TGCCGTTTGT GCTGCCAACC GTTGTGGTTG CTTTGGCCTT TCGAGCACTA TTGGGCGAGC GTGGCTTAAT CAATCGCTGG CTCGTGGCTT GGTTGGAGTT GGCGCAGCCG CCGTTGGAGC TTGAGCAAAG CCTTGGCATG ATTTTAATCG CCCATGTTTT TTTCAATTTG ACGGTGGTGG TGCGTTTGCT AACAGCCTAT TGGAGCAATC ACGATCCACG TTTAGAGGCT TCGGCTCAGG TTTTAGGTGC GCCACGCTGG CGAGTTTGGC TCGAAGTGAC CTTACCATTG GCAATGCCCG CCTTGTTGGC AGCAGCATTA TTGGTGTTTA CCTTCACGTT TTCAGCCTTC GGCACAGTGC TGTTATTGGG TAGCAGCCAG CAACGCACAA TCGAAGTCGA AATTTATGAT CAAGCGATTC ATCAATTTAA CTTGCCGATC GCAGCAACCC TCTCGCTCTT GCAAATTCTC ACCAGTTTGG GCTTGACCTT GGCTTATACG CGGCTGGTTC GGCGTAGCAG CGTGCCTCAA GAAGCCCAAG CATTGTCCTT ACGTCGCGCT CGCACGTGGC CAAGTCGCGT GGCAATTGGC GGGGTTATGC TACTTGCTAG CAGTTTAATC GTGCTGCCGC TGGCAAGTTT GGTGCTAGGA GCCTTGCGGA TCGAAGGTCA GTGGAGCCTT GAGTATTTTC GCATGCTGGG GATCAATCAG CGTGGCAGTT ACGCCTATGT GCCGCCAACC CAAGCCATGC TCAACTCATT GCGCTATGCG GGCATTACCA CGATCTTGGC TTTAGTTTTT GGTTTGCCCT GCGCCTATCT ATTGGCCCAA CCCCAAGGCC GATTAACCCG CCTGCTCGAT GGCGTGTTGA TGCTGCCCTT AGGCACCTCG GCGGTAACAG TCGGCTTGGG CTATATTATC GCCTTTCGCT CATATGAATT TTGGGGCTGG GAAACGCCTG ATTTGCGGCG TTGGTCGGGC TTGTTGCCGC TAGCTCACAC CCTATTAGCC TTGCCGTTTG TGATTCGCAC CATGGTGCCA GCCTTGCGCC GTTTGAACCC ACAATTGCGC GAAGCTGCGC GGATGTTGGG TGCAAAACCA TGGCAGGCTT GGCGTGAAGT TGATTTAGGC TTGCTGTTGC CAAGCATCAT GGCCGCAGGC TTATTTGCCT TCACTGTATC GTTGGGCGAT TTTGGAGCAG CACTGGTGGT AAGCGTAACC AGCCCAGCCA CCGCCACCAT GCCAGTCGTG ATCTTTCGCT TTTTAGGCCA ACCAGGAGCC AGCAACTATG GCCAAGCCTT AGCGATGAGC AGCCTTTTGA TGGGTGTGAC CTTCATAAGT TTTCTGCTTT TAGAGCGATT TCGCGACCAA ACCAGCGAAT GGTGA
|
Protein sequence | MSNTHQRRIQ WLYWILALPA LAFFVLFFGV PLWAIMQRAF AEKGLGAAIQ MVFSNRSLLQ VVAWSAWQAT LSTMLTLLLG VPTAYILAHY RFPGRALLRS LIAVPFVLPT VVVALAFRAL LGERGLINRW LVAWLELAQP PLELEQSLGM ILIAHVFFNL TVVVRLLTAY WSNHDPRLEA SAQVLGAPRW RVWLEVTLPL AMPALLAAAL LVFTFTFSAF GTVLLLGSSQ QRTIEVEIYD QAIHQFNLPI AATLSLLQIL TSLGLTLAYT RLVRRSSVPQ EAQALSLRRA RTWPSRVAIG GVMLLASSLI VLPLASLVLG ALRIEGQWSL EYFRMLGINQ RGSYAYVPPT QAMLNSLRYA GITTILALVF GLPCAYLLAQ PQGRLTRLLD GVLMLPLGTS AVTVGLGYII AFRSYEFWGW ETPDLRRWSG LLPLAHTLLA LPFVIRTMVP ALRRLNPQLR EAARMLGAKP WQAWREVDLG LLLPSIMAAG LFAFTVSLGD FGAALVVSVT SPATATMPVV IFRFLGQPGA SNYGQALAMS SLLMGVTFIS FLLLERFRDQ TSEW
|
| |