Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1227 |
Symbol | |
ID | 5733120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1419801 |
End bp | 1420820 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278367 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001544003 |
Protein GI | 159897756 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAC AAAATGATAC TCTGCTCGAT ATCAACGGCT TAAAAATGCA CTTCCCGGTC AAATCCAACG GTTTATTACG CCGCACGATC GGGGCAGTTA AGGCAGTTGA TGGCTTGAGC TTTTCAGTCA AACGCGGCGA GACGCTGGGC TTGGTCGGTG AATCAGGCTG TGGCAAATCG ACAACTGGGC GGGCAATTTT GCAGTTGCAT CGCCCAACCG CTGGCACGGT AAGCTTTGAT GGCACGGACT TGACCAAACT CAAAGGCGAG GCCATGCGCC AAATGCGCCG TAAAGTCCAG ATTATTTTCC AAGACCCCTA TGCTTCGCTC AACCCGCGCA TGACCGTTGG CGATATTGTG GGTGAGCCGA TTCGGGTGCA TGGCCTGCGC ACTGGTAAAG AGGTGCGCAC GCGGGTTGAG GAATTGTTGC GCGTCGTGGG TCTCAATCCT TATTTTATCA ACCGCTACCC GCATGAATTT TCGGGCGGTC AACGCCAACG GATTGGGATT GCCCGTGCTT TGGCGGTCGA GCCGGATTTT ATCGTCTGCG ATGAGCCAGT TTCAGCGCTC GACGTGTCGA TTCAAGCCCA AATTATTAAT TTGCTGCAAG ATTTGCAAGG CCAATTTGGC TTGACCTATC TGTTTATCGC CCATAATCTG AGCGTGGTCA AGCATATCAG CGATCGGGTG GCGGTGATGT ATTTGGGCAA AATGGTTGAG TTGGCTCCCT CAAAAGAGTT GTATGCCAAC CCGATGCACC CCTATACCCA AGCTTTGCTC TCAGCTGTGC CGATTCCTGA TCCTGAGGTC GAAAAGCAAC GCCAGCGAAT TATTTTGCAA GGCGATGTGC CCAGCCCGCT TAATCCGCCG ACTGGCTGTC ACTTCCATAC GCGCTGCCCG ATTGTGATTG ACAAATGCAA AGCCGAAGAT CCACCCTTCC AAGATTATGG TGGCGGCCAT TTTGTGGCTT GCTGGCGGGC AACTGAGTCG CAAGAACAGA TGAATTTGAA GTTGCAGTAG
|
Protein sequence | MAVQNDTLLD INGLKMHFPV KSNGLLRRTI GAVKAVDGLS FSVKRGETLG LVGESGCGKS TTGRAILQLH RPTAGTVSFD GTDLTKLKGE AMRQMRRKVQ IIFQDPYASL NPRMTVGDIV GEPIRVHGLR TGKEVRTRVE ELLRVVGLNP YFINRYPHEF SGGQRQRIGI ARALAVEPDF IVCDEPVSAL DVSIQAQIIN LLQDLQGQFG LTYLFIAHNL SVVKHISDRV AVMYLGKMVE LAPSKELYAN PMHPYTQALL SAVPIPDPEV EKQRQRIILQ GDVPSPLNPP TGCHFHTRCP IVIDKCKAED PPFQDYGGGH FVACWRATES QEQMNLKLQ
|
| |