Gene Information |
Locus tag | Haur_1225 |
Symbol | |
ID | 5733118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1417675 |
End bp | 1418673 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278365 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001544001 |
Protein GI | 159897754 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
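The length fields above follow from the coordinates. The sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Haur_1225
length = gene_length_bp(1417675, 1418673)
print(length)                     # 999
print(protein_length_aa(length))  # 332
```

Both results match the Gene Length and Protein Length rows of the record.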
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCG CATCGAATGT GGCAGGCAAA AGCCCTGCCC AGGCTCAAGA ATTTACGCGG CCTGTTCGGA GTCTTTGGAG CGATGCTTGG TTACGTCTGC GGCGCAATCG TTTAGCAATG GCAAGTATTG TCTATTTGTT TTTGCTGGCG TTGGTAGCAA TTTTCGCACC AGTCATTGCC CCGCACTCGC CTAGCCGCCC AACCTCAACC GAATTACGCG AACGAGGCAC CTATCGGCAA GCTGCTTGGA TTGTTGATGA GAAAAACCCT AAACGCAGTG GGGTTTGGAA ATTTCCCTTA GGAACAGATT CTGCTGGCGG CGATGTGCTC AGTCGCTTAA TCTATGGAAC GCGGGTTTCG ATGGTTGTGG GCTTCATTCC CATGATCTTT ACCCTGACGA TCGGGATCAC GATTGGCTTG GTTTCAGGTT TTGCCGGTGG CAAACTCGAT AGTTTGCTCA TGCGGTTTAC TGATATTGTC TTTTCGCTGC CCGATATCTT GTTCTTTATT ATTGTGCAAA CGGCCTTCAG TCAAACCGCC TTTGGCAAGA CCTTCAATGG TTTATTGTTG ATTTTCTTAT CATTCTCAGC GGTCAACTGG GCTAGCGTTG CGCGTTTGGT GCGTGGCCAA GTGCTTTCTT TAAAAGAAAA AGAGTTTGTT GAAGCAGCAG AGGCGATTGG GGTTCGGCGT GGCTCAATTT TATTTCGCCA TATTTTGCCC AACACGCTCG CCCCAATTAT TGTGGCAGGT GCGTTTATTG TGCCAAGCGC GATTGTCACC GAAGCAACCC TGAGCTTTTT GGGGATTGGC ATCCAGCCTG ATACCAACCC CAATAATCCG TTCCCTACCA GCTGGGGCCA GATGATTTTG GAAGGTAAGT CGGCGATTGA TTCGCAACCA TGGATTCTGA TCGCGTCGGC GATTGCAATT GCTTCAATTA CGATTGCTTT TGTGGCTTTG GGCGATGGTT TACGTGATGC GCTTGATCCC CGCCAATAG
|
Protein sequence | MATASNVAGK SPAQAQEFTR PVRSLWSDAW LRLRRNRLAM ASIVYLFLLA LVAIFAPVIA PHSPSRPTST ELRERGTYRQ AAWIVDEKNP KRSGVWKFPL GTDSAGGDVL SRLIYGTRVS MVVGFIPMIF TLTIGITIGL VSGFAGGKLD SLLMRFTDIV FSLPDILFFI IVQTAFSQTA FGKTFNGLLL IFLSFSAVNW ASVARLVRGQ VLSLKEKEFV EAAEAIGVRR GSILFRHILP NTLAPIIVAG AFIVPSAIVT EATLSFLGIG IQPDTNPNNP FPTSWGQMIL EGKSAIDSQP WILIASAIAI ASITIAFVAL GDGLRDALDP RQ
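The sequences above are printed in blocks of ten characters. A minimal sketch for working with them follows, assuming whitespace is the only formatting to strip and that the stop codons are those of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_terminal_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used only as a short demonstration
fragment = clean_sequence("ATGGCAACCG CATCGAATGT")
print(fragment)              # ATGGCAACCGCATCGAATGT
print(gc_percent(fragment))  # 50.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `has_terminal_stop` lets the GC content and the closing TAG codon be checked against the record.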