Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4246 |
Symbol | |
ID | 5736100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5417242 |
End bp | 5418489 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281401 |
Product | major facilitator transporter |
Protein accession | YP_001547006 |
Protein GI | 159900759 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000179551 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCAA CATCGACTAC CCCGCGCCAA CCCTCGCCGT TGGTCGCTTT ACGCCACCGC GATTATCGGC TGCTCTGGAG CGGCCAACTG ATTTCAATTG CTGGCTCGCA GATGCATACC GTAGCGTTGC ACGTTCAAGT TTATCGTTTA GCTAGCGCGA TTCCTGGGGC TAATCCAGCG ATTTTTCTGG GCTTGATTGG CTTGTTTCAG TTTATTCCCT TGCTGTTGCT GGCGTTACGT GCAGGTCTAT TGGCCGATCG AGTTGATCGA CGACGTTTGA TGCTGGTAAC ACAAAGTATC TTGATGGGGT TATCGTTGGT ATTGGCGGTT TTATCGTGGT TTGGCTTAAT CAATCTGTGG TTGCTGTATG GAATTATGAT TATCTTTTTC AGCACCAAAA CCTTTGATTT ACCAGCTCGC CAAGCGTTAA TTCCGCGTTT AGTGCCGCGT GAAGTACTGC CAACAGCATT AAGTTTAAAT ATGATTGCTT GGCAAATTGG CAATATTGCT GGGCCGGCCT TGGGTGGTTG GTTTGTTAGC TATTCAATTG CCTTGGTCTA TTTGATCGAT GCGATCAGTT ATGGCGTGGT GGTATTGAAT TTGTGGCAAA TGCGCGGCAA TTATGCCCCA ACCGAGGTTA AACCAATAAT CAAAGGCTCC ATGTGGGAAG GTTTGCACTT TGTACGGCGC ACGCCGATTA TTTGGTCTAC CATGGTGCTC GATTTTATTG CCACCTTTTG TGGTGCTGCC ACGACCCTCT TGCCCTTATT CGCTGATAAA GTCTTAAAGG TTGATGAAAA AGCCCTCGGT TTGATGTATG CAGCACCAGC AATTGGCGCG TTAGTAGCCG CCCTCGCCAT GTCGTGGTTT GGCAATCCGC GCCGCCAGGG CATGGTTGTG GTGGTTTCGG TGGTGCTCTA TGGCTTGGCG ACCATGGTGT TTGGGCTAGC TCCAAGCTTA CCAATTGCCG TGCTGGGCTT GGCGGGCACA GGTGCAGCTG ATACGGTCAG TGCTGTCTTG CGCGGCACAA TTCGCCAATT AAACACCCCC GACGAGCTGC GTGGGAGAGC AACCTCGGCC AATATGCTGT TTTTTCAAGG CGGGCCATTG CTAGGCGAGG TTGAAGCTGG CTTCGCTGCA TCATTGGTTG GTGCGCCAAT CGCTATCGCT TTTGGTGGCG CGATTTGTGT CGCCGCCGCA ATCATCATTG CTGTGCGGAT ACCCAGTTTA CGCTTGTACG ATCGTTGA
|
Protein sequence | MASTSTTPRQ PSPLVALRHR DYRLLWSGQL ISIAGSQMHT VALHVQVYRL ASAIPGANPA IFLGLIGLFQ FIPLLLLALR AGLLADRVDR RRLMLVTQSI LMGLSLVLAV LSWFGLINLW LLYGIMIIFF STKTFDLPAR QALIPRLVPR EVLPTALSLN MIAWQIGNIA GPALGGWFVS YSIALVYLID AISYGVVVLN LWQMRGNYAP TEVKPIIKGS MWEGLHFVRR TPIIWSTMVL DFIATFCGAA TTLLPLFADK VLKVDEKALG LMYAAPAIGA LVAALAMSWF GNPRRQGMVV VVSVVLYGLA TMVFGLAPSL PIAVLGLAGT GAADTVSAVL RGTIRQLNTP DELRGRATSA NMLFFQGGPL LGEVEAGFAA SLVGAPIAIA FGGAICVAAA IIIAVRIPSL RLYDR
|
| |