Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5425 |
Symbol | |
ID | 5673756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6563933 |
End bp | 6564970 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244280 |
Product | bile acid:sodium symporter |
Protein accession | YP_001509686 |
Protein GI | 158317178 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.18483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTGCG ACGATGACAG GGACACACCC GGCCAGCGAT CAAGGAGTCT GGCACTGACG CAGAAACCGG TCGGCCTGGT CGCCCGGATG GAACACCATC AGGTCGCGGT CTATCTCACT GCCATGACCG CCGGTGCCGT GCTCGGCCTT ACGGCACCCG GCCTCGGACC GGGTCTGGAA CACGCGATCA ACCCGCTGCT CGGGGCGCTT CTCTACGTGA CGTTCCTCCA GGTCCCCGCC GCCGAACTGG CCCGCTCCCT GCGCGACGGG CGGTTCCTCG CCGCCGCGCT GGTCGTGAAC TTCGCAGTCG TGCCGCTTGT GGTCGCCGCG ATGTTTCCGT TCCTGCCGGA CGATCAGGCC GTCCGCCTCG GGGTGCTGCT GGTTCTGCTC TGTCCCTGCA TCGACTGGGT CATCGTGTTC ACCGGCCTTG CCGGAGGCAG CGGCCGGCGT CTGCTCGCCG CGACCCCGCT GCTCCTGCTC GCGCAGATGC TGGTCCTTCC CGTCCTGCTG TTCGCCTTCC TCGGCTCCGG CCTGGCCGAC ATCGTCGACG CGGGACCCTT CCTCCGGGCG TTCCTCACCC TGATCGTCGT TCCGCTGGCC CTGGCCTGGG TCACCCAGGG CTGGGCAGCC CGCAGGCCAG CCGGGCAGAC CGTGTCGGAG ACGGTCAGCA CGGCGATCGT CCTGCTGATG GCCGCCGTGC TCATCGCCGT GGTCGCCTCC CAGCTCCCGA AACTCGACGG CAGGCTCGGT GACGTCGCCG CCGTCGTCCC GTTCTACCTG GCGTTCCTCG CCGTCATGGC GTTCACCGGG AAGGCACTGG CCCGTCTGTT CCGCCTCGAC GTCCCCTCGG GACGGGCGGT GCTGTTCACC GGTGCCACCC GTAACTCCCT CGTCGTCCTC CCGCTCGCCC TCGCTCTGCC CGACGAGTTC GCCGTCGTAC CGGTCGTCGT GGTCACCCAG ACCCTCGTCG AAGTCCTCGG CATGGTCGCC TACGTCCGGC TCGTGCCACG ACTACTCCCC ACACCCCACC CCACCTGA
|
Protein sequence | MTCDDDRDTP GQRSRSLALT QKPVGLVARM EHHQVAVYLT AMTAGAVLGL TAPGLGPGLE HAINPLLGAL LYVTFLQVPA AELARSLRDG RFLAAALVVN FAVVPLVVAA MFPFLPDDQA VRLGVLLVLL CPCIDWVIVF TGLAGGSGRR LLAATPLLLL AQMLVLPVLL FAFLGSGLAD IVDAGPFLRA FLTLIVVPLA LAWVTQGWAA RRPAGQTVSE TVSTAIVLLM AAVLIAVVAS QLPKLDGRLG DVAAVVPFYL AFLAVMAFTG KALARLFRLD VPSGRAVLFT GATRNSLVVL PLALALPDEF AVVPVVVVTQ TLVEVLGMVA YVRLVPRLLP TPHPT
|
| |