Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4453 |
Symbol | |
ID | 5672804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5319629 |
End bp | 5320699 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243321 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508737 |
Protein GI | 158316229 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA AGGGGCACCA TTTACGTAAC ACCCGGCTCG CCGCGCTGAT ACTCGGTTCC CTCGTCTTCG CGGCGGGTTG TGGTGGCGGC GGGTCCGCCA GCTCGGAGAA CTCCGACAGG ACCGCGGGAA TGACCTGGGA CCAAATATTA TCCTGCGCCC GCGAAGAGGG CACGGTCACC TATTACTCCT CGAACATTCC CAATCTCAAC AAGGCGGTCG GCGAGGCGCT CAACGCCAAG TACGGGATCC GCGTCGAGTC CTACCGCGAC GTCGACGCGA CGATCCACCA GCGGGTCGAC GCCGAGATCA AGACCGGCAA GGTGGTCGCC GACGTCGTCC AGACGGCGTC CCCGCTCACG TTCCAGCGCA TGGCCGAGGC CGGAGAGCTG CGCGCGACCG CCGGCCCGGC GTTCGACAGC CCCGACTACC GGCCGGAGTA CCGGCACGCG GACAGCCAGT ACTTCCAGAC CAACAGCCCG TTCTTCGTGC CCGCCTGGAA CACCGATCTC CTGCCGGCCG GCCTCAAGGA CTACTCGGAT CTCCTCAAGC CCGAGCTGCG TGGACGCATC GCGATCCAGG ACCCGTCCGT GTCGAACGTG ATCCTCGATC ACTACCTCTG GATGGAGAGG ACCGAGGGCC CGGACTTCCT GCCCGCTCTC GCCGCGCTCG CCCCCCGGGT GTACCCCAGC CAGGTGCAGT CCATCGAAGC GCTCGCCGCC GGCGAGGTGG CAGCCACGAC GATCATGTCG CCGTCACTCA TCGCCCCCCG GAAGGCAGCT GGTGCCCCGG TGGACTACGT GTTGCCCTCA GGAGGCTCGT ACCCGGGGTC GCACTTTTAC GCCAGCGTTC TGGCGGGGGC GCCACACCCG TGCGCCGCAC AGGTGCTTGC GAACTTCCTG GTGACATCCG AGGGGCAGGA CGTCGTGGCG GAGAACTCCG CATCCGCGCT GCCGAACGTT CCGGGCTCGC CGATCACCAT CGACAAGGTG GAGGCCACCG ACCCGAACAA GTTCACTGAC GAGTTCGTGG ACGACCACCT GGCGACCTGG ACGAAGCTGT TCCGACGATG A
|
Protein sequence | MIDKGHHLRN TRLAALILGS LVFAAGCGGG GSASSENSDR TAGMTWDQIL SCAREEGTVT YYSSNIPNLN KAVGEALNAK YGIRVESYRD VDATIHQRVD AEIKTGKVVA DVVQTASPLT FQRMAEAGEL RATAGPAFDS PDYRPEYRHA DSQYFQTNSP FFVPAWNTDL LPAGLKDYSD LLKPELRGRI AIQDPSVSNV ILDHYLWMER TEGPDFLPAL AALAPRVYPS QVQSIEALAA GEVAATTIMS PSLIAPRKAA GAPVDYVLPS GGSYPGSHFY ASVLAGAPHP CAAQVLANFL VTSEGQDVVA ENSASALPNV PGSPITIDKV EATDPNKFTD EFVDDHLATW TKLFRR
|
| |