Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4617 |
Symbol | |
ID | 5672962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5502658 |
End bp | 5503959 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243478 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508894 |
Protein GI | 158316386 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0699008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGTC GCGGCACTCC GAGAGGCCGG GGCGGCCGGT CCACGGCGGC CGCCGTGGGA GCGCTGGTAC TGGCCGCGGT GCTGGCCCTG GGCGCCTGCG GCGGTTCCGG CGACGGAGCT GGCCGCGGCC CGGTGCGGCT GACCTGGTAC GTCTACAACG AGTCGTCCGG TTCGTTCGCG AAGGCGGCCG CGGACTGCTC GGCCGCGTCG AACGGCCGGT ACACGATCGG CATCAACATG CTGCCGAACG ATTCGGACGG GCAGCGCCAG CAGCTGGTGC GGCGGCTCGC CGCCGAGGAC TCCTCGATGG ACATCCTCGC GCTGGACGTG ACCTGGACCG CCGAGTTCGC CGAAGCCGGC TGGATCGTGC CGTTCCCGGC GGCGGAGGCC AGGCGGCTCA CCGACGGGAT GCTGCCGGCC GCGGTGCGGA CCGGCACCTG GGAGAACCAG CTCCACGCCG TGCCGCTGAA CACCAACGTC CAGCTCCTGT GGTACCGCAA GGATCTGGTG CCGCGGCCGC CGCGGACCTG GGACGAGATG CTGGCCGACG CCCGGCGGCT GGCCGAGCAG GGCAGGCCGC ACTACGTCGA GGTACAGGGT GCCCAGTACG AGGGCTACAC CGTGCTGTTC AACTCGTTGG TGGCCTCGGC CGGCGGCCAG ATCCTCGACG AGGACGGCAC CCAGGTGGTG CTCGGCGCAC CCGCGCAGAA GGCCGTCGAG GCGATCCGGG CGCTGGCGCA CTCACCGGCC GCCGACCCGT CCTGGTCGAA CCAGCGGGAG GACGACAACA GGCTGGCGTT CGAGACAGGT TCGGCCGCCT TCCAGCTGAA CTACCCGTTC ATCTATCCGT CGGCCCGGCA GAACAACCCG CGGCTGGCCG AGCAGATCGG CTGGGCGCAG TGGCCGACGC TGGTGCCCGG TCAGCCGTCG CACAGCACGA TCGGCGGGTA CAACCTCGCG ATCGGCGCCT ACAGCCCGCA CCGGGCCGAG GCGGCCGCCG CGATCGAGTG CCTGACCGGC CGGGACAACC AGATCCGCGA CGCGATCGAC GGCGGGCTCC CGCCGACCAT CGAGGATCTC TACACCGACC AGAAGTTCAT CGCCGGCGGC TACCCGTTCG CGTCGGCCAT CTACACGGCG CTGCAGAATG CCAGCGTGCG GCCGCGGACG CCGGCCTACC AGAGCGTGTC GCTGCAGATC GCGCACACCC TCTCACCGCC GTCCTCGGCG AGTCTCGGCA GGCTCGGGCA GCTGCGCGGG GCGATCGCCG ACGCCATCGA GTCGAAGGGA CTGGTGCCGT GA
|
Protein sequence | MLSRGTPRGR GGRSTAAAVG ALVLAAVLAL GACGGSGDGA GRGPVRLTWY VYNESSGSFA KAAADCSAAS NGRYTIGINM LPNDSDGQRQ QLVRRLAAED SSMDILALDV TWTAEFAEAG WIVPFPAAEA RRLTDGMLPA AVRTGTWENQ LHAVPLNTNV QLLWYRKDLV PRPPRTWDEM LADARRLAEQ GRPHYVEVQG AQYEGYTVLF NSLVASAGGQ ILDEDGTQVV LGAPAQKAVE AIRALAHSPA ADPSWSNQRE DDNRLAFETG SAAFQLNYPF IYPSARQNNP RLAEQIGWAQ WPTLVPGQPS HSTIGGYNLA IGAYSPHRAE AAAAIECLTG RDNQIRDAID GGLPPTIEDL YTDQKFIAGG YPFASAIYTA LQNASVRPRT PAYQSVSLQI AHTLSPPSSA SLGRLGQLRG AIADAIESKG LVP
|
| |