Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6446 |
Symbol | |
ID | 5674761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7839224 |
End bp | 7840411 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245294 |
Product | hypothetical protein |
Protein accession | YP_001510689 |
Protein GI | 158318181 |
COG category | [R] General function prediction only |
COG ID | [COG2358] TRAP-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0878976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0413594 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCACGG AATCCAGGCT CGGCCGGTCG GGGCGCGGCC AGTCGGGCGC AACGCTGACG ACCCTGCAGC GGGTCACGGT CGTCGGCCTT GTGGCGCTCT CCGCCGCGGT GGCGATCGGC CTCGGACTGA CCGCCCGCGA CGACCCGCCG GGCACGGCCT GGCCGCCGGG GCGGCAGGCG ACGTCCTGCC ACGACGTGAA GATCTTCACC GGCCAGGTGG GATCGCCCTA CAACCGGTTC GCGCGGGTGC TGCGCACCCG GCTGCAGGCC GCGCCCGAGC ACTGGGACGT CGAGGTCGTC CCGACCGGCG GCTCCGCGGA GAACATCTAC CACCTGGAGG AACAGCAGTA CCGGACGTGC TCGCTGGCGC TGGCCCAGCT CGGCACGACC GTGGACGCCG GGTCGGCGGT CAACCAGTTC TCGCCCCAGC GGGGCGGCCA CCTCGTCGAG GGGCTGCGCA CGCTCGGGCC CGCCCACGAC GACCTCCTCC ACGTGATCGT CCGCGCGCCC GGGGGCAGCC CGCCCGGCAC CGGCGCGGAC GTGCGAACCT TCACCGACCT CTGCGACCGG ACCATCGCCG CCGGGCCGCA GAACTCGGGC ACTCGCCAGA TCGGTGATGT GCTCCTCCGC GTCGGCCTGC CGAGCACGTG CACCCCCCGA CTGGAGGACG CCTCCATCGA CGACGGGCTG CGGCTGCTGG TCGCCGGCGC CGTGGACGCG GTGTTCTGGG CCGGGGGCGC CGGCACCGAG CGCATCCGGA CCGAGCTCGC CAACGGCGCG AAACTACAGG TCCTCAACCT GGGGCAGTTC CGCGACGCGA TCACCGCGGA CTGGGAGAAG GTCTACCACC CGTCTGGGCG GTACTTCTCC GGAACGGTCT TCCCCCCGGG GCATCTCGGG CCACAGGACT ACCCGGGAAT GAGCGACGTG GACACCGTCT CGCTGCCCAA CGGCGTCCTG GCGCACGAAC AGGCCGATCC GGCGCTGGTC CGGCGGGCGA CGGCCGACCT CTTCGCCGAT CCGGCCGAAT ACGAGCGGGC GCTGTGGGGC GACAACCCCG CCGCCCGCCA CGTCCCGGAC GCGCTGACCG TCTACGAAAG CCCGCTGTTC TGCTACGTCC CGCTGCATCC GGCCGCCGCC GAGTACTACC AGCTCGAGTT TCGCCGCGGG CCCGACTGTG GCCGGTAG
|
Protein sequence | MPTESRLGRS GRGQSGATLT TLQRVTVVGL VALSAAVAIG LGLTARDDPP GTAWPPGRQA TSCHDVKIFT GQVGSPYNRF ARVLRTRLQA APEHWDVEVV PTGGSAENIY HLEEQQYRTC SLALAQLGTT VDAGSAVNQF SPQRGGHLVE GLRTLGPAHD DLLHVIVRAP GGSPPGTGAD VRTFTDLCDR TIAAGPQNSG TRQIGDVLLR VGLPSTCTPR LEDASIDDGL RLLVAGAVDA VFWAGGAGTE RIRTELANGA KLQVLNLGQF RDAITADWEK VYHPSGRYFS GTVFPPGHLG PQDYPGMSDV DTVSLPNGVL AHEQADPALV RRATADLFAD PAEYERALWG DNPAARHVPD ALTVYESPLF CYVPLHPAAA EYYQLEFRRG PDCGR
|
| |