Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5845 |
Symbol | |
ID | 5674168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7090642 |
End bp | 7092066 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244695 |
Product | hypothetical protein |
Protein accession | YP_001510097 |
Protein GI | 158317589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.142107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.411769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCCGAG GCGGGGGTGG CTCGACCCAC GTCCGCAGCG GTCCCCGGCC GCCGGCCATC CGGCGCGCGG TGCGGCTCGG CGCCGTCCTC GGCGCACTGG TGATCGCGTA CGCGGTGATC ACCGCCGTGG TCAGCCGCCC GGCCGGCCAC GGCTACCTGG ACATCACCTC CGCGGACGGA TACGGCACCC GGGCGGTCGC GGAGATCCTG CGCGCCCGCG GGGTGACGGT GACGGCCGGC GAGCAGGTGC CGATGGCAGC CCCGGCGCCC GGCCCCGCGG GGACGTCCAC AGGGCGGACG GTGGTCGTCA CCAACCCCGA CCTGCTGTCC CAGGCCGAGC TCGACGCGCT GGTGACCAGC ACCCGCGCCG GCTCGGACGT CGTTCTCGTC GCCCCGGATC CCAGCGTGGT GGTCGCGCTG GGCCTGCCGG TCACGACCGG GCAGCCGGGC GAAGGTGACT ACGGTGACGA ATCCCCCCCG GGGTGCACGC TCCCCGAGGC AGGCGCGGCC GGGTCGACCA CGGTCGGCGA CTCGGTGACG TTCGCCCGCA CCAATGGCCA GGACAGCCAG AACGGCCAGG GCACCCCGGA CGGCCGGAGC GGCGCCACCC GCGTCGAGCT CTGCTACGGC GACCCCGGCG CGGCCCGGTT GGCGGTGCTC ACCCCCACCC GGTCGAACGT CGGCAGCGGG CCGACCGGGC GCCTGGTGCT CCTGGGCGGC GGCGGCTTCC TCACCAACGA ACGCCTGGAC GAGACCGGGA ACGCGGCGCT GGCCCTCGGG CTGCTCGCCC GCGCCCCCCG GCTGGAGTGG GTGACCCCGG TCGTCGCGGC CGAGGACGCC GTGGGCACGA AAGGCCTGTC GGAGCTGCTC CCCGACGGTT TCTGGTTCGG CGCCCTGCAA CTCCTGCTCG TCCTGGTGTT CCTCGCTTTG TGGCAGGGAC GGCGGCTCGG GCCGCCGATC GAGGAGCCGC TGCCCGTCGT GGTGCGCTCC ACCGAGACCG TCGAGGGCCG GGGCCGGCTC TACGCCGCGG CGCACGCCCG CGAACGGGCC GCCGCGGCGC TGCGGGCGGG GCTGCGGGCC CGGCTCGCCG ACCGGCTCGG CATCGATACC GCCGCGGCCG GCTCGCCGTG GCACACCCGT GGTCCGGATC CGACCGTCCT CGTGGCCTCC GTCGCCGAAC AGACCGGACG GTCACCCATG GAGATCGGGC CACTCCTGTA CGGTTCGGAC ATGGCGCCCA CGGGCTATTC CCAACCACCC TGGCCGGCCC ACCAGGGCCC GCCCGCCTCA TGGGCATCAC CGATCCCGCC GAGCCCGGTC GACGTCCAGG TTGCCAGGAC GAGGCGGGAT GCCGAGGGTG ACGTCGCGCT CGTCCGGCTG GCCCGCGCAC TGCACGAACT CGATCGACAG GTGGGCAGAA GGTGA
|
Protein sequence | MGRGGGGSTH VRSGPRPPAI RRAVRLGAVL GALVIAYAVI TAVVSRPAGH GYLDITSADG YGTRAVAEIL RARGVTVTAG EQVPMAAPAP GPAGTSTGRT VVVTNPDLLS QAELDALVTS TRAGSDVVLV APDPSVVVAL GLPVTTGQPG EGDYGDESPP GCTLPEAGAA GSTTVGDSVT FARTNGQDSQ NGQGTPDGRS GATRVELCYG DPGAARLAVL TPTRSNVGSG PTGRLVLLGG GGFLTNERLD ETGNAALALG LLARAPRLEW VTPVVAAEDA VGTKGLSELL PDGFWFGALQ LLLVLVFLAL WQGRRLGPPI EEPLPVVVRS TETVEGRGRL YAAAHARERA AAALRAGLRA RLADRLGIDT AAAGSPWHTR GPDPTVLVAS VAEQTGRSPM EIGPLLYGSD MAPTGYSQPP WPAHQGPPAS WASPIPPSPV DVQVARTRRD AEGDVALVRL ARALHELDRQ VGRR
|
| |