Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4935 |
Symbol | |
ID | 5673274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5924538 |
End bp | 5926526 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243789 |
Product | hypothetical protein |
Protein accession | YP_001509205 |
Protein GI | 158316697 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00817564 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.300284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCCGT GGCGGGAGGG TCTGCGCGGC TGGCTGCTCG CCGGGACGGA GCAGGCCCGC CGCCATCCCG GTCCGCACGC GCAGCCCGAA CCGCACCATC CCCGCCACCA CTGGTGGCGG GTCATGTGCC TGACCGGCGT CGACTACTTC TCCACGCTGG GCTACCAGCC GGGCATCGCC ATCCTCGCGG CCGGCGCGGT GAGCCCGATC GCCACAGCGG TGCTCGTCGC CGTGACCCTG TTCGGGGCGC TGCCGGTCTA CCGGCGGGTC GCCGGCGAGA GCCCGCGTGG TGAGGGCTCG ATCGCGATCC TGGAGAAGCA GCTCTCCTGG TGGAAGGGCA AGCTGCTGGT CCTGGTGCTC CTCGGCTTCG CCTGCACCGA CTTCCTGATC ACGATCACCC TCTCGGCCGC GGACGCCACC GCGCACATCC TGGAGAACCC GTACGCGCCG GACCTGCTCG CCGGCCATGA GGTCGCCGTC ACGCTGGTGC TCCTGGCCCT GCTCGCGGCC GTGTTCCTGC GCGGGTTCAC CGAGGCGGTG GGGATCGCCG TCGTCCTGGT CGTGGTGTAC CTGGCGCTGA ACGTGGTCGT CATCGTGGTG TCGCTGTACA AGGTCGCCAC CCATCCCTCC CTGCTCGACG ACTGGCACGC GCTGCTGGTG GCCGAGCATC CCGACGCGTT CGCGCTGGTG GGCGTGGCCC TGATCGTCTT CCCGAAGCTC GCGCTGGGAC TCTCCGGCTT CGAGACCGGT GTAGCGGTGA TGCCGCTGAT CCGCGGCGGC CCCGATAGCC CCGACGACAC CGCCGACACC GCCGACACCG CCGACACCGC CGACACCGAC GACACCGACG CGAACCCCGC CGGCCGGATC CGCGGCGCGC GGCGGCTGCT CACCACGGCA GCGCTGATCA TGGCCGTGCT GCTGCTCACC AGCAGCCTCG CGACAACGGT CCTCATCCCC GAGCGGCTCG CGGAGCCCGG CGGCCCGGCG AACGGGCGGG CGCTGGCGTA CCTGGCGCAC CAGGAGCTGG GTAACGCCTT CGGCACCGCG TATGACGTCA GCACGATCGC GATCCTGTGG TTCGCCGGCG CCTCGGCGCT GGCCGGCCTG CTCAACCTCG TGCCCCGCTA CCTCCCCAGG TACGGGATGG CACCGCACTG GGCCCGGGCG GTGCGCCCGC TGGTCCTGGT CTTCACCGCC ATCGGCTTCG CGGTCACGAT CATCTTCCGG GCGAGCGTGG ACAAGCAGGG CGGCGCCTAC GCCACCGGGG TGCTCGTCCT GATCACCTCG GCGTCGGTCG CTGTCACGAT CTCCGCCGCG CGCCGCGGGC GGCGGCGGGC GGCCTTCGGG TTCGGCGCCA TCGCCGCCGT GTTCGTCTAC ACGACCGTCG CGAACGTCAT CGAGCGGCCG GACGGGCTCA TCATCGGTTC GATCTTCATC CTGTCAATCC TGGTGATCTC CTTCCTCTCC CGGGCCGTGC GGTCCTTCGA GTTGCGGGTC ACCGGCGTGC GCCTCGACGA GACGGCCACC CGGTGGGTCA CCGAGGCGGC CCGTTGCGGG GCGCTGCACC TGGTCGCCAA CGAGTTCGAC ACCGGCGACG CCGCCGAGTA CGCCGACAAG GCCAGCAAGG CACACGAGGT GCTGCACGTC CCGCGCGCGG CCCGCCTGCT GTTCCTCGAG GTGGTCGTCC CGGACTCGTC GGAGTTCGAG GGGCGGCTCG ACGTCTGCGG ACGGGAGCGG CACGGGTACC GGATCCTGCA GCTGACCAGC AACTCCGTCC CGAACGCGAT CGCGGCCCTG CTCCTGCACC TGCGCGACCT CACCGGGACC AGGCCGAACG TGTACTTCGA GTGGTCCGAG GGCAACCCGC TGCAGAATCT GGCGCGCTTC GTGTTCTTCG GAGTCGGCGA GGTCGCCTCG ACGACCCGGG AGATCCTCCG CGAGGCCGAG CCCGACCCAC AGCGCCGCCC GTTCGTCCAC GTGGCCTGA
|
Protein sequence | MTPWREGLRG WLLAGTEQAR RHPGPHAQPE PHHPRHHWWR VMCLTGVDYF STLGYQPGIA ILAAGAVSPI ATAVLVAVTL FGALPVYRRV AGESPRGEGS IAILEKQLSW WKGKLLVLVL LGFACTDFLI TITLSAADAT AHILENPYAP DLLAGHEVAV TLVLLALLAA VFLRGFTEAV GIAVVLVVVY LALNVVVIVV SLYKVATHPS LLDDWHALLV AEHPDAFALV GVALIVFPKL ALGLSGFETG VAVMPLIRGG PDSPDDTADT ADTADTADTD DTDANPAGRI RGARRLLTTA ALIMAVLLLT SSLATTVLIP ERLAEPGGPA NGRALAYLAH QELGNAFGTA YDVSTIAILW FAGASALAGL LNLVPRYLPR YGMAPHWARA VRPLVLVFTA IGFAVTIIFR ASVDKQGGAY ATGVLVLITS ASVAVTISAA RRGRRRAAFG FGAIAAVFVY TTVANVIERP DGLIIGSIFI LSILVISFLS RAVRSFELRV TGVRLDETAT RWVTEAARCG ALHLVANEFD TGDAAEYADK ASKAHEVLHV PRAARLLFLE VVVPDSSEFE GRLDVCGRER HGYRILQLTS NSVPNAIAAL LLHLRDLTGT RPNVYFEWSE GNPLQNLARF VFFGVGEVAS TTREILREAE PDPQRRPFVH VA
|
| |