Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3447 |
Symbol | |
ID | 5671818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4078128 |
End bp | 4079762 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242335 |
Product | hypothetical protein |
Protein accession | YP_001507755 |
Protein GI | 158315247 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.157505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGA TCCGTGACGG TAAGCCGCCG ACCAGCTGCG GCGACTGCCT GGCCTGGGGC GTCTTCGGGG GCCGGTACTG CCCGGCCTGC GCCAACTTCC GACAGCGGTT TCCCCGCGGC ACGTGCGCCG GCTGCGCCCG CCTTCTCCCT GTGAAACGGG GCTACTGCCG GCTGTGCTGG AAACAGGCCA CACTGGAAAC CGCCGGGCGT TGGGCACCCG CCATCCAGCC ATTCCTGGAA GCCGTCCGCC ACCACCAACT GTTTCTCGCC AGCCTCCACC ACAGCGTGCC AGTTCCGGCC GGCCGGCGCC TCGGCAAGAA CAGTCGAAAC GCGCCGGGCC TCACCTCACC CGCCGAAAAT CCCGGCCGCG CCGTGGCATG GGTGCAACTT CGGCTAATCG ACCTGCCGCG GGACTTCACC CGGTTCGACC GCCACAACGC CGACCTCACC AACCCACTGC TGGTCGACGC CCGCCGACGC GCCCGCGCCA TGGGCGAAGC CCGAGGATGG ACCCGCCGCG TCCACACCGA TGTCGACCGG GCGCTGGTCA TACTCCTGTC CGGGCTCGCT CCCGGCGAGA AAGTCCGCTA CTCCGACATG TTTCCCGCTC TCCAAGCACG TTGGATCAGC GTGGAGCGGA CCGTTCAGGC GTTGGACCAT CTCGGGCTGC TCGACGATGA CCGGCAGTCC ACCTTCGACG CCTACCTCGA GCACAAACTT GACGGCATCA CCCCGGGCAT CCGTCGCGAC GTCGAGGACT GGATCCGAAC CCTGTACGCC GGCGGCCCGC GAACCCGCGC GCACAGCAAA AACACCGCCT ACGGCTATCT CAACGAGATC AAACCCACCC TGCTGGACTG GTCGACCCGG TTCCATCACC TCCGCGAGAT CACCGGCGAC GACATCCAGA AGGTCATCAG CTCCGTCCAC GGCAACAAAC GCGACCACAC CATTGTCGTG CTGCGATCCC TGTTCGATCA CTGCAAGAAA ACCGGCACCA TCTTCCGTAA TCCCGTCGCG CGGCTACGCG CCGGCCGCAA ACACTACAAC CTCATCCTCC CGCTCCACCC CGAGCGGGTC AGCATGGTTC TTGATGCCGC GACCAGTCCC GCAGCCCGGC TCGTCGTCGT CCTCGCCGGA ATCCACGCCG CCCGCAACAA GACAACCCGC CACGTGCAGC TCGACGATGT CGACCTCGGC AACCGCCGCC TCGTCATCGC CGACATAAAC CGACCACTCG ACGACCTCAC CTACCATGCC GTCCTGGACT GGCTCGCCTA CCGCCGCGAC CGATGGCCCA ACACCGCCAA TCCCCATCTG ATAGTCAATG GACAGACCGC GCTGGGACAC GGTCCCGTCA GCGACAGCTG GCTATCCCTG ATCGTCCGAG GCCTGCCCGT CACCCTCGAA CAACTACGCG TCGACAGACA GCTCGACGAG GCCCTCACCC ACGGCCCCGA CCCCCTCCAC CTCGCCGCCG TCTTCGGCCT CGACCAGAAC ACCGCCATGC GCTACGCCAA CGCCGCCCGC CACCTCCTCG AGTCGCTCGC CGAGCGGCAC ACTCCCGACG GTTCAGCAGG AACCCAAGGG TCAACCACCG GTCCAAGCAC CGACCGACCC GCGAGTTCGC GCTGA
|
Protein sequence | MTWIRDGKPP TSCGDCLAWG VFGGRYCPAC ANFRQRFPRG TCAGCARLLP VKRGYCRLCW KQATLETAGR WAPAIQPFLE AVRHHQLFLA SLHHSVPVPA GRRLGKNSRN APGLTSPAEN PGRAVAWVQL RLIDLPRDFT RFDRHNADLT NPLLVDARRR ARAMGEARGW TRRVHTDVDR ALVILLSGLA PGEKVRYSDM FPALQARWIS VERTVQALDH LGLLDDDRQS TFDAYLEHKL DGITPGIRRD VEDWIRTLYA GGPRTRAHSK NTAYGYLNEI KPTLLDWSTR FHHLREITGD DIQKVISSVH GNKRDHTIVV LRSLFDHCKK TGTIFRNPVA RLRAGRKHYN LILPLHPERV SMVLDAATSP AARLVVVLAG IHAARNKTTR HVQLDDVDLG NRRLVIADIN RPLDDLTYHA VLDWLAYRRD RWPNTANPHL IVNGQTALGH GPVSDSWLSL IVRGLPVTLE QLRVDRQLDE ALTHGPDPLH LAAVFGLDQN TAMRYANAAR HLLESLAERH TPDGSAGTQG STTGPSTDRP ASSR
|
| |