Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2052 |
Symbol | |
ID | 5670453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2471848 |
End bp | 2473143 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240974 |
Product | hypothetical protein |
Protein accession | YP_001506395 |
Protein GI | 158313887 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.618886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.624101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGA AGGACGGCTC CCGCCCCACG GCGCTGCCGC CGATCGCAGT GCGCCCAGGC CTGTTCGTGT CCGTGGCGGT CGTGACAGTG CTGCTCGGCG CGCTCACCCT CCCCGCCACC GTGCCGGGCC GTCCGGGCTT CGCCTACTTC AGCGGGGGCG TGCTCGGCGC GGGTCTGCTC GTCGCGATCC TGCTCGGGGC CGACCTGGCC CGGGCCGCCG CCGCGCGCAG GGCCGGGATC ACGGTCACCG GGATCACCCT CGGCGCCTTC GGGAGCCGGC TGGGCCTCGC ACCCGCACGC GACCGCTCGA CGGACCGGGG TGACGGCAGC CCGCTGAGCG GTACCGGCCC GTCGGGCGGT AGCGCCCCGT TGGGCGGTAC CGGCCCGGCC GGTTCCGATG ACCCGCTCGC CCCGGCCACC GGTGACGCAC TCGCCGACGC CGCCGTCGCC CGTGCGGGCC TGATGGTGAC GGCCCTCGCC GGGATGGTGC TGGTCGCCGC CGGCGCGTTC GCCCCCGGCG GAACGCTCGC CCTGGTCGGC GAGCTGGCGC TCTGGGTCGG CACGTTCGCC CTGCTCATCA CCGTCGTCGA CCTGCTGCCC GCACCGCGCA GCGCCGGCGG GCGGATCCTC GCCGCCCGGG TCCTGCGCCG CACCGGTGAC GAGGCGGCCG CGGGCGCGGC CGTGGCCCGC GCGGGTGTCA TCACCGGGTG GACCCTGATC GTCTTCGGTG CGGCGGCCAC CTTCCTGGTC GGGCTGGTCG GCCTGTGGGC GATTCTCCTC GGCTGGCTCG CGCTCGGGAC GTCCCGGCTC GCGCAGACGC AGGAGCGCAC CTCCGCCGCG CTGCGCGGGG TCTTCGTCCG TGACGTGATG ATCCCCGCCC CGGAGGCGCT GCCGTCCTGG AAGACAGTCG CCGCCGCGCT GGACGAGACC GTGCTCCCGT CCCGCGCCTC GGTGTTCGGG GTCCGGGACT TCGCCGGGCC GCTGATCGGC GTCACCCTGC TGCGTGATCT GGCCGCGGTG CCCGCGGACG ACCGCGACCT GGCCCGGGTG GCCCGGGTGA CCATCCCGCT GGACCGGGTC GCCACCGCCC GTCCGGAGGA GCCGCTCGCC GCGGTCGCGT CGCGGCTGGC GCACCGGCCC GCCGCGGGCG TGATCGTCGT CGTCGCCGAC GGCCCGGATG GCCTGCCCGG GATGGTGGGC ACCGTGGGCC CGGGCGAACT GGCCCGGGCG CTGGAGACCA CGCCGCTGCA CGGCCGGGTG GTCATCCCGA CCGGCTTCGG CCGCCGCCGC CGGTGA
|
Protein sequence | MSTKDGSRPT ALPPIAVRPG LFVSVAVVTV LLGALTLPAT VPGRPGFAYF SGGVLGAGLL VAILLGADLA RAAAARRAGI TVTGITLGAF GSRLGLAPAR DRSTDRGDGS PLSGTGPSGG SAPLGGTGPA GSDDPLAPAT GDALADAAVA RAGLMVTALA GMVLVAAGAF APGGTLALVG ELALWVGTFA LLITVVDLLP APRSAGGRIL AARVLRRTGD EAAAGAAVAR AGVITGWTLI VFGAAATFLV GLVGLWAILL GWLALGTSRL AQTQERTSAA LRGVFVRDVM IPAPEALPSW KTVAAALDET VLPSRASVFG VRDFAGPLIG VTLLRDLAAV PADDRDLARV ARVTIPLDRV ATARPEEPLA AVASRLAHRP AAGVIVVVAD GPDGLPGMVG TVGPGELARA LETTPLHGRV VIPTGFGRRR R
|
| |