Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0852 |
Symbol | |
ID | 5669268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 998673 |
End bp | 1000499 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641239781 |
Product | hypothetical protein |
Protein accession | YP_001505216 |
Protein GI | 158312708 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00483421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.421013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACGG CCTCGTCGGC GGGTGCGCGC GGCGGCGCGG GACGGCGGAG CGGGCGCGAC CGGCCCGCGA GCGCCGGCGC GGCCACCACC GGCACGACCA CCGCCGCAGC CGCGGCCGTC ACCACGGTGA CGGCCGCGCT GCTCGCGGCC GCCTTCTGGG CCTGGGCGGC ACACCGGGTC CCGGCGGTAG ACGTCGCCGC GGACGCCACC GTCGTGCTGG GTCTGCGGGT CATCGTCTCG CTCGCGCGCG CCGCCGCGCC CGGACCCGGC GCGCGGCACC GGCTGCGCGC CGGAGCCCTG GCCGCGCTCG GAGCGGTCAC CCTGGCCTGG GCCGGTGGAT CGCTGATCCC AAGCCTGTCC TTCCTCGCCG CCACCCCCGG TTTCCTCATC CTCCTGCCGC TGGCGGCCGC CAGCGTCGCC CTCTGGCCGG CGCGCCCCGA CGTGTGGCGG TTCGACGCGT CCCAGGACTC ACCGAACCGC CGCACATCGG CGGCCGGGAT GTTGGTCGCC GCGGGGGCGT GGGTGGCGCT GGGCGCCGTT CGGGTGGGCG CGCTGGTGTT CGCCGCGGAC GTCGCGGCGC GGGGCGGGGC GGGGCATGCG CACGTCGCCC TGCTCGCATG GCTGGCGCCG GTCGCCCTGG CGGCCGGTCC ACGCTGCGGA TGGCTGCTCT GGCGGACGAA CGCGGACCCC GGCCTCGCCT CATCCCGTGT GGCCACCCCG GCGGGGCGGC AGCGCCCCGT CCGGCGGCTC GGCGCCCCCC GGCGGACGAC GACGACGGCG CGAGCCGTCC CCGCGCCGGA GTGGGAGCCG CCGGCTCCCG GGTCACGCCC GGGCCCCGGA TCGCTCCCAG CCCGCGGATC GCAACCCGCT CCGGGATCGC TCCCGACCCG CGGCTCGCCA CCAGCTCCCG GCTCGCTCCC CGCTGCCGGG TCACCGTCGG AGCGAGGATC AGTGGTCCGG CCCAGCGCGC AGCGGTGGGC GCAGGGCCCC CAGCGACCGC CCAGGTCACG GCGCCCCTCG CTACCACCGA CGAGGTCGCC GCAGGCGCCG ACCCCGCGGT CACCGTTCTC GGCACTGCCC ACAGCTCCGC CCAGCAGGAC GCCGTTCGGC AGGACGCAGG TGGATGAGGC GGCGCGGCGA TCGCTCACGT CACTCGCGGC CGAGCCGGCC GCGCCGGAGC CTGCGACGCG GGTGCTCTGG TCGGCGGCGG CGGCCGGACT GGCCCTGGTC GTCCTCGATC ACCAGGTCGG CCCGGCGAGC GCCGCCTTCG CCCTGGCCGT GCTCGCCATC CTGGCCGTCG GCGGGTTCGT CGGCGGCCGT GGGGCCGCGA CGCTGCCGTG CGTGGCCGTC GCGCTGGTCG CCGCGCCGCT GCCGCTGGCC GTGCTCGGCG ACGAGTACGC CGATCAGGGA ACAGCCGGGC TCCGCCTGCT CCTGGCCGCC CTCGCGATCG ACGCCCTGCC AACCGGCGCC CAGCCGTCCA GCGGGGTGCG GGCGGCCGCG CGGCCGGTGA TCCGGCGCGG GGCCACGGTG GCTGGGCTGA CCGTTCTGGT GGCCATACTC CCGATCATGC TGGCGGCCTG GGGTGCCGAG GGGGCCGCGC TCGCGCTGCT GCTCGGCCGG GTGGTCGCCG TGGCGCCTGT GGTCCGCCTG CCGGCGCGCC GCGAGCGCGC CGCCGAGCCG GCCGCCGAGG CCCCCGGGCA ATCCCGGCCC GCCGGAGCCG CGCCGGCCGC CGTCGACCGC CGCCCAGAGG CGGCCGGCCG CGGCGGGAGA CGCGCCCGGA TGGCCAGGTT CGCGCCGGGA GTCAGTCACC CATCTCGGTC ACTGTAG
|
Protein sequence | MTTASSAGAR GGAGRRSGRD RPASAGAATT GTTTAAAAAV TTVTAALLAA AFWAWAAHRV PAVDVAADAT VVLGLRVIVS LARAAAPGPG ARHRLRAGAL AALGAVTLAW AGGSLIPSLS FLAATPGFLI LLPLAAASVA LWPARPDVWR FDASQDSPNR RTSAAGMLVA AGAWVALGAV RVGALVFAAD VAARGGAGHA HVALLAWLAP VALAAGPRCG WLLWRTNADP GLASSRVATP AGRQRPVRRL GAPRRTTTTA RAVPAPEWEP PAPGSRPGPG SLPARGSQPA PGSLPTRGSP PAPGSLPAAG SPSERGSVVR PSAQRWAQGP QRPPRSRRPS LPPTRSPQAP TPRSPFSALP TAPPSRTPFG RTQVDEAARR SLTSLAAEPA APEPATRVLW SAAAAGLALV VLDHQVGPAS AAFALAVLAI LAVGGFVGGR GAATLPCVAV ALVAAPLPLA VLGDEYADQG TAGLRLLLAA LAIDALPTGA QPSSGVRAAA RPVIRRGATV AGLTVLVAIL PIMLAAWGAE GAALALLLGR VVAVAPVVRL PARRERAAEP AAEAPGQSRP AGAAPAAVDR RPEAAGRGGR RARMARFAPG VSHPSRSL
|
| |