Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5071 |
Symbol | |
ID | 5675750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6070967 |
End bp | 6072178 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243922 |
Product | hypothetical protein |
Protein accession | YP_001509336 |
Protein GI | 158316828 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.105918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0802572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCGG GCTCGCAGGA TTCGGGACAT GCGCCAAGCA CGGGGGCCTT GGACCATCAC ACCTCGGACA GCACCACCTC GGACAGCACC ACCTCGGATG ACGCCGTCCC GGGTCACGCC ACCTCGGACA CATCAGACGC CCGCCACCGT GCCACTCGGT GGATCCAGCG GAGCCGGTGG AGGAAACAGC GGAGGACGCA ATCGATGGAC GCCACCTCAC GGCGTCGACT CATCGCGGTC GGCGTCCTGT TCCCGCTGGA CCTCCTCGTC CCGCGAGGCG GCGGCATGAG CCCGTCCAGC GATGTGCTCG ACCACCTCGA CGACAGGGTC GCCCAGCTCA CCGCGGGTTA CGAGCGGGCC CCGATCGGGG CGACCGCCGT CCACCTGCGC CGTCACCTCC CCGTCGTCCG CTCGCTGCTG CAGCGTGACG TCGTTCCGCA GGCGACGCGG CTGCGGCTGC ACCGGGCCGC CGGCCGGCTC GTCGCCCTCT GGGCAGCCAC CCGGCACGAC CTCGGCGACC TGCCCGGCGC CGTCCTCGCC TTCGAGGAGG CATTCGTGCA CGCGGCCGAG GCCCGCGACG AGGAGCTGAT GTGCTGGGTG CGGCTGTGGC AGTCGTCGCT GACGCGCAAG GCCGGCCGGC CGGCGGACGC GGTCGCGCTC GCCGCGTCGG GAGCTGCCCT GGTCGGCCGC GGGAGCCCGG CCGCGGCCCG CGCCGCCGCC ATCGAGGCTC GAGCGCACGG CACTCTCGGT GACCGGTGGG CGGTGCACGA GGCGGTCGGC CGGGCCTGGA GCATCGCGGG AACACTGAGC GCCGAACAGC TCGGCCGCCC CGGGTTCTCG ATCGACACGC TGCACGTGCT GACGCTGTCA GAGCTGTCGG CGGCCGCGTA CGTCGAGCTC GGTATGCCGG GCGCGGCGAG CGTCTACACC GACGCGGCGA TCCACCACCT GGACGCCGTA GGCGCCACCG GGCTCCGCTC GATGACCCGG ATCGCCGCGG CCACCGCAGC GGCCAAGCGG CGGGACCTCG ACCACGCCGT CGAGCTGGTC GACGAGGCGC TGAACATCTC CCGCCACCGC CCGAGCATCG TCATCGGCGG GCGCGCCGAG CGGTTCGTCG GCGAGGCCCG CGGCTACCTC GGATGGCACT CCCTCCTCGA CGACCTCGAC GAACGTCTCC GGACCTGGCG GTCGCCCCGT CTTACTACCT GA
|
Protein sequence | MPPGSQDSGH APSTGALDHH TSDSTTSDST TSDDAVPGHA TSDTSDARHR ATRWIQRSRW RKQRRTQSMD ATSRRRLIAV GVLFPLDLLV PRGGGMSPSS DVLDHLDDRV AQLTAGYERA PIGATAVHLR RHLPVVRSLL QRDVVPQATR LRLHRAAGRL VALWAATRHD LGDLPGAVLA FEEAFVHAAE ARDEELMCWV RLWQSSLTRK AGRPADAVAL AASGAALVGR GSPAAARAAA IEARAHGTLG DRWAVHEAVG RAWSIAGTLS AEQLGRPGFS IDTLHVLTLS ELSAAAYVEL GMPGAASVYT DAAIHHLDAV GATGLRSMTR IAAATAAAKR RDLDHAVELV DEALNISRHR PSIVIGGRAE RFVGEARGYL GWHSLLDDLD ERLRTWRSPR LTT
|
| |