Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2653 |
Symbol | |
ID | 5671046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3138172 |
End bp | 3141513 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641241568 |
Product | NERD domain-containing protein |
Protein accession | YP_001506988 |
Protein GI | 158314480 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCAG TGATCGATAT CCTTTGGGGC TCGGAGCCGG TAGAAGCCTC CGAGCAGCAC TTCCTCGGTC GGCTCCAGGC TGATCTGCAG GCGCATGGCG TCACGGCTAC CGTCTTTGCC AACTTCCACA CCGCCGGCTC GCGGCAGGTC GACTTCCTCG TGATCACTCC CGGCCATGCC TGCCACGTGG AGCTGAAGGC ATACCGCGGC CCCATCCACG GTGGCCGGAA TGGCCAGTGG TCGAGCACCC GGCCGGATGG AACCACCCAG GTCATCGAGC GCTCACCGAA CCCGTACGAG CAAACCGTCA GAGCCCAGCA GTCCATCAGT GACGACATGC GGGCCTTTGC TTCGGGACGG GCTGGCACGC CGCGACCATC GGATGGCCGT CAGTTCTACA CGTGGTTCGA CAGCGTCCTG TGCGTCTATC CCAGACTTGC CGTCGGGTCG GAGGTCCCGA GCGACTTCAA GGTTAAGACG TTCGGCTACC CCGAGCTCCT CGACTTCTTG CTGAAGCCCG GCAAAAATCC CCCGTGGGAT GCCGGGATCT GGCGCGAGTT CGGCATGCAT CTGAACCTTG TTCGCCCTGA GCGGCCGGAT GGTCCGTCCT TGGCTGCCAC GCGGGCACAG CAGGCGCTGG ACGACTACAC GCGACGCTTC AGCCAGTTCC ATCGCCAGGG CCTGCATGAG CTGGTTCCCA GCACCCTTGA AGCTGGTGAG ACTGCGCATC CCTCTTCCGA GCTACCGACG CTACTGGCTG GCACACCGTA TGCCCAGATA GTGGCGCCGT CCGGCTACGG CAAGAGTCAT CTCGCCCATC ACACGGCGCT GACTCTGGCG TCTGAGGGCA CCGTGCCCAT CTTCCTCAGC GCGGCCCGCT ACGAGGGCCG GCTTTCCGTG CTCATGGATC GTGCGGTTGC CCCGTATTCG CCGCAGCCGG CGCTGAATCT CTTGCGGGCG GCGACCTCGG TCGGGCGTGA CGTACTGCTT GTAGTGGACG GCTTCAACGA GTGCCCGGAG CGGGAGCGGG ATGCCTTAGT CCAGGACATC AGTGCGCTTT GCCTGCGGTC TCCGTGCCGT GTATTGGTGA CGGCCCAGCA CTCGGTGTCG TTTGCAACGA TTACCGGCTG GCAGGAGTTT CGGCTCCGGA AACTGGACGA GGACGAGCGC CGGGCGGTGC TCGCATCGTA TGACGCTTCC GTACCACTAG ACCTCTGTGG GCCCTTCGAG ACGGCGTACG AACTCTCGAT CGCCGCGGAG TGTTCCAGCG AATTGAGGGA AGGGGCTACA CGCGCTACCC TCCTCGACGC CTTTGTCCGT CACCGGTTGC AGTCGGGGCG TTCCCCGGCC CTAACCCGAA GCTTGCTTCG TCGGCTGGCC GTGGTTATGG ACGAGCGATT GGTAACCGCC CTGCCGATCG GTGACGTCTG GCGCATCGGT GAGGCGGCGC TGCGGGAGCA GCAGACACCG TCGGACATCC TGGACGAGGT CTTCCGCGCC TCCGTAGTCA GCGTGGAACA GGGCGTGCTT ACTTTCTCCC ATGAACTGAT CGGGCGGTAC TTGGCCGCCG AGGAGCTGCT GCTCGCGGCC GGCACCAACA TGGACGACCT GACATCTGAG CTGCAACGGC CGCGGCACGA GGATCTACCA GCGTTGGTGA TTCCACTTGA GACCTCAGAG GACTGTCTGC GCCAACTGTT CAACTGCCTG GCCAGCAAGC ACCTACTGGT TGAGGCTCTG GGGAACAAAC TGGGGCCGCG AGCCCGAGCA GTAGCTGTCA TGGAGGCCGA GCGGGTCTTG GCGGAACTCT GTGAAATGAC AGCCGGGCTC AAATTAGTGT TCGGAAGCAC CTTTGAGACC ACCGTCACCG GTGGTCGAGA CGTGACCGGC TATGAAGCTG CGGTTCTCGC CGCTGTCGGA GATGGTCTAG CTGACGGAGA TTTCCTGGGG CCCGCAATGC GGTTGCTGGA TGCCACCGAC GAGGCATGCC GGACATCGAC GGCTGCGCAG GCGGCTTCCG GCCATCGGCC CACTCCTTCC GACATCGCCG CAGCGGTAGT GTTGAATTTT GCGGAACCAG GCTCGCGGAG AAAGATCGCC TCCCATATTC TGTTAGATTC GGCGCGCCTC GGCTGGCCCT GGCGTTACAG GCATCGGAAG GGTGTCATCT CCTCCGGCGG CCTTGAAGAG GTATCTATGG GCGGTGGCGA GGACAACTAC TGTCGGCTAT TCTTGATGGG CCTGCTACTG GACAGAGTGC CGCTGGATGT CGGCCAAGAG ATCGTGATGT CCTGGCTTCG GATGTGCTGG CAGTCGGGTG CTTACCACGT CCAACTTCAA GCCCTTGAGT CTATTCGTGC GTATTGCCAG CTTGAAGCAG GGCCTTTGCG TTCCTCGCTC GTTGACTATC TTTCTGACCT GCAAACCCAG AACTTGGGCT TGTCGACGAC GATCGTTGAG GCGCTGTACT CGTTCGGAGA AATTGAGCCG CAGATTAGTA GAAAGGTGGC CGACGATGAG ATTCGAGAAG TTTTGGAGAA GCCCGCCGAC GATAGTGAAG CCTGTGGGCG TGCTTATGGC ATAGTTTCAA ACAGTTTCGA AGACGTTCTG GGGCCGCACT ACTTCGAAGC TATCGAAGCG CTCGGGAAAA ATGATCGAGT TCGCTTGCTC ACCAAGGCGG CTCTCGGAGT CAATCATGGA TTCTGGTTGG ATTGGGTTCT CGGAGAGCTA TTAAAACTCC GTGATCCTCA GGCGATCCCC GCCTTCGAGA GATATGCGAC GGCCTTTGAT GTCAGGTCGC CCTCTCCGCA GGAAGCGGTT AGTTGCTATA TTCTTGCCAT GCAGGGTTGC GGAAATTTCC TCGACACTCC GCCAAGGTTT CATCAGTCAA TGACCGTCGA CCTTGAAGCA TGGCAATGCT ACGGAGCAAT CACCTTCTGG CTGGCGCATC CGATGCCAGC GATCGAGCGC GCAGACCGCT GCGCACCGAT CTGGGCGCGT TTGGCAACAG AACTGGTCGA AGCTGCGGCG GATCCGCTAT ACCAGTTGGC ACAAACCTAT GTTAAGGACC AAGATGCAGG CAAGCACCTC GGCCATAATC TTGTGCTAGA CACATTTCCG GACGAGGTTC GCGTAATTCT TGAGGCCGCT GCCTCAAATT TTGACCGGCA AACAGCGATA TTTCGAGCAT TCGATCCCAT CGAGCGATCA CAATATGTTC TGCGAACCCT CGGGTTGGTC GGCAATGAGG GTTCGCTGCG CCTGATCGAG CCATACATCG AAGACCCGTC GCTCGGATCG ACGGCGATAG AGACGCTCAA AGCACTTCGG GCGCGCTTTT GA
|
Protein sequence | MTSVIDILWG SEPVEASEQH FLGRLQADLQ AHGVTATVFA NFHTAGSRQV DFLVITPGHA CHVELKAYRG PIHGGRNGQW SSTRPDGTTQ VIERSPNPYE QTVRAQQSIS DDMRAFASGR AGTPRPSDGR QFYTWFDSVL CVYPRLAVGS EVPSDFKVKT FGYPELLDFL LKPGKNPPWD AGIWREFGMH LNLVRPERPD GPSLAATRAQ QALDDYTRRF SQFHRQGLHE LVPSTLEAGE TAHPSSELPT LLAGTPYAQI VAPSGYGKSH LAHHTALTLA SEGTVPIFLS AARYEGRLSV LMDRAVAPYS PQPALNLLRA ATSVGRDVLL VVDGFNECPE RERDALVQDI SALCLRSPCR VLVTAQHSVS FATITGWQEF RLRKLDEDER RAVLASYDAS VPLDLCGPFE TAYELSIAAE CSSELREGAT RATLLDAFVR HRLQSGRSPA LTRSLLRRLA VVMDERLVTA LPIGDVWRIG EAALREQQTP SDILDEVFRA SVVSVEQGVL TFSHELIGRY LAAEELLLAA GTNMDDLTSE LQRPRHEDLP ALVIPLETSE DCLRQLFNCL ASKHLLVEAL GNKLGPRARA VAVMEAERVL AELCEMTAGL KLVFGSTFET TVTGGRDVTG YEAAVLAAVG DGLADGDFLG PAMRLLDATD EACRTSTAAQ AASGHRPTPS DIAAAVVLNF AEPGSRRKIA SHILLDSARL GWPWRYRHRK GVISSGGLEE VSMGGGEDNY CRLFLMGLLL DRVPLDVGQE IVMSWLRMCW QSGAYHVQLQ ALESIRAYCQ LEAGPLRSSL VDYLSDLQTQ NLGLSTTIVE ALYSFGEIEP QISRKVADDE IREVLEKPAD DSEACGRAYG IVSNSFEDVL GPHYFEAIEA LGKNDRVRLL TKAALGVNHG FWLDWVLGEL LKLRDPQAIP AFERYATAFD VRSPSPQEAV SCYILAMQGC GNFLDTPPRF HQSMTVDLEA WQCYGAITFW LAHPMPAIER ADRCAPIWAR LATELVEAAA DPLYQLAQTY VKDQDAGKHL GHNLVLDTFP DEVRVILEAA ASNFDRQTAI FRAFDPIERS QYVLRTLGLV GNEGSLRLIE PYIEDPSLGS TAIETLKALR ARF
|
| |