Gene Information |
Locus tag | Franean1_7004 |
Symbol | |
ID | 5675315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8536545 |
End bp | 8539700 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245850 |
Product | hypothetical protein |
Protein accession | YP_001511241 |
Protein GI | 158318733 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
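The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 8536545
end_bp = 8539700

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # compare with the Gene Length field
print(protein_length_aa, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (3156 bp) and Protein Length (1051 aa) fields listed in the table.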
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.244252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGGC AGATGACGGT GATTCGGACC GCGGCTGTGC CCTGCGCGGA GGTCGACGAG GCGGTCGCCG AGGACGAGCG GGCGCTCGGA GCCGCGACCG ACCTCGTGGC GAGGATCGCG GCGGCCCAGA ACCTCACCAT CTCCCTACAG TCGCGCTTCC ACTGCCATCG GGATCTCGCG GACCTCGACC GGGCCATCGA GGTGATCAAC GAAATCGGCC GGGATGTCGA CACGGCTCAT CCCGGCCGGC TCATGCTCGA CACCCAGATG ATGTCCTGTC TGATCACATC CACCCGGCTG GAGGATGCCA GGCGGGCGCG AGAGCTCGGA GCGCGGCTGT ACCGTCTGAC GGGCCAGCTG GGCGAGATAA GGGCGGGTGT CGCCGTCATG CTGGGCATGG CCGCCATGGT CTGCTTCCGG CACACGTCCT CCGTCGAGGA CGCCGAGGAG TTCGTCGTCC TCCTCGAGGA GGCGGTCGGC GGCGTTCCGG CCAACTCGCC GTCCAAGCTG ATCGTCCAGG GCGCACTGAT CGCCGCGCTG TCCGTGCGAT ACCGGCTCAG CCCGAACCGC GAACGGGATC TGGACCGTGC GGTCGAGATC TTCGACGAGC TCGACCCGGC AGCGAGCGGC GGAGACAAAG AGGGAGTCGT CCGCTTCGCC TGCGGCGCGC TGGTCGACGG CCTCGTCGAC CGGCACGAAC GGCACCACGA CCCGGCCGAC CTGCGGACCG CGCGGCGAAT CCAGGACTAC ATGCCGGTCG GTGAGGCGAG TGGTGGCCTG ACGGAGAGCC CCGCGGCGTG GCATCCGCAG GCCTTCCGGC ACTACTCCTC GTTTCGGCAG ACGCAGGATC CCCAGTTCCT CGATCAGGCT ATCGAGGAGC AGGACCGCGC GATGAGCGAC CCGAGCCGGT CCGGCAAGCA ACGCGCGAGT TACGCCTACA TGCTCGCGAT CTGCTGTTTC ACGCGTTTTC TCCTCTACCG GGGCCGCCGG CTCGGGGATC TGAACCGGGC CATCGACCTC GTCCGCGAGG CGCGCCAGGA CTGGCGCGAC TTCACCCTCG TCCACAGTTC CGCGCAGCTG CTCGCCCGCT TCCTCATCCA GCGCCACATG AGCCTCGGAC GCCCGGCCGA TCTCCGCGAG GCCCGCGTCG TGCTGGAGTC GGCGCTGCGG ACCGGAGAGG TGGAGAACCG GACGAAGTCC TCGATCCTGA TGCTGCTGGC CTACTGCGAG GCGGTCCGGG CCGAGCGCGA CGGCGACGAC GAGGCCGCGG ACCGGTCGAT GGACCGCTTC GAGGAGCTCA CCCGCCTGAT GCCGGAAGGG TCGATGGAGG CGATGGCGGT GCAGACGACA CTCGCCAACG CGTTGCAGTC CAAAGCGTCC GCGAGCCAGC TTCCCGAGGA CGCCACCCGG GCGTACCGGG CGTCGCGCGC GGTGTGCTCG CACGCGGGCG CGAGTCCCCT GGCGACGGTG CCGTGCGCCG AGGGCTGGGG CCATCTCGGC TGGCGGCGGC AGGACTACGC GGAGGTGGCG GAGGCGTACG GACACGCGGT CGCGGGCCTG CAGGCCATTT CCGTTCTTTA CCCGCGGCGC GAGGACAAGC AGGACTGGCT GGGCCGCGCC GGGTCGATGG CCGCGCGCGC GGCGTTCGCG CTGGCCCGCC TCAACCGGGC CGAGGAGGCG GTCGAGGTGC TCGAGGCGGG ACGGGCCGTC CTGCTTTCGG AGGCCCTGGA CCGCCAGCGG GTGGATCTGG ACCGGCTGAG CGCGGTCGGT CACGGCGAGC TGCGGGAGCG GTACGAGCGC 
GCCGCGGCCC GCGGCGCCGA GGCCGAGCGC AGCTTCGAGC CCGACTTCGA CATGGAGTAC GTCCCGGTGT TCGATCCGAC CCTGCGGGAC CCCGAAGCGC CGCTGCGCGA GCTGGCCACC GAGCTCGGTG ACGCCCGCGA GGAGCTGCGA CGGGTGGTGG GCGAGATCAG GTCCGTGCCG GGCTACGAGT CCTTCCTCCT CCCTCCCTCG TTCGCCGACG TCCAGAGGGT CACGCGGGCC GACCGGGTCC CGTTGGTCTA CCTGGCGGCC ACGCCACGGG GCGGCCTCGC CCTCGTCGTC ACCGCATCGA AGGTGACCCC CGTGGAGGTG GTGTGGCTGC CCGAGCTGAC CCTGGCCAAC CAGCGCAGGT GGGTAGGGCG GTGCGTCCAG GCCGCCGAGG AGGACGACCT CGACGCTTAC GAGAAGGCGT CCCGATGGCT CGGCCGGGCG GTGATGGACC CGGTACTCAG GCTGCTCGCG CCGTACCGGC GAGCCGTTCT CATGCCCGGC GGCCTGCTGG GCTCCGCGCC GCTGCACGCG GCGAGGCTGG CGGAGACCGT CGGCGGCGCG GGCACCTACG CGCTCGACCG GCTCACCCTC GTCTACGCGC CCAGTGTCCG CGCGCTCGCC GCCACCCGGA TCGCGCCGGG CCCGGCCCGC TCGGGTGATC TGCTGCTCGC CGTGGCGGAC CCGGCCTCCT CGGGTGCCGA CCCGCTGCCG GGTGCCCAGG TCGAGACCGA CATCGTCGCG GCCCATTTCG GCCCCGAGGC CCAGGTCCTG CGCGCCGGAC AGGCAACCCG GGCGAGAGTG CTCGCGGCGA TGTCCGGATC CGACGTGCTG CACTTCGCCT GCCACGGGGT GGCCGAGCCG ATGGATCCGC TCAGCAACCG GCTTCTGCTC GTCGGCGACC AGTCGATGAC GTTGCGGGAC ATCGGCGGGC TCGACCGGAT GAGCCCCCGG CTGACCGTGC TGTCGGCCTG CCAGACGGCC GTCATCGGTG ACGATCTGCC GGACGAGTTC GTAAGCCTCG CCACCGGCCT GCTGCAGGCC GGATCGCCGG GCGTGGTGGC CACGCAGTGG TCGGTGGACG ACGTCGCCAG CGCCGCCCTG ATGGCCCGCT TCTACCAGCT CTGGCGGGAT GACGGAGTGG AAACGGCGGA GGCCCTGCGG CGGGCACAGC TCTGGGTGCG CCGGACGACG AACGGCGAGA AGGTCCGCGA GTTCCCGGAT CTCCTCCGCA TCCCCGACTA CGGTCCGCCG ACCGACGCCT CGCCGGCCGT TCGCCATGCC TGGGAGACCC GACGGGCCCA TGAGCATCCC GTCTACTGGG CCGCGTTCAC CTTCCTCGGC AGATAG
|
Protein sequence | MAGQMTVIRT AAVPCAEVDE AVAEDERALG AATDLVARIA AAQNLTISLQ SRFHCHRDLA DLDRAIEVIN EIGRDVDTAH PGRLMLDTQM MSCLITSTRL EDARRARELG ARLYRLTGQL GEIRAGVAVM LGMAAMVCFR HTSSVEDAEE FVVLLEEAVG GVPANSPSKL IVQGALIAAL SVRYRLSPNR ERDLDRAVEI FDELDPAASG GDKEGVVRFA CGALVDGLVD RHERHHDPAD LRTARRIQDY MPVGEASGGL TESPAAWHPQ AFRHYSSFRQ TQDPQFLDQA IEEQDRAMSD PSRSGKQRAS YAYMLAICCF TRFLLYRGRR LGDLNRAIDL VREARQDWRD FTLVHSSAQL LARFLIQRHM SLGRPADLRE ARVVLESALR TGEVENRTKS SILMLLAYCE AVRAERDGDD EAADRSMDRF EELTRLMPEG SMEAMAVQTT LANALQSKAS ASQLPEDATR AYRASRAVCS HAGASPLATV PCAEGWGHLG WRRQDYAEVA EAYGHAVAGL QAISVLYPRR EDKQDWLGRA GSMAARAAFA LARLNRAEEA VEVLEAGRAV LLSEALDRQR VDLDRLSAVG HGELRERYER AAARGAEAER SFEPDFDMEY VPVFDPTLRD PEAPLRELAT ELGDAREELR RVVGEIRSVP GYESFLLPPS FADVQRVTRA DRVPLVYLAA TPRGGLALVV TASKVTPVEV VWLPELTLAN QRRWVGRCVQ AAEEDDLDAY EKASRWLGRA VMDPVLRLLA PYRRAVLMPG GLLGSAPLHA ARLAETVGGA GTYALDRLTL VYAPSVRALA ATRIAPGPAR SGDLLLAVAD PASSGADPLP GAQVETDIVA AHFGPEAQVL RAGQATRARV LAAMSGSDVL HFACHGVAEP MDPLSNRLLL VGDQSMTLRD IGGLDRMSPR LTVLSACQTA VIGDDLPDEF VSLATGLLQA GSPGVVATQW SVDDVASAAL MARFYQLWRD DGVETAEALR RAQLWVRRTT NGEKVREFPD LLRIPDYGPP TDASPAVRHA WETRRAHEHP VYWAAFTFLG R
|
| |
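As a quick consistency check between the two sequences above, the first 30 bases of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is only a sketch: `CODONS` is a hypothetical, deliberately partial codon table covering just the codons in this excerpt, not the full translation table 11, and `translate` is an illustrative helper rather than any library's API.

```python
# Partial codon table: only the codons that occur in the excerpt below.
# These assignments are the same in the standard code and in table 11.
CODONS = {
    "ATG": "M", "GCC": "A", "GGG": "G", "CAG": "Q", "ACG": "T",
    "GTG": "V", "ATT": "I", "CGG": "R", "ACC": "T",
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence, as printed in the record.
gene_prefix = "ATGGCCGGGC AGATGACGGT GATTCGGACC"
# First 10-residue block of the protein sequence, as printed in the record.
protein_prefix = "MAGQMTVIRT"

translated = translate(gene_prefix)
print(translated, translated == protein_prefix)
```

A full check would need the complete codon table and both complete sequences; the excerpt is kept short here so the example stays readable.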