Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6956 |
Symbol | |
ID | 5675269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8474523 |
End bp | 8479025 |
Gene Length | 4503 bp |
Protein Length | 1500 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245805 |
Product | NB-ARC domain-containing protein |
Protein accession | YP_001511196 |
Protein GI | 158318688 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.270488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCCG AGCCGGTGTC AGCGCAGGTT CGCGCGTCTC CTCGCGAGAT CGCGGATGCG GTCTGGCTTT CCGCGTTCCT GGAGCCGGAG CCGAGCGTCC CGGAGAGCAC CGTGGACGCA CACGGCGACG TCCAGCCGAC GCGGCCCGGC CCGGCTGGAC CGGCCGCCAC CGAGGTCATC GCCGATGACT ACCGGAGCCA TCCGGAGGCG GGTGAGCGAC CGGCGGAAGC GCCGGGCACC GGCGCCCTCC CCCGCGCCTC TGACGGGATG GTTCCAGCCA GATCACCACG GAACGTGAAT ATGCAGCCCT CGGTGGCCAC GTTCGCAGAA GTTTCCATTC CCGCCCGGCA CGAGGTCGCC GGCGTCCAGA CCACCACGGC CGGAGCCTAT ACACCAGTCA CGCTGAGTGA GACGGCTCGA GGTATGGCCC GCGCCCTGGC AAGCCTCCGG CAGACTGTGT CGTCGGCGAT CAACGTAGAT CTCGACGAGG AGGCCACCGC TGAGCGGCTC GCAGGTGAGG CGCTCGCCAT GCCTGTTCAC CGCGCACGGC TCGAACGGCG ATGGGACATG ACCCTCGTCA TCGACACGAG CACCTCGATG CACGCGTGGC GCGACGAGGC GGCGCGGCTG GTCGCAGCGC TCGAGCGCTG CGGGGTTTTT CGTGACGTGA ACCATTGTTA TGTCGACACG GATGTCGAGA CGGCCTCTGA GCTGCGGCTG CGTGGATCGC GCACTTCCGC CGGCACGCGG TCGCCCGACC TTCTGGTACG GCCCGGCCTC GACCACGTTG TCTGGATCTA CTCGGACACC CTGGGCAGGG CATGGCGTTC GCACGCCGTG TTCTCGCTTC TGTGGCGATG GGCCGGGAAG GCGAACACCG CCGTGCTGAC GCCGATCAAA CGGCGCATGT GGCACAACAC GAACATCCGT TCATATCCCA TGCTTGGTTC GGCGAGGCTT GGCCCGGCGC CCGCCTCGGG AATGTCGTGG TCCTTCCGCC AGTCCTGGGA CAAGAGGCTG TTCGCGCTGG ACGTCGATAT AGAAAACGCA CGTCCGATCC CTGTTCTGGA ACAGAGTCGC CATGCTGTGG AAAAGTGGGC GCGGGCGCTG GCGGGCCGGG CTGACGGCCG AACCGAGCTG CCGGTCATGC TCGTTCCACC GGTCACCCGC ACGGTTGTCC CGGCGGAGAC CGCGGATACC GACGCGACAC GCCGGAACTC CACCGACGAT GCCGGCTGGC AGCAGGTGGC GGCATTCCAC AATGCCGCCA CCTCACCAGC ATTCGACCTG GCGACGCATC TGGCCGCCGC ACCGCTCACC TGGTCAATGA TCGATCAGGT GATCGCGATG ACCCCTGGTG CCGACCGGCG AGAACTCTCC GAGCTGTTCA TGCACGGGCT CCTCACCCGG ACGGGCCCGT CCGCCGCGGC GGCCACCGGG CACGTGCCGG GAGCCGAGGC GGAGATAGTC CTCGACTTTC TCCCCGGAGT TCGTGCCAAC CTGCTCGCGT TCGGCCGTCA GCGCGACACC ATCCGCGTAC TGAAGGCTGT CTGTGACCAT CTGGGACCCA ATATCGCGAT GGTTCGACAT CTTCGGCAGG CGATAGATTC GCCCGGATCG GCGCCGATTC CTGAGGTGAC GCCCGCCAGT GCGCCTTTCC TCGCGGTGGA GGAGACAGCG TTGTCCGCCT TGTCCGGTCC GTTCCTGGCG CGAGCACGCC GTATTCAGGA ACATTTTGTG GCAGCGGCGG ACAAGCAGCA CATCCTCACA TCCACCCAGT CACCGACAGT AGTTAGTAAC ATGCCCTTGG TGACAGTTGA TATTAGCGCT GCGTCAGACA CCGACAGCGG CGACGAGATC TCCGTTCCGC CGCCACCCCG CGCGCCGACG GACGGCGACG GCCCAGACCG CCTAGGGGAT AGCCAGCTAG GGGGTAACCG GACACATCTG AAAGGAGATG TTTTGACCTC GACCACCGAG ACCTCGACGG CGAAGCAGCA GCAGGCGCGC CAGCACCCGG TGGTGTTCGG AAACGTCCCC CAGCGAAACC CGTATTTCAC CGGGCGCAAC GGACTTCTCC GCGAGCTTCA CACGAGGCTC GGCCACGGCA CCACCGCGGT ACTGCCGGAA GCTCTGCACG GAATGGGCGG GGTGGGTAAA TCCCAGCTCG CGGTGGAGTA CGTATACCGG CACCAGGCCG ACTACGACAT CGTGTGGTGG ATACCGGCCG AACATTCCAC ACAGATCGGC AAGGCGTTGG CGGAACTCGC CCAACGCCTT GGCCTGTCGG TCGGCGGGGA GGCGAACACG GCCGTTCCGG CGGTGCGGGA GGCCCTGCGG ATCGGCGTCC CCTACGGCAA CTGGCTCCTC GTTTTCGACA ACGCGGAGGA TCCCCGCGTC GTTCGAGAAT ACTTCCCACA GGGCGGAAAC GGAAAGATCC TGGTGACGTC GCGAAATGCG CAGTGGTCCA GTATCGCACG CCCTCTGGAG GTGGACGTCT TCAGCCGGGA GGAGAGCGTG GAGCTCCTTC AGAAGCGCGA CACGGACCTC ACTGACCACG ACGCCGGCCG CCTCGCCCGG GCGCTCGGAG ATCTTCCTCT CGCGGTGGAG CAGGCGGCCA CCTGGCGCGC CGAGACGGGA ATGAGCGCCG ACGAGTACCT CACCCTCTTC CAGGAGAAAA GGGACGAGCT GCTCGGCACC TCCCCTCCAA TGGACTACGA GGTCCCCGTC CAGGCCGCGT GGAACCTGTC CCTGGATCGA CTCGCAGACC GCAACCCGGC GGCGCTGAGG CTTCTCCAGG TGTGTTCCTT CTTCGCGCCG GAACCGATTC CGCGGCAGGT CTTCCGGCGC GGCCGCAACA TTATGATCAT GCCGGAGCTG GACGCGGCCC TGCGCGACCC CTTCAAGCTG AACATGGCCA TCCGGGAGAT CACCCGATAC GCGCTCGCCC GGGTCGACCA CCGGACGAAC TCGATCCAGA TGCACCGCCT GGTGCAGACC GTGCTGCGCG GTCGGATGAC CCCCGACGAG CGGGAGACGA TGCGGCACGG CGCCCATCTG CTGCTCGCCG CCAACGATCC GGACGAGCCC AGCAACCCGG AGAACTGGGA GCAGTACTCC GAGCTCTATC CGCACGTCAT CGCGTCCGAG GCCATCGGCA GCCGTGACCC CTTCGTCCGC GACCTGCTCG TCCACGAGGT GGAATACCTG TTCCGGTGGG GTGACCACGA GGGAAGCCTC ACGCTCGCGC AGCAGACGTA CGACGCCTGG ACCGGCAATC CGGATCTTGG CGAGGAGGAT CCGCACAGCA TCACCATGGC CGGCTGGGTC GGCTGGGTGA GCTACATAAC CGGCCGCTTC GCGGATGCCG CCCGGGTCAA CAAGCGGCTG CTGGAACTGT GCGAGCAGGT CCACGGCGAC AACCACACCG AGACCCTCGA GGCGCTGGGA AACGTCGGCG CGGACCGGAT AGTCGCCGGC GATTTCGAGG ACTCGCTGAG ATTCGCGCGC GAGCGACACC GGCGCGCCCT GCGCGCCTAC GGGCCGGGTG ACGCGGTCAC TCTCGACGCG GCCCACAACG TGGGGCTTGG CCTGCGGCTC CTCGGCAGAT TCCAGGAGGC CAAGGAACTC GACCAGGAGA CCTGGGAACG CCGCGTACAG CTGGTCGGCG AGGACAACAT CGAAAGCCTC CGCACCTACA GCAACCTTCT CGTGGACGAA CGTGAACTCG GCGACTACCA GGGGGTACGC ATCCGGCTCG AAGACATGGT GGAACGCGTC CGCCGGCTGG TGAAGAACAT CGAGGACCAT CACGAGCTGC TCCGGGTGTC GGGGCTCCTC GCGATCGCGC GCAGGAAATC CGGTGACCAC GACAGCGCCC TCGAGCTCTC CCGAGACGTG GAACGTCGCT CCCTGCGGCG CTACGGCAAG GACACTCCCC GCACGATCGG TGCCGCGCTG GCCCTGTCCA TCGATCTCCG GCATGCCGGG GAGCTGGCCG AGGCACGCGA GCTGTGTGAG GCCACCAGAA GGCGCTTCGA CCGCGCGTTC GGTGCCACCC ATCCACACAC TCTCGCCGCG ACGGTGGACC TAGGTGTCAT CTCACGGCTC GCCGGCGATC TGGAGACCGC GTCCGAGCTG AGCCGAACCG GCCTGGATGG GCTTCGCAGC AGGCTCGGCG AGGACCACGC ACACACGATG ATCGCGGCGA CGAATCTCGC CAGCGACCGC TACGCCCTCG GCGAGTTCCA GACCGCGCAC GACATGGACG TGGCGACGCT GGAACGCAGC CGCCGCGTCC TCGGCGAGAA CCATCCGTCC ACCCTCGCCT GTGCCAGCAA CCTGGCGCTG GACCTGCGAG CCCTTGGCGA GGACGGCCCG GCGGAGAACC TCCTCGCCGA CACCGTCGTG CAGCTCGACC GGGCTCTGGG CAAGGGCCAT CCGGCGACCC GCGCCGCCGC CAGTTTCGTC CGGGCCGACT GCGACATTGA TCCGTTCGTG TGA
|
Protein sequence | MAPEPVSAQV RASPREIADA VWLSAFLEPE PSVPESTVDA HGDVQPTRPG PAGPAATEVI ADDYRSHPEA GERPAEAPGT GALPRASDGM VPARSPRNVN MQPSVATFAE VSIPARHEVA GVQTTTAGAY TPVTLSETAR GMARALASLR QTVSSAINVD LDEEATAERL AGEALAMPVH RARLERRWDM TLVIDTSTSM HAWRDEAARL VAALERCGVF RDVNHCYVDT DVETASELRL RGSRTSAGTR SPDLLVRPGL DHVVWIYSDT LGRAWRSHAV FSLLWRWAGK ANTAVLTPIK RRMWHNTNIR SYPMLGSARL GPAPASGMSW SFRQSWDKRL FALDVDIENA RPIPVLEQSR HAVEKWARAL AGRADGRTEL PVMLVPPVTR TVVPAETADT DATRRNSTDD AGWQQVAAFH NAATSPAFDL ATHLAAAPLT WSMIDQVIAM TPGADRRELS ELFMHGLLTR TGPSAAAATG HVPGAEAEIV LDFLPGVRAN LLAFGRQRDT IRVLKAVCDH LGPNIAMVRH LRQAIDSPGS APIPEVTPAS APFLAVEETA LSALSGPFLA RARRIQEHFV AAADKQHILT STQSPTVVSN MPLVTVDISA ASDTDSGDEI SVPPPPRAPT DGDGPDRLGD SQLGGNRTHL KGDVLTSTTE TSTAKQQQAR QHPVVFGNVP QRNPYFTGRN GLLRELHTRL GHGTTAVLPE ALHGMGGVGK SQLAVEYVYR HQADYDIVWW IPAEHSTQIG KALAELAQRL GLSVGGEANT AVPAVREALR IGVPYGNWLL VFDNAEDPRV VREYFPQGGN GKILVTSRNA QWSSIARPLE VDVFSREESV ELLQKRDTDL TDHDAGRLAR ALGDLPLAVE QAATWRAETG MSADEYLTLF QEKRDELLGT SPPMDYEVPV QAAWNLSLDR LADRNPAALR LLQVCSFFAP EPIPRQVFRR GRNIMIMPEL DAALRDPFKL NMAIREITRY ALARVDHRTN SIQMHRLVQT VLRGRMTPDE RETMRHGAHL LLAANDPDEP SNPENWEQYS ELYPHVIASE AIGSRDPFVR DLLVHEVEYL FRWGDHEGSL TLAQQTYDAW TGNPDLGEED PHSITMAGWV GWVSYITGRF ADAARVNKRL LELCEQVHGD NHTETLEALG NVGADRIVAG DFEDSLRFAR ERHRRALRAY GPGDAVTLDA AHNVGLGLRL LGRFQEAKEL DQETWERRVQ LVGEDNIESL RTYSNLLVDE RELGDYQGVR IRLEDMVERV RRLVKNIEDH HELLRVSGLL AIARRKSGDH DSALELSRDV ERRSLRRYGK DTPRTIGAAL ALSIDLRHAG ELAEARELCE ATRRRFDRAF GATHPHTLAA TVDLGVISRL AGDLETASEL SRTGLDGLRS RLGEDHAHTM IAATNLASDR YALGEFQTAH DMDVATLERS RRVLGENHPS TLACASNLAL DLRALGEDGP AENLLADTVV QLDRALGKGH PATRAAASFV RADCDIDPFV
|
| |