Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5483 |
Symbol | |
ID | 5673814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6633043 |
End bp | 6636342 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244338 |
Product | hypothetical protein |
Protein accession | YP_001509744 |
Protein GI | 158317236 |
COG category | [S] Function unknown |
COG ID | [COG3002] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.688472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.967507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACG CCGCGGACCA GATCGAGATG ATCACCCGGA TCGTGGAGGA GGCGGGCGAG CTCCTGCCGC CGCAGGCCCC ACTGGGGTAC TTCTCCCATC ACAACCCGCT GCACGCCCTG GAGGAGCTCC CGTTCCAGCG CGCGGTCGAG CACGCCTCGG CGATGCTGGG CACGGAGGCG CTGCAGACCG AGGAGGCCTT CGCCGCGCAT CTCGCGTCCG GCCGGATCCT TCCCCGCGAT CTCGCCGCGG TCCTCGAGCA TCACGGCGAC CGGGCGGGCG CTGTCCCCGG CCGGCATCTG CCGGTGGGAG ACCAGGGCCC GCGCGGGGCC GGGGAGGAAA TGCTCGACGG TGACGCCGAG GTCGTCCCCG GCGGGCCGAC GTGGAACGAG TTCCGGCTCG CCCGGCTCGG GCTGTTCATC GACGTGCCGC GTGGCGCCGG GGCGTTGTGG GCGCTGGCCG ACGGCGGTGA GCTGCACCGC GTCCACCCGC TGGTCACCGA GGCGCGGCGG GAGGAGCTGA CCCGCCAGGG CCGGCGCCGT TTCGCGACCA CGGAGCGGCG CACCCGGCGC ACCCGGCGCA GCCGGGCGGC CCGGCTGCAG GCGCTGCGGG CCCGGCTGCT CGCGCAACTG TGGGAGGACC TGCGCCGGCA CGCGCCGCCG CCGGCGCCGC GGCCGGCGCC GCTGCGCCGC CGCGACCAGG TGCTCGAACA GTTCGGCGTC GACACCGACG AGGCCGTCCA CCCGGTGCTG ATCCGGCTCT GCGCCGCCTT CCTCGACCAG GGAGTCGCCG CCTGGGAGAT GCCGCACCGG GAGAAGGGCC TGCTGGCCGC CTTCCGGCAC CTGTTCGGGA CACTCGGCGC CCCGCGGGAG GCGTGCTGGG CCGGCCTCGG CGGCCAGCTG CGCCAGCAGC TCCGGATGAA CTGGTCCGCC GAGCGGACGG TCGCCTGGGC GCTGTGGGCG CTGCAGGTGC CGGTGCACGC CTGGGCGGAC ACGGTGCGTG CGGCGCTGGT CTCGCTACGC GGCTGGGCCG GGATGGTCCA CCAGTTCGAG TGCCGTCCCG ACCGGGCGCC GTCGCGCCCG GCACCCGCCC GGCTCATGGA CTACCTCGCC GTCCAGCTCA CGCTCGAGGT CGTCGTCTCC CACAACGTCC TGGCGCGGCT GATCGGCCCC GACGCGCGGC CGGAGGACCT CGGCCCGCTC GGCCCGCCAG GCGCGGCCCA GGGTGTCCTC GCGACCGGCA CCACCGCCCA GGACGATCTC ACCCAGGGTG AGCAGGAACT TCGCGGGGGC GACCTGGAGC TTGCCTACGA GGCGTTCGTC CTCGCGCAGG TGATGGACGT CGAGACCGAG GTGCTCGGTC ATCCCCGGTG GGCGCGGGCC TGGCTGCGGG CCGTCGCCGA GTTCGACGCG GGCCGGCGGC GCTGGCTGCT GCACCTCGCC TACGAGCGCC GCTACCGGAC CCAGGTGCTC GACGCGCTGT CGGCGCACGA CCGGCGCTTC CCCGGGACGG TGCCGCCCCC CGACTTCCAG GCCGTGTTCT GCATGGACGA GCGGGAGGAG TCGCTGCGCC GGCACCTGGA GGAGAGCCAT CCCCAGGTCC GGACCTACGG CGCCTCGGGG TACTTCGGGG TAGCCATGGC CTACCAGGGC CTCGACGACG TCCGCCCGCG TGCGCTGTGC CCGGTCACGA TGACGCCGCG GAGCCTGGTC GTCGAGCGTG CGGTGGACGA CGGCGAGCTG GTCGCCTACC AGCGGGCCCG GCGCAGGAAG GCCCAGCTCC AGCACACGAT CTCGGCGGCC CGCGGCCGTC CGGCGCGGGC GGCGGCGTAC TCCGCCATCG CCGGGCTGGC CGAGCTGGTG CCGCTGGCCG CCCGCGCGGT CGCACCCAGG GCCGCTGGCG AGGGCGTGCG GATGCTCGGC CGCCGGGAGC CGGCGCGCCC GCTCGCCCGG CTCGTCATCG AGGCGGCGCA GGACCACACG CCGTCCACCG GACCGGCGGG GGATGGCGCG CAGGCGGGGG AGGCGGGGGA GCTCGTGCCG GGCGTGGGGG CGGGGCCGCT GCGCCTCGGG TTCACCGTCG AGGAGATGGC CGAGATCGTC GACACGCTGC TCACGACGAT CGGCATGAGC GGCCCGCTCG GCCCGGTCGT GTTCGTGATC GGGCACGGTT CGTCCAGCGT CAACAACCCG CACGCGGCGG CCTACGACTG CGGCGCCACC GGCGGCGGCC AGAGCGGCCC GAACGCGCGC GCGTTCGCCG CCATGGCCAA CCATCCGCGG GTGCGCGCCG CCCTGGCCCA CCGCGGCCGC CTGATCGGCC CGGACACCTG GTTCGTCGGC GGCCACCACG ACACCTGCGA CAGCTCCCTG GCCTACTACG ACACCGACCT GGTGCCCGCG CACCTGCGGC CGGCCCTCAC CGCAGCCACG GACGCGCTGC TGACCGCCGT CCAGCTCGAC GCGCACGAGC GCTGCCGCCG GTTCGAGTCG GTCGGCCCGG ACGTCGCCGC CGGCACCGCC CACGCCCACG TGCGGGGGCG CTCGGAGGAC ATCGGGCAGT CACGACCGGA GTACGGGCAC AGCACGAACG CGACCTGCGT GATCGGCCGG CGCTCGCGGA CCCGGGGCCT GTACCTCGAC CGCCGCTCCT TCCTGGTCTC CTACGACCCG ACCGCGGACC CCGACGGCGC CGTGCTCACC CGGCTGCTCC TGTCGGCCGC CCCGGTCGGC GCCGGCATCA ACCTCGAGTA CTACTTCAGC CGGATCGACC CGATCGGGTA CGGCGCCGGG TCGAAGCTGC CGCACAACAT CACCGGCCTG GTCGGCGTGA TGGACGGGCA CGGCTCGGAC CTCCGGACCG GGATGCCCTG GCAGTCGGTC GAGATCCACG AGCCGATGCG GCTGCTGGTG ATCGCCGAGG CGGAGCCCGA GCGGCTGGCC CGCATCGTGC GGGAGAACCC GCCGCTGCGC GGGCTGGTGG AGGGCGGCTG GATCCAGCTC GCCGCCTGGG ACCCGTCCGG CCCCGAGACC TACCTCTACC GCGACGGCGC CTTCGAGCAG CACCAGCCCG AGAACCTGCG CTTCCCGGTG GTGGCCCGCT CGGAGCACTA CTACGCGGGC CAGCGCGACC ATCTCCCGCC CGCGCACGTG CTCGCCGCCT TCGGTGAATC CCCGGACGCC GTGATCGACC GCCCGGGCAC GGTGGCCGCG GCGGGCGCGG GAGCCGCCCA GCCCACGCGG GACGCAATCG AGCTGCCGGA GCAGGCGAGC GGGCCGCTGC CCGCGCGGGA CGGACAGTGA
|
Protein sequence | MAYAADQIEM ITRIVEEAGE LLPPQAPLGY FSHHNPLHAL EELPFQRAVE HASAMLGTEA LQTEEAFAAH LASGRILPRD LAAVLEHHGD RAGAVPGRHL PVGDQGPRGA GEEMLDGDAE VVPGGPTWNE FRLARLGLFI DVPRGAGALW ALADGGELHR VHPLVTEARR EELTRQGRRR FATTERRTRR TRRSRAARLQ ALRARLLAQL WEDLRRHAPP PAPRPAPLRR RDQVLEQFGV DTDEAVHPVL IRLCAAFLDQ GVAAWEMPHR EKGLLAAFRH LFGTLGAPRE ACWAGLGGQL RQQLRMNWSA ERTVAWALWA LQVPVHAWAD TVRAALVSLR GWAGMVHQFE CRPDRAPSRP APARLMDYLA VQLTLEVVVS HNVLARLIGP DARPEDLGPL GPPGAAQGVL ATGTTAQDDL TQGEQELRGG DLELAYEAFV LAQVMDVETE VLGHPRWARA WLRAVAEFDA GRRRWLLHLA YERRYRTQVL DALSAHDRRF PGTVPPPDFQ AVFCMDEREE SLRRHLEESH PQVRTYGASG YFGVAMAYQG LDDVRPRALC PVTMTPRSLV VERAVDDGEL VAYQRARRRK AQLQHTISAA RGRPARAAAY SAIAGLAELV PLAARAVAPR AAGEGVRMLG RREPARPLAR LVIEAAQDHT PSTGPAGDGA QAGEAGELVP GVGAGPLRLG FTVEEMAEIV DTLLTTIGMS GPLGPVVFVI GHGSSSVNNP HAAAYDCGAT GGGQSGPNAR AFAAMANHPR VRAALAHRGR LIGPDTWFVG GHHDTCDSSL AYYDTDLVPA HLRPALTAAT DALLTAVQLD AHERCRRFES VGPDVAAGTA HAHVRGRSED IGQSRPEYGH STNATCVIGR RSRTRGLYLD RRSFLVSYDP TADPDGAVLT RLLLSAAPVG AGINLEYYFS RIDPIGYGAG SKLPHNITGL VGVMDGHGSD LRTGMPWQSV EIHEPMRLLV IAEAEPERLA RIVRENPPLR GLVEGGWIQL AAWDPSGPET YLYRDGAFEQ HQPENLRFPV VARSEHYYAG QRDHLPPAHV LAAFGESPDA VIDRPGTVAA AGAGAAQPTR DAIELPEQAS GPLPARDGQ
|
| |