Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5301 |
Symbol | |
ID | 5673635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6378089 |
End bp | 6382027 |
Gene Length | 3939 bp |
Protein Length | 1312 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244158 |
Product | hypothetical protein |
Protein accession | YP_001509565 |
Protein GI | 158317057 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.437695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGCCG GCTTCGTCCT GCTGTTCCTC CTCCAGGCAC CGGGGAAGCT GACGGCCGAC ACCAAGCTCG ACGTCCCGCT CGAGCCGTGG CGGTTCATGT CGGCGGCCAC GCACCTGTGG AACTCGACGT CCGACTTCGG CTTCCTGCCG AACCAGTACG CCGGCTACCT GTTCCCGATG GGCCCCTTCT TCGGGCTGGG GAACCTGCTC GGCGTGCCGC CGTGGATCAC CCAGCGGCTG TGGATGGCGG TGCTGCTCAC CACCGCCGCC TGGGGCACGG TCCGCCTCGC CGAGGCGCTC GGGATCGGTC GGCCGAGCGC CCGGTTCCTG GCCGGCCTGA GCTACGCCCT GTCGCCCATG TTCCTCGGCA AGGTCGGCGC GACGTCCGTC GCGATGGTCG GGGCGGCGAT GCTGCCCTGG ATCACCCTGC CGCTGATCCT CGCGCTGCGC CCCGACGGCG CCGGCGGGGC GGACACGGGC CACCGTGACG ACACCGGGGA GCGGGCCCGG GAGCTGGCCG CCGCGCGTCT GTCGCCGCGG CGCGCCGCGG CCCTGTCGGG GCTGGCGGTG CTGTGCACCG GTGGCATCAA CGCGACCGTC ACGCTGTGTG TGCTGCTCTG CCCGGCCGTC GTCCTGGTGT TCGCGGGCGC GACACGGCGG GCCTGGGCAC TGCGGGCATG GTGGTGCGTG TGCGTGGTGC TGGCGTGCGC CTGGTGGATG CTCGCGCTGG CCGTCCAGGG CCGCTACGGG CTGAACTTCC TCCCCTTCAC CGAGACCGCC GACACCACGA CGGGAACCAC CTCGGTCGGC GAGACGCTCC GCGGCGCCGC GGACTGGATG GCCTACCTCT CGCTGCCGAC CCCCTGGCTG CCGGCGGCCC AGGAGTACGT GAGCACCCCG CTGGCGGTCG TCGCCTCGGC GGTGGTGTCC GCCTTCGGTC TCGCCGGCCT GGTCCGCCGG GATCTCCCAG CCCGCCGCTT CCTGCTCGTC ACGCTCGCCG TCGGGGTGGT GTCGGTGGCG GCGGCCTACC CGGGCCAGCC GGGCAGCCCG CTCGCGGACG GCGTCCGCTC GCTGCTGACC GAGCCGTTCG GGTTCCTGCG CAATGTCTAC AAGTTCCAGC CGGTCGTCCG GCTGCCGCTG ACGCTGGGCC TCGCGCACCT GCTCGGCGTG GCCCTCTCGT GGCGTCCCGG CGCGGTCCGG GCCGGCGCCG CCGGCCCGCC CGAGGACGCC GGGCCCGCGG GCAGGGCACG GCGCGGCGCG GAGGGCGCCG GCGATGATCG GCGCAGGCTC GTCCCGGTGC TGGTCATCCT GGTGACGGTC GGGACGCTGG TCGCCGGGAT GGCGCCGATG CTGCGTGGCC AGGGCCTGCA GCCCCGCCCC TTCACGAAGG TGCCGGACTA CTGGGCCCAG GCGGCCGACT GGCTCGCGGA CCATCCCGAG GGCGGGCGGG CCCTCGTCCT GCCGGGCGCC CCGTTCGGCG AGTACCAGTG GGGCCGCCCC CTCGACGAGC CGCTGCAGTG GCTTGCCCGC ACCCCCTGGG GGGTGCGCAA CATCATCCCG CTGGGCGGTG TCGGGACGAC CCGCCTGATG GACGGCATCG AGCACATGAT GGCCACCGGC TCCACCCCCG GGCTCGGGGT GACGCTGGCC CGCGCGGGGG TCGGCCAGGT GCTCGTGCGT AACGACATCG AGCAGAAGGA CTGGGACATC CCGCCGTCGA CGGACCAGCT ACACCGCTCG CTGGAGAGCT CGGGCCTGGT CCGGGCGGCG TCCTTCGGGC CCGAGGTCCA GGCCCGCACC GCCGCCAAGG CCCGGCTCGT CGAATCCCTG AGCGAGTCCG CCGAGAAGGT CCCGGCGATC GAGATCTGGA CGGTGCCCGG GGGAGCCAGC ATGGTCCAGG CCTACCCGGT CGACACCGGG GTCGTCGTCT CCGGCGGCCC CGAGGCCACC GTGCAGCTCG CCGGCCAGGG GCTGCTCAGC GCCGATCGAG CCGTGGTGCT GGCCGGGGAC CTCGCCGAGC CGGACCGCCC CGCCGGCGGC GCGGCGGCCG GCGCGGCCGA CGACACCGCG ATCACGCGGG CGCCCATCGC CGATCCTCCG CCGGCGTCCC AGGTCGTCAC GCCCACGACG GCCTGGGCCG TCACCGACAC CAACACCCGC CGTGGCTACA CCTTCGGCAT CGTGCACGAC TCGGCGTCCT ACCTGCTCGG CCCGGACGAG ACCGTCGCCG GCCGGCCCGG CCCGCCGAGC CAGTGGGTCG ACCGGCCGGT GGTCGGTCAC CAGACCGTCG CCGGCTACGC CGACGGGATG TCCGTGCAGG CCTCCTCCTA CGGCTACGAC CTGCTCGCGG CTCCTGACTT CGCGCCGTCC GCGGCCGTCG ACGGGCAGAC CTCGACCTCC TGGACGGCGC TGCGCCGCCA GGGCGCGACC TCCCAGGGAC AGTGGATCCA GCTCGACGTC GGCCGCGAGA TGTCCGTGCC CTACATCGAC ATCCGGCTGC TGCAGGAGGG CGACTGGCGC CCGGAGGTCG AGGCGCTGCG GGTGACCACC GAGCGCGGGT CGGCGGTCAC CCAGGTCTCG CCGATCGAGG ACATCCAGCG GCTGGCGGTG CCCCCGGGGA TGAGTCGCTG GTACAGGATC ACCTTCGACA AGGTCAGCCG GGAGACCGAC CCCGTCCTCG GGGCCGGCCT GCGCGAGATC GAGATCCCCG GCGTGCGGTT CCAGCGGTAC GCCCAGGCCC CGGCCGACAT GGTCGACGAG TTCCAGGCGC CGGACGAGGG GCTGGTCGCC TACTCCTTCG AGCGGACCAG GGTCGACCCG CTCCAGCCCT TCGGCGGATC CGAGGAGATC ACGCTCTCCC GGCGGTTCGA GGTGCCCCGC CGCCTCACCT TCACCCTCAC CGGCACCGCG AGCGCCCTCC CGCCGCCGGC CGGCGCCGAA GTCGACTCCT CCGACGATCC GCTGGTCATC CCATGTGGCC AGGGGCCGGC CCTGACGATC GACGGAGTCC GCCACGACAT CCAGGTCGAG GGCAAATACA GCGACCTCGC CACCGCGCGG CCGTTCCGCA TCAGCCTGTG CTCGGAGGGC CACCAGATCA CCCTGGATCC GGGGCAGCAC CTGATCACCG TCGACCTCGG CCAGTCGACG ATGCTCGTCG ACTCGCTCAG CCTGGTCGGC ACCACCGCCG CGACCAGCAC GGAGAAGCCA CGGACGACCC GGATCGGGGA GTGGGGCGCC GAGCGGCGGA CGATCGAGAT CGGTGCCGGC GCCAGATCCT TCGTGTCGGT GCGGGAGAAC GCGAACGCGT CCTGGACGGC GACCCTGGAC GGGAAGCCGC TCACGGCGGT CCGGCTCGAC GGCTGGGCGC AGGGCTGGAT CGTGCCGGCG GGGGCCGCCG GCACGATCGT GATCGAGAAC CTTCCCGGCC AGGAGTACCG GCGCAACCTG ATCATCGGGC TCGCCCTGGT CGTTCTCCTG ATCGTCCTGG CGGCCGTCCC GGGCCGGCAC CGGCTCCGGC GCCGCTCCGA CCCCGACGGG TACCCTCTGG GCCTCGAGCC GGGGCGCGTC CCGCTGATCG GGCTGCTCAC CCGCGTTCCG GGCGCCTGGG CGGGGATGGC GCTGGCGACA GCCGCGGTGT TCCTGATAGC GGGCTGGCTG GCGCTCGCGG TACCGGTCCT GGTCCTCGTC GGCCGGCGGT TCCCGGTGGT GCTCGGCGTG CTGGCGGTGG CTGGCATGGT CGGCTCCGGC ATCGCCGTCG CGGTCAGCCC CGACAGCATT CCGTTCTCGG GCGAGGGCGC GTTCGGCTGG CAGGCCCAGA CCCTCGGATC GCTGGCGTTC GCCGCGACGG TCGCCGCACT CGCGCTGCGC CGTGCCGAGC CGGCCCGGCC CGCATCACCC GGACCAGCCT CGGACGGCTC AGCCGGCCAT GTCCCGTAG
|
Protein sequence | MFAGFVLLFL LQAPGKLTAD TKLDVPLEPW RFMSAATHLW NSTSDFGFLP NQYAGYLFPM GPFFGLGNLL GVPPWITQRL WMAVLLTTAA WGTVRLAEAL GIGRPSARFL AGLSYALSPM FLGKVGATSV AMVGAAMLPW ITLPLILALR PDGAGGADTG HRDDTGERAR ELAAARLSPR RAAALSGLAV LCTGGINATV TLCVLLCPAV VLVFAGATRR AWALRAWWCV CVVLACAWWM LALAVQGRYG LNFLPFTETA DTTTGTTSVG ETLRGAADWM AYLSLPTPWL PAAQEYVSTP LAVVASAVVS AFGLAGLVRR DLPARRFLLV TLAVGVVSVA AAYPGQPGSP LADGVRSLLT EPFGFLRNVY KFQPVVRLPL TLGLAHLLGV ALSWRPGAVR AGAAGPPEDA GPAGRARRGA EGAGDDRRRL VPVLVILVTV GTLVAGMAPM LRGQGLQPRP FTKVPDYWAQ AADWLADHPE GGRALVLPGA PFGEYQWGRP LDEPLQWLAR TPWGVRNIIP LGGVGTTRLM DGIEHMMATG STPGLGVTLA RAGVGQVLVR NDIEQKDWDI PPSTDQLHRS LESSGLVRAA SFGPEVQART AAKARLVESL SESAEKVPAI EIWTVPGGAS MVQAYPVDTG VVVSGGPEAT VQLAGQGLLS ADRAVVLAGD LAEPDRPAGG AAAGAADDTA ITRAPIADPP PASQVVTPTT AWAVTDTNTR RGYTFGIVHD SASYLLGPDE TVAGRPGPPS QWVDRPVVGH QTVAGYADGM SVQASSYGYD LLAAPDFAPS AAVDGQTSTS WTALRRQGAT SQGQWIQLDV GREMSVPYID IRLLQEGDWR PEVEALRVTT ERGSAVTQVS PIEDIQRLAV PPGMSRWYRI TFDKVSRETD PVLGAGLREI EIPGVRFQRY AQAPADMVDE FQAPDEGLVA YSFERTRVDP LQPFGGSEEI TLSRRFEVPR RLTFTLTGTA SALPPPAGAE VDSSDDPLVI PCGQGPALTI DGVRHDIQVE GKYSDLATAR PFRISLCSEG HQITLDPGQH LITVDLGQST MLVDSLSLVG TTAATSTEKP RTTRIGEWGA ERRTIEIGAG ARSFVSVREN ANASWTATLD GKPLTAVRLD GWAQGWIVPA GAAGTIVIEN LPGQEYRRNL IIGLALVVLL IVLAAVPGRH RLRRRSDPDG YPLGLEPGRV PLIGLLTRVP GAWAGMALAT AAVFLIAGWL ALAVPVLVLV GRRFPVVLGV LAVAGMVGSG IAVAVSPDSI PFSGEGAFGW QAQTLGSLAF AATVAALALR RAEPARPASP GPASDGSAGH VP
|
| |