Gene Information |
Locus tag | Franean1_4189 |
Symbol | |
ID | 5672544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4982299 |
End bp | 4985007 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243062 |
Product | hypothetical protein |
Protein accession | YP_001508479 |
Protein GI | 158315971 |
COG category | [R] General function prediction only |
COG ID | [COG3973] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
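The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon (the record lists the gene as not spliced); the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 4982299, 4985007
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 2709 902
```

Both results agree with the Gene Length (2709 bp) and Protein Length (902 aa) fields.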
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.060673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCAAACG AAGCGGTGCC CGCCGCGGCT GTGCCCCCCG CACCCCCGGT GCCCCCTGAG AGCGCCCCGA CCGGAGACAC CTCCGGGGGC GACGAGCACG CTCGGGAGCA GCAGTACCTG ACCATGCTGC ACGAGCGGCT GGACGGGCTG CGCGCCGCAG CCGCCGCCGG GCTCGCCGAA GCCCTGCTGC GCGACGATCC CGAACCCGGC GCGCGCGCCG ACCGCGACGC GCTCGCCGCC CGCCACGCCG ACCAGGTCGC CCGCTTCGAC GCGATGCGCG ACCGGCTGCT GTTCGGCCGG CTCGACATGG CCGACGGCGA GCAGCGCTAC ATCGGCCGGA TGGGCGTCCT CGACCCCGAA GCCGATTACC AGCCGCTGCT CATCGACTGG CGGGCCCCGG CCTCCCGCCC GTTCTACCTG GCCACCGGGG CCACCCCGCT GCACGTCGCC CGCCGCCGCC ACATCCGCAC CGTCGGGCGG CGGGTGACCC ACCTCGACGA CGAGGTGTTC GGCCCGACGA CCCTCGGCGG CGGGGTCCTC GGCCTGCGGA CCCTCGACGG GCTCGGCGCC GACGAGGGCG ACCCCGACCT CACCGCCGGC CCGCCCACCG CGACCGGTGA ACCGGACGGC ACACCGGCGC ACGACAGCCT CGTCGGCGAG GCGGCGCTGC TGGCCACCCT CGGCGCGCAC CGCACCGGCC GGATGCGCGA CATCGTCGCG ACCATCCAGG CCGAACAGGA TCGGATCATC CGCTCCGAGC AGGACGGCAT CCTCGTCGTC GAGGGCGGGC CGGGTACCGG CAAGACCACC GTCGCGCTGC ACCGTGCCGC ATACCTGCTG TACTCCCGGC GCGAGCAGCT GTCCAAGCGC GGGGTGCTGA TCGTCGGGCC GAACCCGGCC TTCCTGCGCT ACATCGAGCA GGTGCTGCCC TCCCTCGGCG AGACCGGGGT GCTGCTGTCG ACCGTCGGCG ACCTCTTCCC CGGCGTGCGG GCCCGCCGCG CCGAGAGCAC GACCGCCGCG GAGGTCAAGG GCCGGCCCGA GATGGTCGAC ATCCTGGCCG CCGCCGTCCG CGACCGCCAG CGGCTCCCCG ACACGCCCCT GGAGATCACC GTCGCGGACG GGGTCGCCCG CCTCGACGAG ACGATCGTCG TGCCGGCCCG GGCCGCCGCC CGGCTCACCG GACGCCCGCA CAACCACGCC CGGTCGGTGT TCGTCCGCGA GGTGATCACC GCCCTGACCC GCCAGATCGC CGACCGGTAC GAGTCGTCGA TCGACGAGGT CGACATCCCG GACTTCGTCG ACGACTTCAT GCTCTGGCCC GACACCGACG CCGCGCGCGA CGCCCTCGGC GACGGCACCG AAGGCGGCGA GCCGGGTGGG GGCAGCGCGG CGGCGGACGG CGGCGACTGG CCCAGCACGG CCCTGGCCGC CGCGGCCGCG GCGGCGGACC GGCCGGTGCT CGACCCCTCC GACCTGGCCG ACCTGCGCCG CGATCTGAGG TCCGACCCCA GGCTCGCCGA GGCGATCGAC GGCCTGTGGC CGCTGCTGAC CCCGCAGGCG CTGCTGGAGG ACCTGTTCGC CTCCGCCGAC GCGCTGTCGC ACGCCGCCCC CGGGCTGACC GACGCCGAGC GGGCGGCACT ACGCCGCGAC CCCGGCGGGC GCTGGGCCCC CGGCGACTGG GCCCCCGGCG ACTGGGCACC CGCCGACACC CCGCTGCTCG ACGAGGCCGC CGAACTGCTC GGCAACGACC CCCGGGCGGC GATCGACGCG GCCGTCGCCG CGCACCTGGA GCGCCAGCAG 
CGGATCGACT ACGCCGGCGG CGTGCTGGAC ATCCTCTCCC GCGGCGACAC CGAGGATCCC GACGGCGAGC TGCTGATGGC CTCCGACCTC ATCGACGCCG ACCGGTTCGG CGAACGCCAG GAGGAGGTCG ACACCCGCAG CACCGCCGAG CGCGCGGCCG CCGACCGCAC CTGGGCGTTC GGCCACGTGA TCGTCGACGA GGCGCAGGAG CTGTCCCCGA TGGCCTGGCG GATGGTGATG CGCCGCTGCC CGACCCGGTC GATGACGCTG GTCGGCGACG TCGCGCAGAC CGGCGACGCG GCGGGGAGCT CCTCGTGGGC GCAGGCGCTC GACCCGTTCG TCGGCCCGCG CTTCACCCTC GAGCGGCTCA CGGTGAACTA CCGCACCGGC GCGGAGATCA TGGACATCGC CGGCGACGTG CTCGCCGCTC AGGGGCGTGG GCTGCGCGCG CCGCGGTCGG TGCGCCGGTC GGGCCGCGCG CCGTGGCGGC TCACGGTCGG CGCCCACGAG CTGGCCGAGC ATCTGGAGCA GCTGGTACGT GCCGAGCGCG CGGCGGCCGG CGGCGGGCGG CTGGCTGTGA TCGTGCCGCG CTCCCGCCGC GACGAGCTGA CATCCCTCAG CGTGGACGCG GAGGGGGGCG CGGACGGGGA TCCGGAGCGG CCGGTCGTCG TGTTGACGGT CCGTGAGTCC AAGGGGCTGG AGTTCGACGC GGTCATCGTG GTCGAGCCCG AGCGCATTCT GGCCGAGTCG CCGCGGGGCG CCGGTGACCT GTACGTCGCG CTGACCCGGC CGACGAACCA GCTCGGGGTC GTGCACGTCG GCGCGCTGCC GGCGGTGCTG AGCCGGCTGC GACCGCGCGA GGCGACCGGT GGCGTTCCGG CGGCCGGTCC GGCGGCATCC GGTCGCTGA
|
Protein sequence | MSNEAVPAAA VPPAPPVPPE SAPTGDTSGG DEHAREQQYL TMLHERLDGL RAAAAAGLAE ALLRDDPEPG ARADRDALAA RHADQVARFD AMRDRLLFGR LDMADGEQRY IGRMGVLDPE ADYQPLLIDW RAPASRPFYL ATGATPLHVA RRRHIRTVGR RVTHLDDEVF GPTTLGGGVL GLRTLDGLGA DEGDPDLTAG PPTATGEPDG TPAHDSLVGE AALLATLGAH RTGRMRDIVA TIQAEQDRII RSEQDGILVV EGGPGTGKTT VALHRAAYLL YSRREQLSKR GVLIVGPNPA FLRYIEQVLP SLGETGVLLS TVGDLFPGVR ARRAESTTAA EVKGRPEMVD ILAAAVRDRQ RLPDTPLEIT VADGVARLDE TIVVPARAAA RLTGRPHNHA RSVFVREVIT ALTRQIADRY ESSIDEVDIP DFVDDFMLWP DTDAARDALG DGTEGGEPGG GSAAADGGDW PSTALAAAAA AADRPVLDPS DLADLRRDLR SDPRLAEAID GLWPLLTPQA LLEDLFASAD ALSHAAPGLT DAERAALRRD PGGRWAPGDW APGDWAPADT PLLDEAAELL GNDPRAAIDA AVAAHLERQQ RIDYAGGVLD ILSRGDTEDP DGELLMASDL IDADRFGERQ EEVDTRSTAE RAAADRTWAF GHVIVDEAQE LSPMAWRMVM RRCPTRSMTL VGDVAQTGDA AGSSSWAQAL DPFVGPRFTL ERLTVNYRTG AEIMDIAGDV LAAQGRGLRA PRSVRRSGRA PWRLTVGAHE LAEHLEQLVR AERAAAGGGR LAVIVPRSRR DELTSLSVDA EGGADGDPER PVVVLTVRES KGLEFDAVIV VEPERILAES PRGAGDLYVA LTRPTNQLGV VHVGALPAVL SRLRPREATG GVPAAGPAAS GR
|
| |