Gene Information |
Locus tag | Franean1_2742 |
Symbol | |
ID | 5671133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3242890 |
End bp | 3245040 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641241654 |
Product | hypothetical protein |
Protein accession | YP_001507074 |
Protein GI | 158314566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
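The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; both are assumptions about this record, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3242890
end_bp = 3245040
reported_gene_length = 2151    # bp
reported_protein_length = 716  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS, assuming the stop codon is included."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp == reported_gene_length)     # coordinates agree with Gene Length
print(length_aa == reported_protein_length)  # CDS length agrees with Protein Length
```

Under these assumptions both comparisons hold: 3245040 - 3242890 + 1 = 2151 bp, and 2151 / 3 - 1 = 716 aa.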
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00817564 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCTG CCGCCTCCTG CCGCCGCGTC CTGCCGTCCG GCACGGCGGG TTCGTCGCCT GGTCAGACGC GCCGCCGGCT GGGTTTGTCT CGGCGCCGCG ACCGCGGGCG TTGCCGGGGG CTGCTGCTGG CCGGGACGGC GGCGTTCACG CTGCTGCTGG TTGTCGTGCT TCCCTCGGTA GCCTCCGCGG CCCCCGCGCC ACCGGTTCCT GCGCCTGCGC CGTCCACCGA CACGCCCGCC CCTACCGGTC CCACCCCGAA CCCGCAGCCT GGGCCGTCCC CGGTTCCGCC GGGCCAGGAA GCGCCGGCCA CGCCGGCTCC CTCCGGCGGG GATGCCACGC CCGCGCCCGA CACTCCGGAC ACCCCGCCCG ACGATGGCGG CAACGACGGT CCCGCCGAGC CGTGCCCGGC CGACAACCCG GACTGCGCCA CCACCCCGGA GATCCCGCCG CCACCGAGGC CGCAGACAAC ACCCCCGGCG CCGGACACCG GCGGCGATGG CGGCGGGATC GTCGGGTGGA TCACCGACGC GATCAACGAC GCCATCACCG CGTTCTTCCG AACTCTGGTG ACCGCCGCGC TCAACCCGCT ACTGGAACTC CTCGGCCGCA CGCTGCTGAC CACCCCAGAA CCGTCCGAGC TTCCCGCGAT CGGGGAATTG TGGTCGACGT CGTGGGCGAT CAGCGTCGCC GCCTACGGCA CGCTGATCAT GTTCGGTGGG ATCACGGTCA TGGCGCTGGG CACGGTCCAG TCCCGTATCT CGATCAAAGA GGTCGCCCCT CGTATCCCGA TCGGTTTCCT CGCCGCCGGG CTGTCGCAGT TCCTGGCGAC GAAGGCCATC CAGATCGCGA ACGTGCTACC CGGCGCGATC CTGAGCGAGG GGTTGAACCC GGACACCGCC GCCGGGCAGC TGAAGACCAT CATCCTGGTG GCGATCGTGC CGACGCCCGG CGGCTTCGGC AAGAGCATCT TCGTCATCTT CCTTGGCCTG TTCCTCGCCG GATCGGTCGT GGCGCTGCTG TGTACCTATA TCGCCCGGGT CGTCATCGAG GTGGCGCTCA TCGGCGCGGC ACCCCTGGCG TTGGCTGGGC ACGGCCATCC GCTGACCGAG AAGGTGGCGT TCTGGTGGTG GCGGGCGTTC TTCGGTTGCC TGGGTATCCA GATCGCCCAG TCGTTCGTCC TGATCGCCTC GTTCAAGGTG TTCTTCACTC CGGGCGGGTT CACCATCTTC GGCCCGACGC CGGACGGGGT CGTGAACCTG CTGGCCGCGA TCACCCTGAT CTACTTCCTG TTCAAGATTC CGTTCTGGTT GATGCCGCGG ATCGGCCACG GTGGTGGCGG CATTCTTGGC CGGGTCGTGC GCGCCTACGT CATAGGCCGG GCGCTGGGAT TCCTCGGTGG CCGCTTCGGA CGCGGCGGCG CCGGACGCGC GGGCTCACGG GCCGGCCGAG GCCGCGGCGG ACGCGGCCGG GGTCGGGGTG GACGCGGAAG GGGCCGGCCG GGCGGAGGCC CGGGAAGCGG ACCGTCCGAC CCGTACCACC ATGTCGAGGC CGATGCCGAC GGGCAGCTAC TCCTCCCGCT GACCAGGGTG CCCCGGGTCC GCCGGCCGGG ACCGGCCCGG GGCGCGCCCC GCCCGCCCCG GCCAACGGGC CTGGCGGGCC GGTCCCGGCT GCGGCACCGC CAGCTGACCA TCCCGTTCGG CCAAGCCGTC GCCCCGGGCG GGCGGTACCT GCCCCGCGAC GGCGGTGCGT GGGTGGACCG TGACGGCCAG CTACTGCTGC CGTTCGAGGT CGACCAGCCC 
CGCTCTCCGA CCGCGGCCCC TGCGCCTGCC CGGCGTCCGG CCTCGGCGCG GGCTGGTGCG GGCCGGTCAG GTCCCGGCCG GCCGGCACCG TCTCGGCCAG CACCGCCGAA GGCCCGGCAG CCGGAGCTCC CGTTCGACCC CTACCGGGGC ATCCGCCCGG ACCGCACCGG CCAGTACGCC CTCCCCTTGG AGGGCTTGCA GCGCACCCCA CGCCCCACCC GGCCGGCCCC GCCTGTGCCG CGGGCGGCAT CCCGTCCGAC CTCGCCGGCG CGGCCGGTCC CGCCGCGCTA CTGGCAGCAG CTGTTGCCGA ACATGCCCCG CCGCCCCCGC CCACCCCGCA ACGGAAAGTG A
|
Protein sequence | MTAAASCRRV LPSGTAGSSP GQTRRRLGLS RRRDRGRCRG LLLAGTAAFT LLLVVVLPSV ASAAPAPPVP APAPSTDTPA PTGPTPNPQP GPSPVPPGQE APATPAPSGG DATPAPDTPD TPPDDGGNDG PAEPCPADNP DCATTPEIPP PPRPQTTPPA PDTGGDGGGI VGWITDAIND AITAFFRTLV TAALNPLLEL LGRTLLTTPE PSELPAIGEL WSTSWAISVA AYGTLIMFGG ITVMALGTVQ SRISIKEVAP RIPIGFLAAG LSQFLATKAI QIANVLPGAI LSEGLNPDTA AGQLKTIILV AIVPTPGGFG KSIFVIFLGL FLAGSVVALL CTYIARVVIE VALIGAAPLA LAGHGHPLTE KVAFWWWRAF FGCLGIQIAQ SFVLIASFKV FFTPGGFTIF GPTPDGVVNL LAAITLIYFL FKIPFWLMPR IGHGGGGILG RVVRAYVIGR ALGFLGGRFG RGGAGRAGSR AGRGRGGRGR GRGGRGRGRP GGGPGSGPSD PYHHVEADAD GQLLLPLTRV PRVRRPGPAR GAPRPPRPTG LAGRSRLRHR QLTIPFGQAV APGGRYLPRD GGAWVDRDGQ LLLPFEVDQP RSPTAAPAPA RRPASARAGA GRSGPGRPAP SRPAPPKARQ PELPFDPYRG IRPDRTGQYA LPLEGLQRTP RPTRPAPPVP RAASRPTSPA RPVPPRYWQQ LLPNMPRRPR PPRNGK
|
| |
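The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below checks only the first seven codons and the last four codons of the record. The codon dictionary is a deliberately partial subset of the standard code covering just those codons, and the set of start codons is likewise a subset; the helper name is hypothetical.

```python
# First 21 bases and last 12 bases of the gene sequence in this record.
first_bases = "GTGACCGCTGCCGCCTCCTGC"
last_bases = "AACGGAAAGTGA"

# Partial codon table: only the codons needed for the two fragments above.
CODONS = {
    "GTG": "V", "ACC": "T", "GCT": "A", "GCC": "A", "TCC": "S",
    "TGC": "C", "AAC": "N", "GGA": "G", "AAG": "K", "TGA": "*",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG"}


def translate_fragment(seq: str, at_start: bool) -> str:
    """Translate an in-frame fragment; a start codon at position 0 becomes M."""
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if at_start and index == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


n_terminus = translate_fragment(first_bases, at_start=True)
c_terminus = translate_fragment(last_bases, at_start=False)

print(n_terminus)  # MTAAASC
print(c_terminus)  # NGK*
```

The two fragments match the start (MTAAASC) and end (NGK, followed by the TGA stop codon) of the protein sequence listed above.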