Gene Information |
Locus tag | Francci3_2199 |
Symbol | |
ID | 3906338 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2571187 |
End bp | 2572446 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637879531 |
Product | epoxide hydrolase-like |
Protein accession | YP_481297 |
Protein GI | 86740897 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
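The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2571187, 2572446)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both values match the Gene Length and Protein Length fields of the record.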
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.563968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACG AAACCACGCC GTCGCACGCC GTCGAGACCC GGCCGTTCCC ACTGAAGCCG ACGCCGATTC ACGTGCCCGA CGACGTGCTC GCCGACCTCC AGCGGCGCCT GGAGTTGACT CGCTGGCCGC TCGATGCGGG CAACGAGGAC TGGTATTACG GCGTGAACCG CGCCTACCTG CAAGAACTTG TCGACTACTG GCGCACCGGC TACGACTGGC GCAAGTCCGA AGCCGCCATC AACGCCTACG AGCACTACCA GGTCGAGGTC GAAGGCGTGC CGGTGCACTT CATGCGCAAG GCCGGAGTCG GCCCCGATCC GACCCCCCTG ATCCTCACCC ATGGCTGGCC CTGGACGTTC TGGCACTGGT CCAGGGTCAT CGATCCGCTG GCCGACCCCG GCGCGTACGG CGGCGATCCC ACCGAAGCAT TCGATGTGAT CATCCCCTCG TTTCCCGGCT TCGGGTTCTC CGTACCGCTG CCGAACAACC CGGACCTGAA CTTCTGGAAG GTCGCCGACC TCTGGCACAC CCTCATGACT CAGACCCTCG GCTACGACAG GTACGCCGCC GCCGGCTGCG ACGTCGGAGC CCTGGTTACC GGCCAGCTTG GGCACAAGTA CGCCGACGAG CTGTACGCCA TCCACATTGG CTCCGGCCTG AAGCTCACCC TGTTCAACGG CGACCGGGCC TGGGACCTCA GCGGCGGCCG GCCCATCCCC GACGGCCTTC CTGACGACAT CCACGCCCAG ATCGTCGCCG TGGAGAGGCG CTTCGCCGTC CACCTCGCCG CGCACGTGCT CGCCCCGAGC ACGCTCGCCT ACGGGCTGTC CGACTCCCCG GCCGGGATGC TCGCCTGGAT ACTCGAACGC TGGGTGAAGT GGAGCGACAA CGGCGGCGAC ATCGAGACCG TCTTCACCAA AGACGACCTG CTCACCCATG CCATGATCTT CTGGGTGACC AACGCGATCG GTACCTCGAT CCGCACCTAC GCCAACAACA ACCGCTACCC GTGGACCCCG TCCCACGACC GGCAGCCAGC CATCGAGGCG CCCACCGGCA TCACCTTCGT CGGCTATGAA AACCCACCCG GCGTCAGTAC CGACCAGCGG GTTCAGAACT TCCTCGACTC CGACCGCGCC GCCTGGTACA ACCACGTCAA CCTCAACGCC CACGACCACG GCGGCCACTT CATTCCCTGG GAAATCCCCG CTCAATGGGT CGACGACCTG CGGCGTACCT TCCGCGGCCG CCGCTACTGA
|
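The GC content field can be recomputed from a gene sequence written in this 10-base block layout. The following is a minimal sketch; `gc_percent` is a hypothetical helper, and it is shown on the first two blocks of the sequence only, so the printed value is for that fragment rather than for the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above.
fragment = "ATGAGCCACG AAACCACGCC"
print(gc_percent(fragment))  # 60.0
```

Passing the full gene sequence string to the same function gives the whole-gene figure to compare against the GC content field.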
Protein sequence | MSHETTPSHA VETRPFPLKP TPIHVPDDVL ADLQRRLELT RWPLDAGNED WYYGVNRAYL QELVDYWRTG YDWRKSEAAI NAYEHYQVEV EGVPVHFMRK AGVGPDPTPL ILTHGWPWTF WHWSRVIDPL ADPGAYGGDP TEAFDVIIPS FPGFGFSVPL PNNPDLNFWK VADLWHTLMT QTLGYDRYAA AGCDVGALVT GQLGHKYADE LYAIHIGSGL KLTLFNGDRA WDLSGGRPIP DGLPDDIHAQ IVAVERRFAV HLAAHVLAPS TLAYGLSDSP AGMLAWILER WVKWSDNGGD IETVFTKDDL LTHAMIFWVT NAIGTSIRTY ANNNRYPWTP SHDRQPAIEA PTGITFVGYE NPPGVSTDQR VQNFLDSDRA AWYNHVNLNA HDHGGHFIPW EIPAQWVDDL RRTFRGRRY
|
| |
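The protein sequence can be reproduced from the gene sequence by codon translation. The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence (10 codons).
print(translate("ATGAGCCACG AAACCACGCC GTCGCACGCC"))  # MSHETTPSHA
# Last three codons, including the TGA stop.
print(translate("CGCTACTGA"))  # RY
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence above.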