Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1221 |
Symbol | |
ID | 5669634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1461453 |
End bp | 1463054 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240153 |
Product | RNA binding metal dependent phosphohydrolase |
Protein accession | YP_001505581 |
Protein GI | 158313073 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.655673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.159074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCGG ACCTTCGCCG GGAGGCGCGT GACGTTGAGG AGGAGGTCGA GCGGATCCGG CGCCGGGCCG AGCAGGACGC CGCGGAGCAG ACGGAGCGGG TACGGCGGGA GGCCGAGCAG ATCCGCCGGC ACGCGGAGGA GGCCGCCGAG GCGATCCGGG AACGGGCGGT CGCGGACGCG GAGCTGCGGG CGTCGAGGGC GGAGGCGGCC GCCCGCGACG CGATCCACGC GGAGCGCGAG CAGATCCGCG CCGAGCTCGA CGAGGATCTG CGCACCCAGC GGACCGAGCT GCGCGGCTGG GACAGCCGGC TCACCCAGCG CGAGCAGCGG GTCACCGACC AGGCTGCCAG CGTGGAGGAG CGGCTGCGCC GGCTGGAGAC CCGCGAGGCC GAGCTGGCCG TCCGCGAGGC CGGGCTGGAC AGCCGTGAGT CCGATCTCGG CGAGCTGGAG GAGGCCCGGC GGCGGGAGCT GGAGCGGGTG GCGGGCCTGA CGTCCGCGGA GGCCCGCACC GAGCTGGTCA AGGTGGTCGA GGACCAGGCC AGGCTGGACG CCGCCGTCCG GGTGCGTGAC ATCGAGGCCC GAGCCGAGGA GGAGGCCGAG GACCGGGCGC GCCGGATCGT CACCCTGGCC ATCCAGCGGG TCGCCTCGGA CCAGACCGCC GAGTCCGTGG TGTCGGTGCT GCACCTGCCC AGCGACGAGA TGAAGGGCCG CATCATCGGG CGCGAGGGTC GCAACATCCG CGCGTTCGAG TCCGTGACCG GCGTCAACGT GCTCATCGAC GACACCCCGG AGGCGGTGCT GCTGAGCTGC TTCGATCCCG TGCGCCGGGA GATGGGGCGG ATCACGCTGA CGGCGCTGGT GTCCGACGGC CGCATCCACC CGCACCGGAT CGAGGAGGAG TACGCCCGGG CCGAACGCGA GGTCGCGGCG AAGTGCGTCC GCGCCGGTGA GGACGCCCTG ATCGACGTCG GCATCGCCGA GATGCATCCC GAACTGATCA ACCTGCTGGG CCGGCTGCGC TACCGCACCA GCTACGGGCA GAACGTGCTC GCGCACCTCG TCGAGAGCGC CCACCTGGCC GGGATCATGG CGGCCGAGCT GCGGCTGCCC CCGGCGATCG CGAAGCGCGG CACGCTCCTG CACGACCTCG GCAAGGCGTT GACGCACGAG GTGGAGGGCT CCCACGCGAT CGTGGGTGCG GAGATCGCCC GCCGCTACGG CGAGCACGAG GACGTCGTCC ACGCCATCGA GGCGCATCAC AACGAGGTCG AGCCGCGCTC CATCGGGGCC GTGCTGACCC AGGCCGCGGA CCAGATCTCC GGCGGGCGCC CCGGGGCACG CCGGGACAGC CTCGAGTCCT ACGTCAAGCG CCTCGAGCGC ATCGAGCAGA TCGCCGCCGA GCGCCCCGGG GTCGAGAAGG TCTTCGCCAT GCAGGCCGGC CGCGAGGTGC GGGTGATGGT CGTACCCGAG CTGGTCGACG ACGTGGCCGC CCACCTGCTC GCCCGGGACG TCGCCAAGCA GATCGAGGAC GAGCTCACCT ACCCGGGCCA GATCCGGGTC ACCGTGGTCC GCGAGACCCG CGCCGTCGGC ATGGCCCGCT AG
|
Protein sequence | MVADLRREAR DVEEEVERIR RRAEQDAAEQ TERVRREAEQ IRRHAEEAAE AIRERAVADA ELRASRAEAA ARDAIHAERE QIRAELDEDL RTQRTELRGW DSRLTQREQR VTDQAASVEE RLRRLETREA ELAVREAGLD SRESDLGELE EARRRELERV AGLTSAEART ELVKVVEDQA RLDAAVRVRD IEARAEEEAE DRARRIVTLA IQRVASDQTA ESVVSVLHLP SDEMKGRIIG REGRNIRAFE SVTGVNVLID DTPEAVLLSC FDPVRREMGR ITLTALVSDG RIHPHRIEEE YARAEREVAA KCVRAGEDAL IDVGIAEMHP ELINLLGRLR YRTSYGQNVL AHLVESAHLA GIMAAELRLP PAIAKRGTLL HDLGKALTHE VEGSHAIVGA EIARRYGEHE DVVHAIEAHH NEVEPRSIGA VLTQAADQIS GGRPGARRDS LESYVKRLER IEQIAAERPG VEKVFAMQAG REVRVMVVPE LVDDVAAHLL ARDVAKQIED ELTYPGQIRV TVVRETRAVG MAR
|
| |