Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3523 |
Symbol | |
ID | 3904462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4207341 |
End bp | 4209281 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637880845 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_482605 |
Protein GI | 86742205 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.555023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.082287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGCG TCCTCGTCGT CCTGTTGTCA CTCGTACTTG TTGTCCTGAG CGTTCTGATC CTCGCCGTGG CGCGGTTGGT CCGGGCGACC CGGGTCGACA AGGTACCCGA CCCCGCTCCG GTGGTGCCCC GAACCCCGGC CGCCCGGGGC GTCGGGGACG TGACGGGTCC GGCGGACTTT GACGAGGAGC CGACGGTCCG GGTCCTGCCC GCCCCTTGGG AGGGTTCCGG TGCTCCGGCC ACGACGGACG CCGACTCACC CGCCGATCGG GACGGCACCG CCGCGCGGAC CGGGGATTCG GCGGTGACCG CCGGTCGCTC CGACGGCGGC CTCCGCGCGG CTCATGGTGG TAGCGCGGAG GAGGCGGCGC AGATTGTGGC CCGAGCCGAA CGGGAGGCGG CGGAACGGTT GGCGCGGGCT GAGCGGGATG CCGCCGAGAT CCGGCGGCGC GGCGAGGAGG ATGTCGCCCT ACTGCGCGAG CGGATGCTCG CCGAGGCGGC GGTCGAAACC TCGCGAGTCC AGGCCGCGGC GAGAGAGTCC GTCCGCGCCG AGCAGGAGGC CGCCCGGACC GAGATCGCCG CGACTCGGGC GGCGTTCGAC GGTGAGCAGC AGGCCTGGCG GACGGAACTG CAGAGCCGGG AGGTTGCGAT AGCCGCCCGG GAACAGCGCG TCGAGGACCG GATGGCCAGC CTCGACGATC ATGGTCGCCG GCTGGCGGAC CGCGACCGCG ACCTGCTCGA CCGGGAGAAC GACCTGACCC GTCGGACGGC CGAGGTGGCC GACCTCGAAC GTGCCCGTCA TGCCGCGCTG GAGCAGGTGG CCGGGCTCAC CGCCGGGCAG GCCAGGGGAG AGCTGATCGC CGTCATCGAG CAGGAGGCCC GGCGGGAGGC GGCGCTGACG GTCCGCGAGA TCGAGGCCCG GGCCGAGGAG GAGGGTGAGG AACGCGCCCG CAGGATCGTG ACCACCGCCA TCCAACGGGT CGCGTCCGAC CAGACCACCG AGTCCGTCGT GACGGTCCTG CATCTTCCCG GCGATGAGAT GAAGGGCAGG ATCATCGGGC GGGAGGGGCG CAACATTCGG GCTTTCGAGT CCGTCACCGG GGTCAACGTG CTCATCGACG ACACGCCCGA GGCGGTGCTG CTGAGCTGCT TCGATCCGGT GCGTCGCGAG GTCGGTCGCA TCACGCTGGC GGCTCTGGTG TCCGACGGCC GGATCCATCC GCACCGCATC GAGGAGGAGT ACGCCCGCGC CCAGCTCGAG GTGGCCGAGC GGTGCGTGCG GGCGGGCGAG GACGCCCTGC TTGAGACCGG CATCTCCGAG ATGCACCCCG AGCTGGTTAA CCTGCTGGGC CAGTTGCGTT ACCGAACCAG CTACGGCCAG AACGTGCTCG CGCACCTGAT CGAAAGCGCC CACCTCGCCG GAATCATGGC CGCCGAGCTG CGCATGCCGC TTCCACTCGC GAAACGAGCG GCTCTGCTGC ACGACCTCGG CAAGGCGCTC ACCCACGAGA TCGAGGGCTC TCACGCGTTG ATCGGGGCGG ATGTGGCCCG TCGCTACGGT GAGGACGAGC AGGTCGTGCA CGCGATCGAG GCCCATCACA ACGAGGTCGC ACCCCGCTCG ATCTGCGCGG TGCTGACCCA GGCCGCCGAC CAGATCTCCG GTGGCCGGCC TGGCGCCCGC CGCGACAGCC TGGAGTCGTA TGTGAAACGG CTCGAGCGCA TCGAGCAGAT CGCCGGTGAC CGTCCGGGTG TCGACAAGGT GTTCGCCATG CAGGCCGGCC GGGAGGTGCG TGTCATGGTC GTGCCCGAGG AGATCGACGA TCTCGCCGCC CATCTGCTCG CCCGGGACGT CGCCAGGCAG ATCGAGGAGG AGCTCACCTA TCCGGGTCAG ATCCGGGTGA CCGTCGTGCG CGAGACCCGT GCGGTGGGCA CCGCCCGCTG A
|
Protein sequence | MEGVLVVLLS LVLVVLSVLI LAVARLVRAT RVDKVPDPAP VVPRTPAARG VGDVTGPADF DEEPTVRVLP APWEGSGAPA TTDADSPADR DGTAARTGDS AVTAGRSDGG LRAAHGGSAE EAAQIVARAE REAAERLARA ERDAAEIRRR GEEDVALLRE RMLAEAAVET SRVQAAARES VRAEQEAART EIAATRAAFD GEQQAWRTEL QSREVAIAAR EQRVEDRMAS LDDHGRRLAD RDRDLLDREN DLTRRTAEVA DLERARHAAL EQVAGLTAGQ ARGELIAVIE QEARREAALT VREIEARAEE EGEERARRIV TTAIQRVASD QTTESVVTVL HLPGDEMKGR IIGREGRNIR AFESVTGVNV LIDDTPEAVL LSCFDPVRRE VGRITLAALV SDGRIHPHRI EEEYARAQLE VAERCVRAGE DALLETGISE MHPELVNLLG QLRYRTSYGQ NVLAHLIESA HLAGIMAAEL RMPLPLAKRA ALLHDLGKAL THEIEGSHAL IGADVARRYG EDEQVVHAIE AHHNEVAPRS ICAVLTQAAD QISGGRPGAR RDSLESYVKR LERIEQIAGD RPGVDKVFAM QAGREVRVMV VPEEIDDLAA HLLARDVARQ IEEELTYPGQ IRVTVVRETR AVGTAR
|
| |