Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4824 |
Symbol | |
ID | 5673165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5760202 |
End bp | 5762190 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243680 |
Product | NUDIX hydrolase |
Protein accession | YP_001509096 |
Protein GI | 158316588 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase [COG1051] ADP-ribose pyrophosphatase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0491634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTCCG CCAGGGGGGC GTCTTTTTCT GTCAGCGTGA CAAGAAGCTA TTCTCGCCGC GTGGACGATG ATCCGTGGGA ACCTCCCGCC GTTATGGTCG CCGTCGACCT GGTGGTTCTT ACATTGCGTG GGGCGAGCCT GCGGGTACTT CTCGTTGAAC GGGGAGTCGA GCCGTACCGA AATTTCCTGG CGCTACCCGG AGGTTTTCTC GCCCATGCGG AGGAGGATCT GACCTCCGCG GCCCGGCGCG AACTCTCACA GGAAACGGGG GTCGATCCGC GGCTGCTGCA TCTCGAACAG TTCGGCGTCT ACGGAGAGCC GGGCAGGGAT CCGCGGGGCC GGGTCGTCTC GGTGGCCTAT CTCGCGATCG AGCCACAGCT GCCCGAGCCA GTCGCCGGTA CGGACGCGAT CGGGGCGGGC TGGCATCCGG TCGACCGGGT GCTCGCCGGG GACGTCACGC TCGCGTTCGA CCATCTCCAG GTTGTCGCGG ACGGAGTCGA ACGAGCCCGT TCGAAGATTG AGCACTCGAC GCTGGCGACG GCATTCTGCG CGCCCATCTT CACGATCGCG GAACTGAAGG ATGTATACGA GGCGGTGTGG GGAGTACCCG TGGACCCACG GAACTTCTAC CGCAAGGTCC AGAAGACCGG CGGATTCATA GTCTCCGCCG GCGCGACACG GCGGACGGCA GGCGGGCGCC CGGCCCGGCT CTTCAGAGCC GGCCCGAGCG CTGTGCTCTC GCCGCCGCTG TCCCGACCGG CGGTCTCTCC GGGGAGTCCT GGAACGGCCA GCGCAATGCA CACAGAGGGG GGAACCCGAA TGAGTCGGCG ACCAGTCGTG ATTCTCACGG CACTCGATCT GGAATATCAA GCGATCCGTG AGAGCCTTGT CGACCCGCGC CTGCACCGCC ATGACCAGGG CACTCGATTC GAACTCGGAC GGCTGGCCGG AGGCAGCTGC CGGATCGCCC TCGCCCATGT CGGTAAGGGC ACCCATCCGG CCGCCGTGCT GGCCGAGCGC GCGATCGCCG AGTTCGCCCC GGCGGCGCTG CTCTTCGTGG GGGTCGCGGG TGCCCTGCAC GGACACATAG CCCTGGGCGA CGTCGTTGTG GCGACCCACG TGTACGCGTT CCACGGCGGT ACCGGCGAGG ACGACGGGTT GAAGGCACGG CCCCGTGTCT GGGAGACCTC GCACGCGGCC GACCAGATCG CTCGGCACAT TCACCGGACG GGTTCCTGGG CGCAGCGGCC AGGAGCTCCC GCCATGCTGC CCGACGTGCA CTTCGGACCG ATCGCCGCCG GCGAGGTCGT GCTGAACTCC AGGGTCTCCG GACTCGCCCG CTGGATACGT GAGCACTACA ACGACGCCCT GGCGATCGAG ATGGAGGGGG CCGGCGTCGC GCAGGCCGGG CACCTGAACC GGGCGCTGCC GGTCGTCGTG ATCCGTGGTG TCAGCGACCG CGCCGACGGG ACCAAGGAGT CCACGGACCG CCAGCGGTGG CAGCAGCGCG CGGTCGCCAA CGCGGCACTG TTCGCGACGG CACTGGCCGA GGAGATTTCC GCAGAGGACG ACGGCGCCGG ACGCCCGGCA GCGAAATCAA CGAAAGGAGC CGCAGTGCGC GAGTACGTTC AGAGCGTTCA CAACGAGAAC TCCGGCAGCG GGCCGGTGGG TGTACAGGCC CACACCGTGC ACGGCGGTGT CGTGCACGTC ACCGGCGGCG CTCGCCCGCC GGTCGACCTC CCGATGGCCC TCGCGGAGAT ACGCACGCGG TTCCAAGCAG CCCGGGCGGC CGGTGCGGTG GACGAGGACA CCTACGCCGC CGCGGAGGCG GAGCTCGCGG TGGCCGAGGA GGCCCTCAAA GCGGACACGC CACAGAGCCA CAGCACGTTG CGGGTGGCGC TGAAGAAGCT CAAGGGGCTT GTCGGTGACG TGTCCGATCT CGCCGCGAAG ATCACGATTG TCCTGGCGCT TGCCCGGAAT TTGCCATGA
|
Protein sequence | MTSARGASFS VSVTRSYSRR VDDDPWEPPA VMVAVDLVVL TLRGASLRVL LVERGVEPYR NFLALPGGFL AHAEEDLTSA ARRELSQETG VDPRLLHLEQ FGVYGEPGRD PRGRVVSVAY LAIEPQLPEP VAGTDAIGAG WHPVDRVLAG DVTLAFDHLQ VVADGVERAR SKIEHSTLAT AFCAPIFTIA ELKDVYEAVW GVPVDPRNFY RKVQKTGGFI VSAGATRRTA GGRPARLFRA GPSAVLSPPL SRPAVSPGSP GTASAMHTEG GTRMSRRPVV ILTALDLEYQ AIRESLVDPR LHRHDQGTRF ELGRLAGGSC RIALAHVGKG THPAAVLAER AIAEFAPAAL LFVGVAGALH GHIALGDVVV ATHVYAFHGG TGEDDGLKAR PRVWETSHAA DQIARHIHRT GSWAQRPGAP AMLPDVHFGP IAAGEVVLNS RVSGLARWIR EHYNDALAIE MEGAGVAQAG HLNRALPVVV IRGVSDRADG TKESTDRQRW QQRAVANAAL FATALAEEIS AEDDGAGRPA AKSTKGAAVR EYVQSVHNEN SGSGPVGVQA HTVHGGVVHV TGGARPPVDL PMALAEIRTR FQAARAAGAV DEDTYAAAEA ELAVAEEALK ADTPQSHSTL RVALKKLKGL VGDVSDLAAK ITIVLALARN LP
|
| |