Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4819 |
Symbol | |
ID | 5673160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5752854 |
End bp | 5754713 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243675 |
Product | ABC transporter related |
Protein accession | YP_001509091 |
Protein GI | 158316583 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0993193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGG TTCCCGATGT GGCCCCGGAA CCCGTGGCCC CGGAACCCGT GGCCCGGGAG CCTGTGACGG CGGGCCCTGT CGCGGCGGAG CGCGTGGCCG CGGAGCGGGA CGAACGGCCT GCCCTGTTGG AGATCGACGG ACTGGAGGTC GACTACCGGC CACGCGGTGA GGTCGTGCAC GCCGTGCGCT CGATCAGTCT CCGGGTCGCC GCCGGGGAGG TCGTGGCGAT CGTCGGCGAG TCCGGTTCGG GGAAGTCCAC GACCGCGCAC GCGGTGCTGC GACTGCTGCC GCGTTCGGCC CGGATCGCCG CCGGCCGGAT CCGGTTCGAC GGCCAGGACG TCACCGGCCT GAGCGAGCGC CGTTTCCGGG CTTTGCGCGG TCGGGACATC GCGCTCATCC CGCAGGAGCC GACGACGTCC CTCAACCCCA CCCAGCGGAT CGGTACGCAG GTCGCCGAGG TGCTGATCGT CCACGGCCTG GCCGACCGGC GCACGGCCGG GCGGGAGGCA CTGCGGGTGC TGGCGGACGC GGGCATGCCC GAGCCGGAAC TGCGGGCCCG CCAGTACCCG TCGCAGCTCT CGGGCGGCCT GTGCCAGCGG GCCCTGATCT CCGTCGCCAT CGCGGCGAAG CCGCGCCTGA TCGTCGCGGA CGAGCCGACG AGCGCCCTGG ACGTCACCGT TCAACGCCGG ATCCTCGACC ATCTGACCAG CCTCACCCGG GACGCGGGGA CAAGCGTCCT GCTGGTGACG CACGATCTGG GGGTGGCCGC CGAGCGGGCG GACCGGATCC TGGTCATGGC GGACGGCCGG ATCGTCGAGG AGGGCGCCCC GGAACGGATT CTCGACCAAC CGGAGCACCC GTACACCCGG GCGCTGATCG CGGCCGCGCC GAGCCTCGAA CCCCGCCCGC GACTGGTACC GCGCCCGACG CCGGCAACCC CGGCCGCGGC GCCCCTCTCG ACAGCACCCT CCACGGCAAC GCCCTCCACG GCAGCACCCT CGACGGCAAC GCACCCCGCG ACAGCGACGG CCGTGGAGCC ACCGGCCGCG GCACCGCCGG CCGAGGTTCC GCACCTGCGC GTCGACGACC TGGTCAAGGA GTTCCGGCTG CCCCGCTCCG GGACCTCCGC GCGGGCGGTG CGCGCGGTGG ACGGCGTCAG TTTCACCATC CAGCGCGGAC GGACCTTCGC GCTGGTGGGC GAATCCGGCT CGGGGAAGTC GACGGCCGCT CGGCTCGTGC TGCGGCTGAC CGAACCCACC TCGGGGCGGA TCACGCTCGC CGGCGAGGAC ATCACCCACA CCCGCGGCGA GGAGCAGCGG GCGCTGCGGC GCCGGATGCA GGTCGTCTAC CAGAACCCGT ACGCCTCGCT GAACCCGCGG TTCACCGTCG AGCAGATCAT CACCGACCCA CTGGCCTCGT TCGGCGTCGG CGCCCGGGCC GAGCGCCGGC GGCGGGCCGC CGAGCTCGTC GATCTCGTCG CGCTGCCCGC GGCCATGCTG ACGCGGCGTC CCGCCGAGCT CTCCGGCGGC CAGCGCCAGC GGGTCGCGAT CGCGCGGGCG CTGGCGCTGC GCCCGGAGCT CGTCGTCTGC GACGAGGCCG TCTCCGCGCT GGACGTCTCG GTGCAGGCGC AGATCCTCGA CCTGCTGGGG CGGCTGCAGA CCGAGCTGGG CGTCAGCTAT CTGTTCATCT CGCACGACCT CGCGGTCGTC CGGCGGATCG CCGACGGGGT CGGCGTCATG CGCCGCGGCC GGCTGGTCGA GAGCGGCAGC ACCGAAGCGG TCTTCACCAG CCCCGCCGAC CCGTACACGC GTGAGCTGCT GGCGTCCATT CCCCGCCGCC GGCCGAAACC GCCCAGCTGA
|
Protein sequence | MNTVPDVAPE PVAPEPVARE PVTAGPVAAE RVAAERDERP ALLEIDGLEV DYRPRGEVVH AVRSISLRVA AGEVVAIVGE SGSGKSTTAH AVLRLLPRSA RIAAGRIRFD GQDVTGLSER RFRALRGRDI ALIPQEPTTS LNPTQRIGTQ VAEVLIVHGL ADRRTAGREA LRVLADAGMP EPELRARQYP SQLSGGLCQR ALISVAIAAK PRLIVADEPT SALDVTVQRR ILDHLTSLTR DAGTSVLLVT HDLGVAAERA DRILVMADGR IVEEGAPERI LDQPEHPYTR ALIAAAPSLE PRPRLVPRPT PATPAAAPLS TAPSTATPST AAPSTATHPA TATAVEPPAA APPAEVPHLR VDDLVKEFRL PRSGTSARAV RAVDGVSFTI QRGRTFALVG ESGSGKSTAA RLVLRLTEPT SGRITLAGED ITHTRGEEQR ALRRRMQVVY QNPYASLNPR FTVEQIITDP LASFGVGARA ERRRRAAELV DLVALPAAML TRRPAELSGG QRQRVAIARA LALRPELVVC DEAVSALDVS VQAQILDLLG RLQTELGVSY LFISHDLAVV RRIADGVGVM RRGRLVESGS TEAVFTSPAD PYTRELLASI PRRRPKPPS
|
| |