Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5171 |
Symbol | |
ID | 5673505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6203249 |
End bp | 6205069 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641244025 |
Product | adenylylsulfate kinase |
Protein accession | YP_001509435 |
Protein GI | 158316927 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0529] Adenylylsulfate kinase and related kinases |
TIGRFAM ID | [TIGR00455] adenylylsulfate kinase (apsK) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0533144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCCA CCTCCGACAC CGCCGTCGGC TCGTTCGACC CGGATCCGCA TGCCACGGCT GGTCCGCCCG ACATACCGGG CGCTGGTGAG CAGCGCGGCG CGAACGGCGT CTGTCCGCTT GCCGTCGTCG CTGCCGACCG CGCCCGCGAG CTGCGGGCCG CCTCGGTGGC CTGGCCGTCG GTCGTGCTCG ACACGCGCCA GCTCACCGAC CTGGAGCTGA TGCTGCTCGG CGCGTTCGGC CCGGCGCCGA GCTACTCGGG CCGGGCCCGC CAGCCCGCCG GGCCGCCGGT GCCGAGCCTG ACCGTCCCCG GTGAGACAGC GGCCGCGCTG GGCCCGGGCA TGGACGTCGC GCTGCGCGAC CGCGAGGGCG TGATGATCGC GGCCCTGCAC CTGCTCGGTC TGGAGCCCAC CGCGGACGAC TCACCGACGC CCGCCACCGC GATGCCGGCG GAGGCCTCCG GAACGCCGGC GACTGCCACC GGGGCGGGCC CGGTGCGCCT GGTCGGCACG GTGGAGGGCC TGGAGCTGCC CAGCCATCCG GACTACCCGC GGCTACGGCT GACCCCGGAA GGGCTGCGCG CCGAGTTCGT CGCCCGGGGC TGGGCGACCG GCGCGGGCGC GGCCCCGTGG GCGGTGTGGG CCGACGGCCT GCTGTACACC GCCGACGTCG GGCGCATCCG CGCGCTGACC CGGCAGGGCA AGCACTGCGT GATCCTGGCC CCGGTCGGCG GGGCGGACCC GGCCGACGCC GCACACCACC TGCGGGTGCG CTGCCTGCTC GCCGCGCTGG ACGCGATCGA CGCACCGCCC CGGCCGGCCG AGGCGACGCT GGCCCACGAG CCGGTCGCCT CCGACGCGGT GTCGCGGGAC GGCGGCCGGA CGCATCCGGC CAGCCCGCCG GAGCCGGCCC ACACCGCGGC GCTCGCGCCG GAGTCCCGCC GGCACCGCAG CATGCTCGTG CTGGTCCCGG TCGTCCCGTC GGAGCGGCTC GCCGCGCCCC GTGAGCCGGC GATGAGCGCG ACCGCCCCGG GCGGCCTGGC CGACGAGGCG GCGAGTGAGC CGGCGTTGGA CGAGATCGCC GAACTCACCG CCCTGCGGGC CCATCTCGCC GAGGTCTACG GCTGTGCCGG CAGCCTGACC GGGCCGGCGA TCGGCGCGCC GGGCGCCGAC GATCTCACGA CCCTGCTCGA CGCGGGTGTG CCGCTGCCCG CCGAGCTCAC CCCGCCCGCG GTGGCTGCCG AGCTGACCCG CGCGGTTCCC CCGCGCCGCC AGCGCGGCCT GACGATCCTG TTCACCGGGC TGTCCGGCTC GGGCAAGTCG ACCCTGGCGG GTCTGTTGGT GTGCCGGCTG CTCGAACGGG GCCGGCGGGT CACCCTGCTC GACGGCGACA TCGTGCGGAC GCATCTCTCC CAGGGCCTGG GCTTCTCCCG CGCCGACCGG GACACGAACG TGCGCCGCAT CGGCTTCGTC GCCGCGGAGG TCGCGGGCGC CGGCGGGATC GCCGTGTGCG CGCCGATCGC GCCCTACGAC GACGTCCGCG CCCAGGTGCG GGCGATGACC ACCGCCCGCG GCGGCGGCTT CGTGCTCGTG TACGTGTCGA CCCCGCTGGA GGTGTGCGAG GCACGGGACC GCAAGGGCCT CTACGCCAAG GCCCGGGCGG GGGTGATCCC CGCCTTCACC GGCGTCTCCG ACCCGTACGA GGAGCCGGCC GACGCCGACG TGGTGGTCGA CACGGCGGGC CTGCCGACCG AGCAGGCCGT CGACCGGGTG CTCGCGCACC TGGTCGAGGC CGGCTGGGTC GAGGGCGCCC GCGGCCAGTA G
|
Protein sequence | MSATSDTAVG SFDPDPHATA GPPDIPGAGE QRGANGVCPL AVVAADRARE LRAASVAWPS VVLDTRQLTD LELMLLGAFG PAPSYSGRAR QPAGPPVPSL TVPGETAAAL GPGMDVALRD REGVMIAALH LLGLEPTADD SPTPATAMPA EASGTPATAT GAGPVRLVGT VEGLELPSHP DYPRLRLTPE GLRAEFVARG WATGAGAAPW AVWADGLLYT ADVGRIRALT RQGKHCVILA PVGGADPADA AHHLRVRCLL AALDAIDAPP RPAEATLAHE PVASDAVSRD GGRTHPASPP EPAHTAALAP ESRRHRSMLV LVPVVPSERL AAPREPAMSA TAPGGLADEA ASEPALDEIA ELTALRAHLA EVYGCAGSLT GPAIGAPGAD DLTTLLDAGV PLPAELTPPA VAAELTRAVP PRRQRGLTIL FTGLSGSGKS TLAGLLVCRL LERGRRVTLL DGDIVRTHLS QGLGFSRADR DTNVRRIGFV AAEVAGAGGI AVCAPIAPYD DVRAQVRAMT TARGGGFVLV YVSTPLEVCE ARDRKGLYAK ARAGVIPAFT GVSDPYEEPA DADVVVDTAG LPTEQAVDRV LAHLVEAGWV EGARGQ
|
| |