Gene Franean1_5171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5171 
Symbol 
ID5673505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6203249 
End bp6205069 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content77% 
IMG OID641244025 
Productadenylylsulfate kinase 
Protein accessionYP_001509435 
Protein GI158316927 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases 
TIGRFAM ID[TIGR00455] adenylylsulfate kinase (apsK) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0533144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCA CCTCCGACAC CGCCGTCGGC TCGTTCGACC CGGATCCGCA TGCCACGGCT 
GGTCCGCCCG ACATACCGGG CGCTGGTGAG CAGCGCGGCG CGAACGGCGT CTGTCCGCTT
GCCGTCGTCG CTGCCGACCG CGCCCGCGAG CTGCGGGCCG CCTCGGTGGC CTGGCCGTCG
GTCGTGCTCG ACACGCGCCA GCTCACCGAC CTGGAGCTGA TGCTGCTCGG CGCGTTCGGC
CCGGCGCCGA GCTACTCGGG CCGGGCCCGC CAGCCCGCCG GGCCGCCGGT GCCGAGCCTG
ACCGTCCCCG GTGAGACAGC GGCCGCGCTG GGCCCGGGCA TGGACGTCGC GCTGCGCGAC
CGCGAGGGCG TGATGATCGC GGCCCTGCAC CTGCTCGGTC TGGAGCCCAC CGCGGACGAC
TCACCGACGC CCGCCACCGC GATGCCGGCG GAGGCCTCCG GAACGCCGGC GACTGCCACC
GGGGCGGGCC CGGTGCGCCT GGTCGGCACG GTGGAGGGCC TGGAGCTGCC CAGCCATCCG
GACTACCCGC GGCTACGGCT GACCCCGGAA GGGCTGCGCG CCGAGTTCGT CGCCCGGGGC
TGGGCGACCG GCGCGGGCGC GGCCCCGTGG GCGGTGTGGG CCGACGGCCT GCTGTACACC
GCCGACGTCG GGCGCATCCG CGCGCTGACC CGGCAGGGCA AGCACTGCGT GATCCTGGCC
CCGGTCGGCG GGGCGGACCC GGCCGACGCC GCACACCACC TGCGGGTGCG CTGCCTGCTC
GCCGCGCTGG ACGCGATCGA CGCACCGCCC CGGCCGGCCG AGGCGACGCT GGCCCACGAG
CCGGTCGCCT CCGACGCGGT GTCGCGGGAC GGCGGCCGGA CGCATCCGGC CAGCCCGCCG
GAGCCGGCCC ACACCGCGGC GCTCGCGCCG GAGTCCCGCC GGCACCGCAG CATGCTCGTG
CTGGTCCCGG TCGTCCCGTC GGAGCGGCTC GCCGCGCCCC GTGAGCCGGC GATGAGCGCG
ACCGCCCCGG GCGGCCTGGC CGACGAGGCG GCGAGTGAGC CGGCGTTGGA CGAGATCGCC
GAACTCACCG CCCTGCGGGC CCATCTCGCC GAGGTCTACG GCTGTGCCGG CAGCCTGACC
GGGCCGGCGA TCGGCGCGCC GGGCGCCGAC GATCTCACGA CCCTGCTCGA CGCGGGTGTG
CCGCTGCCCG CCGAGCTCAC CCCGCCCGCG GTGGCTGCCG AGCTGACCCG CGCGGTTCCC
CCGCGCCGCC AGCGCGGCCT GACGATCCTG TTCACCGGGC TGTCCGGCTC GGGCAAGTCG
ACCCTGGCGG GTCTGTTGGT GTGCCGGCTG CTCGAACGGG GCCGGCGGGT CACCCTGCTC
GACGGCGACA TCGTGCGGAC GCATCTCTCC CAGGGCCTGG GCTTCTCCCG CGCCGACCGG
GACACGAACG TGCGCCGCAT CGGCTTCGTC GCCGCGGAGG TCGCGGGCGC CGGCGGGATC
GCCGTGTGCG CGCCGATCGC GCCCTACGAC GACGTCCGCG CCCAGGTGCG GGCGATGACC
ACCGCCCGCG GCGGCGGCTT CGTGCTCGTG TACGTGTCGA CCCCGCTGGA GGTGTGCGAG
GCACGGGACC GCAAGGGCCT CTACGCCAAG GCCCGGGCGG GGGTGATCCC CGCCTTCACC
GGCGTCTCCG ACCCGTACGA GGAGCCGGCC GACGCCGACG TGGTGGTCGA CACGGCGGGC
CTGCCGACCG AGCAGGCCGT CGACCGGGTG CTCGCGCACC TGGTCGAGGC CGGCTGGGTC
GAGGGCGCCC GCGGCCAGTA G
 
Protein sequence
MSATSDTAVG SFDPDPHATA GPPDIPGAGE QRGANGVCPL AVVAADRARE LRAASVAWPS 
VVLDTRQLTD LELMLLGAFG PAPSYSGRAR QPAGPPVPSL TVPGETAAAL GPGMDVALRD
REGVMIAALH LLGLEPTADD SPTPATAMPA EASGTPATAT GAGPVRLVGT VEGLELPSHP
DYPRLRLTPE GLRAEFVARG WATGAGAAPW AVWADGLLYT ADVGRIRALT RQGKHCVILA
PVGGADPADA AHHLRVRCLL AALDAIDAPP RPAEATLAHE PVASDAVSRD GGRTHPASPP
EPAHTAALAP ESRRHRSMLV LVPVVPSERL AAPREPAMSA TAPGGLADEA ASEPALDEIA
ELTALRAHLA EVYGCAGSLT GPAIGAPGAD DLTTLLDAGV PLPAELTPPA VAAELTRAVP
PRRQRGLTIL FTGLSGSGKS TLAGLLVCRL LERGRRVTLL DGDIVRTHLS QGLGFSRADR
DTNVRRIGFV AAEVAGAGGI AVCAPIAPYD DVRAQVRAMT TARGGGFVLV YVSTPLEVCE
ARDRKGLYAK ARAGVIPAFT GVSDPYEEPA DADVVVDTAG LPTEQAVDRV LAHLVEAGWV
EGARGQ