Gene Franean1_5138 details


Gene Information

Locus tag: Franean1_5138
Symbol:
ID: 5673472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6155734
End bp: 6158517
Gene Length: 2784 bp
Protein Length: 927 aa
Translation table: 11
GC content: 72%
IMG OID: 641243988
Product: (p)ppGpp synthetase I, SpoT/RelA
Protein accession: YP_001509402
Protein GI: 158316894
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
TIGRFAM ID: [TIGR00691] (p)ppGpp synthetase, RelA/SpoT family
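
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 6155734
end_bp = 6158517
gene_length_bp = 2784
protein_length_aa = 927

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: the span is 2784 bp, which is 928 codons, leaving 927 residues once the stop codon is excluded.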


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.918699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0221462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATGCTG AGGCCGCGGG AGCGGGTCTG CCGGCGGGCG GCGGCTTCTC GCCCGCACTG 
CCCGCACTGT CCAGGGCGGA CGGTGACGCG GACTCCCGGA CGATCGCCGC GGCCCGCCAG
GCCAGCATGC GCCTGGCGCA CCTGGCCCGC CGGATGGCCG CGCCGCGCAC CCCGCAGGTG
CCCCCCGAGC TGCGCGATCT CGTCGAGGCG CACCGGGACT TCCATCCCAA GGCCGACATC
AGCGCGGTGA TCCGCGCCTA CTCGGTGGCG GACGGCCTGC ACGCCGGCCA GACCCGGCGC
AGCGGCGACC CGTACATCAG CCATCCGCTG GCCGTCGCCG AGGTGCTCGC CGAGCTCGGT
GTCGACACGA CGACCCTGAT CGCCGCGCTG CTGCACGACA CGGTCGAGGA CACCGGCTAC
ACCCTCGAGT CGGTCGCCGC AGAGTTCGGC GGGGAGGTCG CCAACCTCGT CGACGGTGTC
ACCAAGCTCG ACAAGATGCG TTTCGGTGAG GCCGCCGAGG CCGAGACGCT GCGCAAGCTG
ATCGTCGCGC TCGCCCGCGA CTACCGCGTC CTGGTCATCA AGATCGCCGA CCGGCTGCAC
AACATGCGGA CCCTCGGCTT CATGTCACCT GCCAAGCAGC AGAAGATCTC CCGGGTCACG
CTCGAGGTCC TCGCTCCGTT GGCGCACCGC CTCGGGGTGA GCGTGATCAA ACGGGAGCTC
GAGGACCGCG CGTTCGCGGT GCTCGACCCC GAGGAGCACC ACCGGATCTC CACCGTCGTC
GACGACTTCA CCACCGCCGA ACGGGCGAGC GGTGTGCTCG CCGCGATGGT CACGCGGATG
CGCGCCGGCC TCGCCGAGGC CCGGGTGGAC GGCGCGGTGT CCATCCGGAC CAGCCACATC
TTCTCGATCT ACAAGCGCGG CCAGGAACGC GGCCGCCCAC CGCGCGACTA CAACGACATC
GTCCGGGTGC TCGTGCTCGT CGACGACATC ACGGACTGCT ACGCCTCGCT GGGTGTCATC
CACGGGATCT GGCGACCGGT GCCCGGGCGG CTGCGCGACT TCGTGGCGAC CCCCAAGTTC
AACATGTACC AGAGCCTGCA CACCAGCGTG ATGGACGAGA CCGGGCGCAC GATCGACATC
CAGATCCGGA CCCCGTCCAT GCACCGCCTG GCCGAGACCG GGATCGTCGC GAAGCCCGTC
GGCCCCGGCG CGGACGGCGC CCGCCTCGAA GGGCTCTCCT GGCTGCACAG CCTGCTCGAC
TGGCAGGTCG ACACCGTTGA CCCGGGGGAG TTCCTGGAGT CGCTGTCGTC CGACCTGAAC
TCCGACGAGG TGCTCACCTT CACCCCCAAG GGCAAGATGA TCGCCCTGCC GGCGCGCTCG
TCGCCGGTCG ACGTGGCCTA CGCGGTCCAC ACCGACGTCG GCCACCGGGC CATCGGAGCC
CGGGTGAACG GCCGGCTCGT GCCGCTGCAC ACTCGGCTGC GCAACGGCGA CGTCGTCGAG
ATCCTCACCT CGAACCTGCC CGGCGCCGGC CCGAGCGAGG ACTGGCTCGA GTTCGTGCGG
ACGTCCCGCG CCCGGGTCCG GATCCGTAAG CGGCTGGCCC GGCAGCGACG CGACGCGCAG
GCCCGGGCCA CCCAGGCGGC CGCGGACCGG TCGCTGGTGG TGACCGAGCG TGCGGCCTCC
GACGCCCGCG CCGCGGGCGC GCCGTGGACA GCCCGGGGAG CGGGCGCGGG GGGCGGCGGG
CGGGTCACCC GGCTGCGCCC GTCCGCGCAG CCCGCCCGGG TGGCGGACGC GGCCGACCCG
AATGGTCCCC GGGGCGGCGC TGACGGGGCC GCGGGCTCGC CGCGCGCCCA TTCCGGTGGT
TCGACAATGC GCGCCGGTTC TCCACGTACC GGCGGTGCGA ATTCCCTGGG CGGTTCGATT
ATCTCGGGCG AACCCGCCGG ATTGAACGGT TCCGCCGGAT TGAACGGCGC GGCCGCATTG
AATGGCACGG CCGGCCCGCA GGAATCCGTG AGATCGGGTG ACAGCGCCAC TTCGGGTGAC
AGCGCCTCTT CGGACGGTGT CGCCGGCCCG GGAGGGTCTG GTCAGGGCCG AGACGCCCGT
TTCGCCGGCT CCGCCGGTGC GGTGGGCGGA GCCAGTTCCG GTGGCTCCAG GGGCTCAGGG
GAGGACAGCG CGCTGTCCCG GGGCGCGCGG CGCACGCGGC GTTCCGCCGC GGCACAGCGT
GGTGATCGCC CGGGTGGCCC GGACGGGCCC GGCGCGGGGA GTTCCCAGGG TCGGGAATCC
GCCTCGTCCG CGGGGGGTGA CGGTTCCGCA CGCCGGGGAA CGCAGAACTG GGCGGGCTTC
GCCGAGATCG ACGGCGCGGA TGTGCCGGTG CGGCTCGCGC GATGTTGTCT TCCGCTCCCG
GGCGACACGG TCGTCGGCTT CACCACGCAC AGCAGCTCGG TCTCTCTTCA CCGGCAGGAA
TGCGCGAACG TCGCCGCTTC GGCGTCCTCG CGCGAGCAGG TGACCGTCAA GGGGTGGACC
GCTCCGGAGA GTCAGACCTT CCCCGCCGAA ATCGCCGTGG AGGCGTTCGA CCGCTACGGA
TTGCTGGCGG ATATCACCGA GGTCCTCTCG GACACCTCCG CCTCGGTGCG GGCGGCGTCC
ACGTCGACAT CCGAGGACAG GGTGGCGCGC GCTCGCTTCA CCATCGAGGT CACCGGCCCG
GACCAGCTCG ACCGCGTGCT GGCGGCCGTG CGCGGGGTAG GCGGGGTGTA CGACTGTTAC
CGGGCATGTC AAACAACGGT ATGA
 
Protein sequence
MNAEAAGAGL PAGGGFSPAL PALSRADGDA DSRTIAAARQ ASMRLAHLAR RMAAPRTPQV 
PPELRDLVEA HRDFHPKADI SAVIRAYSVA DGLHAGQTRR SGDPYISHPL AVAEVLAELG
VDTTTLIAAL LHDTVEDTGY TLESVAAEFG GEVANLVDGV TKLDKMRFGE AAEAETLRKL
IVALARDYRV LVIKIADRLH NMRTLGFMSP AKQQKISRVT LEVLAPLAHR LGVSVIKREL
EDRAFAVLDP EEHHRISTVV DDFTTAERAS GVLAAMVTRM RAGLAEARVD GAVSIRTSHI
FSIYKRGQER GRPPRDYNDI VRVLVLVDDI TDCYASLGVI HGIWRPVPGR LRDFVATPKF
NMYQSLHTSV MDETGRTIDI QIRTPSMHRL AETGIVAKPV GPGADGARLE GLSWLHSLLD
WQVDTVDPGE FLESLSSDLN SDEVLTFTPK GKMIALPARS SPVDVAYAVH TDVGHRAIGA
RVNGRLVPLH TRLRNGDVVE ILTSNLPGAG PSEDWLEFVR TSRARVRIRK RLARQRRDAQ
ARATQAAADR SLVVTERAAS DARAAGAPWT ARGAGAGGGG RVTRLRPSAQ PARVADAADP
NGPRGGADGA AGSPRAHSGG STMRAGSPRT GGANSLGGSI ISGEPAGLNG SAGLNGAAAL
NGTAGPQESV RSGDSATSGD SASSDGVAGP GGSGQGRDAR FAGSAGAVGG ASSGGSRGSG
EDSALSRGAR RTRRSAAAQR GDRPGGPDGP GAGSSQGRES ASSAGGDGSA RRGTQNWAGF
AEIDGADVPV RLARCCLPLP GDTVVGFTTH SSSVSLHRQE CANVAASASS REQVTVKGWT
APESQTFPAE IAVEAFDRYG LLADITEVLS DTSASVRAAS TSTSEDRVAR ARFTIEVTGP
DQLDRVLAAV RGVGGVYDCY RACQTTV
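
The sequences above are listed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the demo input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "GTGAATGCTG AGGCCGCGGG AGCGGGTCTG CCGGCGGGCG GCGGCTTCTC GCCCGCACTG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence through the same two functions would allow its length and GC content to be compared with the Gene Length and GC content fields of the record.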