Gene Franean1_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1231 
Symbol 
ID5669644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1472802 
End bp1474313 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content72% 
IMG OID641240163 
Productsmall GTP-binding protein 
Protein accessionYP_001505591 
Protein GI158313083 
COG category[R] General function prediction only 
COG ID[COG2262] GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03156] GTP-binding protein HflX 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.592117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0200783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTC GCGCTGACGC CGCTGTGGCG TCCACCCGGC ATCTGAAACT GCACTCGGGC 
GAAAACGACG GCCACGACGG CCACGACGGT CGTGACGGTC GCGCCGGCAC CGCGCTCGCC
CTGGCCGAGG CCGACGGGTT CTCGTTCTTC GACGAGGAGG GCGACGGGCT CGACCTGGAG
GACCGTGGCG CGCTGCGCCG TGTCCCGGGC CTCACCACCG AGCTCGAGGA CGTCACCGAG
GTCGAGTACC GCCAGCTCCG GCTGGAACAG GTCGTTCTGA TCGGGGTGTG GACGTCGGGC
TCGCAGGTCG AGGCCGAGGC GTCCATGGCG GAGCTGGCCG CGCTCGCCAC CACCGCCGGC
TCGGTCGTGC TCGACGCGCT CGTGCAGCGC CGGGACAGGC CCGACGCAGC GACCTTCGTG
GGCTCCGGCA AGGCCAAGGA GCTCGCCGAG ATCGTACTGG CGACCGGCGC GGACACGGTG
ATCTGCGACG GCGAGCTGAC CCCCGGCCAG CTGCGCCAGC TCGAGGAGGT CGTCAAGGTC
AAGGTGATCG ACCGGACGGC GCTGATCCTC GACATCTTCG CCCAGCACGC GACGTCGCGC
GAGGGCAAGG CGCAGGTCGA GCTCGCCCAG CTGCAGTACA TGCTGCCGAG GCTGCGCGGC
TGGGGTGAGT CGATGTCCCG GCAGGCCGCC AGCGGCGGCC GGGCGCCGAT CGGTACCCGT
GGTCCCGGTG AGACGAAGAT CGAGACGGAC CGTCGCCGGC TGCGCGCGCG CATCACCAAG
CTGCGCCGAG AGCTCACCGG GATGGCCACC GTCCGGGCGA CCAAGAGGTC GTCGCGCCGC
CGCGGTGCCG TTCCCGCCGT CGCGATCGCC GGCTACACCA ACGCCGGCAA GTCGTCGCTG
CTCAACCGAC TGACCGGGGC GGGCGTACTG GTCGAGGACG CGCTGTTCGC GACCCTCGAC
CCGACGGTGC GCCGGGCCAC GCTGCCCGAC GGCCGGATCT TCACCCTGGC CGACACGGTC
GGCTTCGTGC GCCACCTGCC GCACCAGATC GTCGAGGCGT TCCGCTCGAC GCTCGAGGAG
GTGGTGGACG CCGACCTCGT CCTGCACGTC GTCGACGGTT CGGCCCCGGA CCCGATGGGG
CAGATCTCGG CTGTGCGCGA GGTGCTCGCC GAGATCGACG CGGCCGGGGT GCCCGAGCTC
ATCGTGGTCA ACAAGGTGGA CGCGGTCGAC CCGACCACGC TCGCCGTCCT GCGCCAGGCG
GTGCCGGACG CCATCTTCGT CTCGGCGCGG TCCGGGGCGG GGCTGCAGGA GCTGGTGGAG
GCGCTCTCGG CGCGGATACC GCACCCGGAG GTCGAGATGT CCCTGCTGGT GCCCTACACC
CGGGGCGATC TCGTCTCCCG CATCCACCAG ATCGGCGAGG TGCTCCGAGT TGAGCACACC
GGCAAGGGCA CCGAGGTCGC CGCGCGCGTG CCCGTGGGCC TGGCCGCCGA GCTGGAGCCG
TTCCGCGCCT GA
 
Protein sequence
MSLRADAAVA STRHLKLHSG ENDGHDGHDG RDGRAGTALA LAEADGFSFF DEEGDGLDLE 
DRGALRRVPG LTTELEDVTE VEYRQLRLEQ VVLIGVWTSG SQVEAEASMA ELAALATTAG
SVVLDALVQR RDRPDAATFV GSGKAKELAE IVLATGADTV ICDGELTPGQ LRQLEEVVKV
KVIDRTALIL DIFAQHATSR EGKAQVELAQ LQYMLPRLRG WGESMSRQAA SGGRAPIGTR
GPGETKIETD RRRLRARITK LRRELTGMAT VRATKRSSRR RGAVPAVAIA GYTNAGKSSL
LNRLTGAGVL VEDALFATLD PTVRRATLPD GRIFTLADTV GFVRHLPHQI VEAFRSTLEE
VVDADLVLHV VDGSAPDPMG QISAVREVLA EIDAAGVPEL IVVNKVDAVD PTTLAVLRQA
VPDAIFVSAR SGAGLQELVE ALSARIPHPE VEMSLLVPYT RGDLVSRIHQ IGEVLRVEHT
GKGTEVAARV PVGLAAELEP FRA