Gene Franean1_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1946 
Symbol 
ID5670347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2338856 
End bp2341729 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content74% 
IMG OID641240867 
Productheat shock protein 70 
Protein accessionYP_001506289 
Protein GI158313781 
COG category[O] Posttranslational modification, protein turnover, chaperones
[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component
[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.253458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.635918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTTACC AGCTTGGCAT CGACGTCGGA TCGGCCACCA CAGTCGTCGC CGCCACGGAC 
GGGGGCTGGC CCGCCGTGCT GACCCTGGGC GGCGCCCGAG CCGTGCCGTC CGTCCTGTAC
ATGCCCCAGA CGGGCGGTGT GCTGTTCGGG CGCTCGGCTG AGAGGCGGGC CCGCACCGAT
CCCGACCGGG CCGCCCGGGG GTTCCTGCGC CGGCTCGGCG AGCCCGGCCA CCTGCTGGTC
GGCGGCGCCG CGTACAGCCC GGACGGGCTG CTCGCCCGGC TGGTCGGCCA CCTGGTCGGG
CAGGTCGTCG CGGCCCGGGG GGAGGAGCCC GAGCAGATCG TCGTCGCGCA TCCCGCCTTC
TGGCCCGCAC ACCGCCGCGA GGTGTTCGCG TCCGCGGTCA GCCAGCTCTC GGACGTCTCT
GCACCCGTCG CGACCTGCGC GGCCGCCGAC GCGATCGGCA CCCTGCTCGC CCGCCGCTCG
GGGACCCGCA CGGTCGACCT CGTCGGGGTC TACGACTTCG GCGCCGGGCA CTTCGACGCG
GCGGTGCTCT CGTTCAGCCC GTTCGGGTTC CAGCAGCTGG GCACATCGGT CGGAGTGAAC
CACGCCGGCG GGGCCGACTT CGACGAGCTG CTGGTCGAGC GGGTGCTGGC CGAGGCCGGC
GCCGGCCGGG AGCGGCTCGA CCGCTCCGAT CCCGCGGTCA CCGCGGCACT CGCCCGGCTG
CGTGAGGAGT GCGCCCAGGC GAAGGAGTAC CTCGCGGAGG AGGACGAGAT CGAGGTCGCC
CTCGCGCTCC CCGGCCTGCC CGCGACGTCG GTGGTGCTGC GCCGCGCGGA TCTTGAGACC
CTCGTCGCGC CGGTCGTCGA CGACACGGTC CGGGCGTTCC GGCGGACCCT CCGAACGGGC
GAGGCGACCC CCGAGGATCT CTCCTCCGTG CTCCTCTACG GCGGCGCGGC GCGGATGCCG
ATCGTCGCCG CCCAGATGCG GGCGGCGTTC CCGAGCGTCG GCCGGTGGGA GTACGGCTCG
GACGACGACA TCGCGACAGG CGCGGCGCTG ATCGCCGCGC GGCTGGCCGC CCAGTCCTCC
CGCGAGGAGG TCACCTCGGT CATCCGGCCG CCGGACAGCA GCCCGCCGAT CCTCTCCGCG
CCGCCGCTGA GCTCTGTCCC GCCCCTGAGC CCGGTCCCGC CGGTCGGGTC AGTGCCGCCG
GTCGGGTCAG TGCCGCCCGG CGCGGTGGCG TCGACCGGCG GGCCGGCCGC CGGTGACGGC
GACACGTCCA GCGGGCCGGT TCCCCCCGGC GGGGCGGTGT TCGGGGGAGC GGCCGCCGGC
GTCAGCGCCT CCAGCCTGGG CTACACCTCA TGGCCCGACC GGACGGGACA GCCGCCCGCC
GCCGACCCCG ACGCCACGGC GATCGGCCAC CACGGCGGCT CGGCGGACGA CACCCTGATC
TCCCGCGGCG GCCACGGAAC ACCGCCACCC ACCACGGCCC CGCCACCCAC CGGGCCGACG
TCCACAGGCG CGACGTCCAC CGGGGAGCCG GCTGGCGCGG GGTACCAGGG CGGTTGGGGG
AGCAGCCCGT ACCCGTACAG CGACGCCGCG CACACCGTGG CGGCCGGCTC GTCGGCCGGC
GGCGACCGCA CCCAGGCGGT CGCCACGCCC GGCGGCCCCG GGACGACAGG AGCGCCCGCG
ACCACCGGAG TCACCGGAGT CACCGGGATC GCTGGGACCG CCGGCGGCGG TGATCCTCAG
TCGGCCGCGC CGACACCGTC CCAGGCCGGA ACGTTCGGCG GGAGCTCCGC CGGCACCCGG
GCCGTCCCGC GCGGCGGGCT GTTCGGCGGC TGGTCCCGCG CGACGATCGC CGCGGCCGTG
GCCGCGATCG TCTTCGTCGC CGCCGGCACC ACCCTGGGCA TCGTGCTGAC CGGAGGCGGC
GGCGACACGC CGTCCAACGG CATCGTGCCG ATCGCCGCGC CCGCCGCCAC GCTGCCGCCG
CCCGCGGCGA CGACCGCGCC GCCCGCGGCC CCGACACCCG GACCCAACAC GGTGCTCGTC
GCGGGCTCCA GCGAGGTCGC GCCGATCACC GAGACCGCCT ACGCCGGGTT CCGCAAGGTT
CAGCAGAACG TGACCGTGAA CGTCGAGGCG TCGACGACGG AGGACGGCTT CGCCAAGCTC
TGCGCGGGCG GCGCCGACAT CGCCGGCGCG TCCTTCGAGT TCGACCCGTC CTTCTCGAAG
GACCCCGGCT GCGCCGACCA GATCGTCGGG TTCGAGGTCG CGCACCACAC ACTGCCGATC
GTGGTGAACC CGCAGAACAC CTGGGCGCGC TGCATGACCC TCGACCAGGT GCGCAAGGTC
TGGGACGCCG GCTCGACGAT CAACCGGTGG AACCAGATCG ACCCGTCCTT CCCGGACGAG
CCGATCACGT TCGTGGGGCC GTCCCGGAAC ACCGTCCAGG CGCAGGTGTT CAACTCGACG
GTGAACGACT CCAGCTCCCG GTCCCGGCAG TACCAGGAGA CCGACCTGAG CGGGGTCGCG
AACGACGTCG CCGGTGACCG GTTGGCCATG GGCTTCCTGG ACTTCCCGAC GTTCGAGACC
TTCGGGCCCC GACTGAGGGG CCTGGAGATC GACAACGGTG AGGGATGTGT CGAACCGAAC
GCGGTGACGG CCGGAACGGG CTTCTACCTC CCGCTGTGCA AGCCGGTGTT CGTCTACGCC
CGTAAGGACT CGCTGCAGAA GCCCGCGGCC GCCGCGTTCA TGCGCTACTA CATGGAGAAC
GGCGAGGAGA TCGCCTTCGA CGCGCACTAC GTCCCGCGGA CCAAGAGCAC GATCGACGAG
AACGTGGCCC GCGTCGACGA GCTGACAAAG GGAGTACCAC CCGTCACGGC CTGA
 
Protein sequence
MGYQLGIDVG SATTVVAATD GGWPAVLTLG GARAVPSVLY MPQTGGVLFG RSAERRARTD 
PDRAARGFLR RLGEPGHLLV GGAAYSPDGL LARLVGHLVG QVVAARGEEP EQIVVAHPAF
WPAHRREVFA SAVSQLSDVS APVATCAAAD AIGTLLARRS GTRTVDLVGV YDFGAGHFDA
AVLSFSPFGF QQLGTSVGVN HAGGADFDEL LVERVLAEAG AGRERLDRSD PAVTAALARL
REECAQAKEY LAEEDEIEVA LALPGLPATS VVLRRADLET LVAPVVDDTV RAFRRTLRTG
EATPEDLSSV LLYGGAARMP IVAAQMRAAF PSVGRWEYGS DDDIATGAAL IAARLAAQSS
REEVTSVIRP PDSSPPILSA PPLSSVPPLS PVPPVGSVPP VGSVPPGAVA STGGPAAGDG
DTSSGPVPPG GAVFGGAAAG VSASSLGYTS WPDRTGQPPA ADPDATAIGH HGGSADDTLI
SRGGHGTPPP TTAPPPTGPT STGATSTGEP AGAGYQGGWG SSPYPYSDAA HTVAAGSSAG
GDRTQAVATP GGPGTTGAPA TTGVTGVTGI AGTAGGGDPQ SAAPTPSQAG TFGGSSAGTR
AVPRGGLFGG WSRATIAAAV AAIVFVAAGT TLGIVLTGGG GDTPSNGIVP IAAPAATLPP
PAATTAPPAA PTPGPNTVLV AGSSEVAPIT ETAYAGFRKV QQNVTVNVEA STTEDGFAKL
CAGGADIAGA SFEFDPSFSK DPGCADQIVG FEVAHHTLPI VVNPQNTWAR CMTLDQVRKV
WDAGSTINRW NQIDPSFPDE PITFVGPSRN TVQAQVFNST VNDSSSRSRQ YQETDLSGVA
NDVAGDRLAM GFLDFPTFET FGPRLRGLEI DNGEGCVEPN AVTAGTGFYL PLCKPVFVYA
RKDSLQKPAA AAFMRYYMEN GEEIAFDAHY VPRTKSTIDE NVARVDELTK GVPPVTA