Gene Francci3_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3003 
Symbol 
ID3905500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3558990 
End bp3561797 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content72% 
IMG OID637880323 
Productheat shock protein 70 
Protein accessionYP_482089 
Protein GI86741689 
COG category[O] Posttranslational modification, protein turnover, chaperones
[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component
[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.234156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCCGG TCGCCGTGAT GATCGCGTGG GTTTCGGGAG ACGCGGTGAG CTATCAGCTG 
GGTATCGACC TCGGGTCCGC GAACACGATC GTCGGGGTGG CGGACGGAGG CTGGCCCCGC
GTGCTCGAAC TGAGCGGTCA GCGTCGCCTG CCGTCGGTGA TCTACGCCGC CCCGGCCGGT
GGCCTGCAGT TCGCGCGCTC GGCGGACCGC CGGTCGCTGG TCGACCCCGA GCGTGCCGCA
ACCGACCTCC TGCGCCCGCT CGGGGACAGC GGTCCCGTTC TGCTCGGGGG TGCCGCCTAC
AGCCGGGAGG GGCTGATTGC CCGGTTGGTG TCCTACCTCG TCACGGCCGC CACCGAGCAG
CTCGGGTCGG AACCGGACCA GGTCGTCGTG ACGTTCCCGA CGTTCTGGTC GGCGGCCCGG
CGCGAGGTGT TCGCGGACGC CGTGAGCCAG CTCTCCGATG TCACGGTGCC CGTCGCGGCG
TTCCCGGCCG CCGACGCGAT CGGAACCCTG CTGTCTCACG GCTCCTCGAC CCAGACCGTC
GAGTTGGTCG GATGGTACGA CTTCGGCGCC GGCTTCCTCG ACACGGGGAT CCTGTCCTTC
TCGCCGTTCG GGTTCCAGCT GCTCGGCTTG GCCGCGGGTC TGCGCCACGG CGCCGGCCTC
GAGCTCGACG AGCTCCTGGT CGACCGGGTG CTGACCGGCG CGGGTGTCGA GCGTGCGGCG
CTCGACCGTT CCGACCCGAT GATCCGGACC GCGCTCGGCA GGCTGCGGCG GGAATGCGCG
CAGGCGAAGG AGATCCTTTC CGGTGAGGAC GAGGTTGACG TTCCGGTCTC CCTTCCCGGA
GCCGATACCT TCGTCACGCT GAGCCGGGCG GAGCTGGAGA CCCTCATCGG CCCGATTGTC
GATGACACAA TCATGACCTT TCGCCGGGCG CTCCGCTCGG TGCCGGTGAC CTCCGCCGAG
CTGTCCCGGA TCCTGTTGTC CGGCGGGATC TCCCACCTTC CGCTGGTGGC CCGCCGGCTG
CGGGAGACCT TCAGCGGGAT CGGCCGGATC GACCACGGTT CGGACGCGGA TGTGGCGATG
GGCGCGGCAC TGCTCGCCGC CGACCTCGCT GACCGGTTCG GCGCCGTCGG CGCCGGGAGT
GCACCGGGTG CACCGGGTGC ACCGGGTGTT CCGGGTGTTC CGGGTGTTCC GGGTGTTCCG
GGTGCGCCCG CGGCCTTTGG AGCCGTCGGG GCGGTGACCG CCGGCGGTTT CGGCGCGTCG
GGCGAGGTCG GTGACGCCCA GGAACCCTCT GCTCAGGAAC CCTCCGACGT CACCGCGGTG
ATCCGGCCGC CGGACGCCGC GGACCTGGTC TCGCGGCCGC CCTTCCTGTC CGGGCCGAGG
GGGCTGTCCG GGCCGAGGGG GTTCGCGGGC GCGGCGGGTC CGGCCGGGAT CCGAGGCGTG
CCGGAGGCGG ACGGCGTGAC GGCCGCGGCG ACGGTGCTCA GCGGCCCGGA CGGCTCCGCG
CCGCACGGCT TGGGCGCCGG GGACGACACG ATGATCTCCT CGGGTGGCGC GGCGGATTCC
CCGCCGCCGG GGATCGTCGT CGGCGGGCCG GGCCCGTCCG ACACGTCCGG TCACCGGATC
CCACCCGGCC TCATGACGGT GGGCGCCGAT CGGGACCACG TTCTGCTCGG GCAGCCGGGG
AAGCTCGACC TGCCCTCGGA CTTCTCGACG GGCGGTGTGG GCGGCCCGCA CGGTGCCTCC
GCCGGGGACG GCCGGGGAAT CTTCGGCTCC CGGCGGGCCG CGGTCGTCGC CGCCATTGTC
GTGGTCCTCT TCCTGGCTCT CGGGACGACC TTCGCCGTGG TGCTGACCGG CCGTGACTCC
GGGGCGGGCT CCGGGGCGGA TGCGGTGGCC GCCCCGGCGG TCACCGCGTC GCCGGCCCCA
TCCGCCACCG GCCCGACCGG GCCACCCGCC TCCGCCGCGA ACCTCGTCCG GGTGGCGGGA
TCCTCGGAGG TCGCCCCGAT TACCGAGACC GCCTATAACG AATTTCGTCA GGTCCAGCGC
AACGTCACCG TCAGCATCGA GTCCACGACG ACCGAGGACG GTTTCGCCGC GCTGTGCAGC
GGCAAGGCCG ACATCGCCGA CGCCTCGTTC GAACTCAATC CCGGGTTCAT CAAGAACCCC
GACTGCGAGA AGAAGGTCGT CGGGTTCGAG GTGGCGCACC ACACCCTGCC GATCGTGGTC
AACCCGCGGA ACACCTGGCT GCACTGTCTG ACCCTGCAGC AGGTCAAGCA GGTGTGGGGG
GCCGGTTCCG CCGTCACCCG GTGGAGCCAG ATCGACCCGT CGTTCCCGGA CGAGCCGATC
ACGTTTGTCG GGCCGCCGCG CGGCTCCGTG CAGGCGCAGG TGTTCAACGC CACGATCAGT
GACGCCAGTG ACCGGTCCCG CGACTACCGG CAGACCGATC TCAGCGGGGT CGCCAACGAC
GTGGCCGCCG ACCGCTCGGC CATCGGCTAC CTCGACTTCC CCACCTACGA GACCTTCGGC
ACCAAGCTGC GCGGTGTCGA AATCAACAAT GGTGACGGGT GCGTCGCGCC GAACGCCGTG
TCGGTCGGTA CCGGCCTCTA CCTGCCGCTG TGCAAGCCGC TGTTCGTCTA CGCCCGCACG
GACGCCCTGC GCCGGCCAGC GACCGCCGCC TTCCTGCGCT ACTACCTGGC GAACGGCCGG
AAGATCGCCT TCGACGCGCA CTACGTCCCC CGCAACGACG ACACGGTCGG GGAGAACGTC
GCCAAGCTCG CGAGCCTGAC GGCCGGCGTG GGACCCGTAC CGGCCTAG
 
Protein sequence
MAPVAVMIAW VSGDAVSYQL GIDLGSANTI VGVADGGWPR VLELSGQRRL PSVIYAAPAG 
GLQFARSADR RSLVDPERAA TDLLRPLGDS GPVLLGGAAY SREGLIARLV SYLVTAATEQ
LGSEPDQVVV TFPTFWSAAR REVFADAVSQ LSDVTVPVAA FPAADAIGTL LSHGSSTQTV
ELVGWYDFGA GFLDTGILSF SPFGFQLLGL AAGLRHGAGL ELDELLVDRV LTGAGVERAA
LDRSDPMIRT ALGRLRRECA QAKEILSGED EVDVPVSLPG ADTFVTLSRA ELETLIGPIV
DDTIMTFRRA LRSVPVTSAE LSRILLSGGI SHLPLVARRL RETFSGIGRI DHGSDADVAM
GAALLAADLA DRFGAVGAGS APGAPGAPGV PGVPGVPGVP GAPAAFGAVG AVTAGGFGAS
GEVGDAQEPS AQEPSDVTAV IRPPDAADLV SRPPFLSGPR GLSGPRGFAG AAGPAGIRGV
PEADGVTAAA TVLSGPDGSA PHGLGAGDDT MISSGGAADS PPPGIVVGGP GPSDTSGHRI
PPGLMTVGAD RDHVLLGQPG KLDLPSDFST GGVGGPHGAS AGDGRGIFGS RRAAVVAAIV
VVLFLALGTT FAVVLTGRDS GAGSGADAVA APAVTASPAP SATGPTGPPA SAANLVRVAG
SSEVAPITET AYNEFRQVQR NVTVSIESTT TEDGFAALCS GKADIADASF ELNPGFIKNP
DCEKKVVGFE VAHHTLPIVV NPRNTWLHCL TLQQVKQVWG AGSAVTRWSQ IDPSFPDEPI
TFVGPPRGSV QAQVFNATIS DASDRSRDYR QTDLSGVAND VAADRSAIGY LDFPTYETFG
TKLRGVEINN GDGCVAPNAV SVGTGLYLPL CKPLFVYART DALRRPATAA FLRYYLANGR
KIAFDAHYVP RNDDTVGENV AKLASLTAGV GPVPA