Gene Franean1_3800 details


Gene Information

Locus tag: Franean1_3800
Symbol:
ID: 5672164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4506842
End bp: 4509484
Gene Length: 2643 bp
Protein Length: 880 aa
Translation table: 11
GC content: 77%
IMG OID: 641242679
Product: heat shock protein 70
Protein accession: YP_001508099
Protein GI: 158315591
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
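
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4506842
end_bp = 4509484

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (2643 bp) and Protein Length (880 aa) fields.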


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.122258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTACG CGCTCGGCAT CGACGTCGGC ACGACGTTCA CCGCCGGTGC CATCTGGCGG 
GATGGCCGGG CCGAGGCGTT CGGGCTCGGC ACGCACTCCA CCGCGGTCCC CAGCGTGCTG
TTCCTGCGGG ACGACGGCGT GATGCTGGTC GGGGAGGCAG CCGAGCAGCG GGCGGTGACC
GAGCCGTCCC GGGTCGCCCG CGAGTTCAAG CGGCGGTTCG GCGACGACGT GCCCGTCCTG
CTCAGCGACA CCTGGGTGAC CGCCACCGAA CTGTTCGCCG ACATGATCCG CTTCGTGGTC
GGGAAGGTGA CCGAACGCGA GTCGCAGGCC CCCGGCTACG TCATGCTGAC CTGCCCGGCC
ACCTGGTCCG ACCACCGCAG GGGGCTGATG GAGGACGCCG CCGGCCTGGC CGGGCTCGGC
CAGGTGGGCC TGGTGGCCGA GCCGACCGCC GCGGCGATGT ACTACGCCGC CCAGGAAAGG
CTCGAGCCGG GCGCCCTGCT CGGCATCTAT GACCTCGGCG GCGGGACCTT CGACGCGACC
GTGCTGCGCA AGACCGCCGG CGGATTCGAG TTGTGCGGCG ATCCCGGCGG CGACGACGAG
ATCGGCGGGG TGGACGTCGA CCAGGCGGTC GTCGACCACA TCGCGCGGGC GGTCGGCCCG
TCCTGGCACG AGCAGGACAC CTCCGACCCG GCGACCGCGC GAGCGCTCGC GGCCGTCCTC
GCCGCCGCGG TCACCGCCAA GGAGACCCTG TCGCAGGACC TCCAGGCGGA GATCCCGGTC
ATACTGCCCG GCTGCAACAA GGTCGTCCGG ATCACCCGTG ACGATCTCGA GGACGCGGTG
CGCATCCTGG TCCTACGCAC CGTGGACGCC TTCCGCCGGA CCGTCCGCGC CGCCGGCGTG
GAGGTGTCGG ACCTCGCACG CGTCCTGCTG GTCGGCGGTT CCAGCCGGAT CCCGCTGATC
GCCCGCATGA TCGAGGACGA CCTGCGGGTG CCGGTTGCCG TCGACACGCA TCCGAAGCTC
GCGGTCTGTC TCGGTGCGGC GATCGCCGCC GGCCCCCGGG TGGCCACCGG GGCGCTCGGG
GCGGCGGCCC CAGGGACCGC CGCCGGTCCC GCACCGTGGA CCGCCCCGCC GGTGGGCACG
CCACCGTCGC CCCGGCCGGC CGACGCGCCG GCGCCGCGGC CCGGCGCTCC GCAGCCCGCG
CCGTGGCCGG ACGGCACGCC CGCGGGGGTT CCGACCGGTG GCCCGGGGGG CGAGCCGACC
GACGTTCCGC GGCGCGTGCC GGAACCGGTC GAGGCGACCG CCGCCCGCCG CGCCGCCGAC
CTGGTGGCGC CGGCACCGGC CGGTCCGCCC GGCGGCGCCC GCTCGGAGGA ACAGGTCCGG
CTCGACGTCG ACCTGGCCGG CGCCGGCCTG GCCGAACCGT CCGACCAGCC GCTACGCCCC
GCGGTCATGC CGACGCGCGC GGTCCGGCTG GCCGACCGCG ACGTCCCCCT CGTCGTCCGG
ACCGCCGGCG ACGCGTCCTA CCGGCAGGCC GGCCGCCGCA CCGCGGCCGT GCTGGGGGCG
GTCGCGGTTG TCGCGGTGCT CGCCGCCGCG GCGATCGGCG TCCTGCTCGG CCTCGGCGGC
GGCTCGGCCG GACCGGAGCC CCCGCCCCGC ACGACGGCGC CAGGCGCGGC CAGCACCGCG
GCGGCGGCGA CCGCCCGAGT GGCCCGGCTG GCCGGCGCTC CGCTACCGGC CGGCGGGAGT
GGCGGTGGCT CGGTGGGCGC CGCCCTCGCC GTCGCCGCCC GACCCGGTGG CGGGCTGGTG
GCCGTCGGCG CCGCCGGATC GCCAGACCCC GCTGGACGCA CGCCATCCGC ATGGTGGACC
GGCGACGGTA CGACCTGGCG GCTGGCGGCG GTGCCACTGC CGGCCGGCAC GACCGTGGGC
ACGATGAGCG GCCTGGCCTC CACCGGGGGA CGCCTGGTGG CGGTCGGCTG GGTTGGTTCC
GGGGATACCA CCAGCGCCGC GGTCTGGGTC TCGGACGACG GGCAGGCCTG GCGGGCCGGG
TCCGTGGGCG GGGCCGCGTC GTCGAGCATG CGTGACGTCG TCGCCCGTGC CGGCGGGCTG
CTTGCCGTGG GTCAGGACGA CGGTTCGGAC CCTGAGGGCG ACGGCGCGGT GTGGACGTCG
GCGGACGGCA GCGACTGGCA GCGGGTCGGC ATCTCCGGGG CAGACGGGCT CGGCACGCAG
ACCCTGCATC GGGTCGTTTC CCTGGCCGGT GGCGGCCTGC TCGCCACGGG GCAGGAACCG
GAGGGCGCCG GCACCGTCGC ACGCGTCCGG CAATCGGCGG ACGGGTCGAG TTGGACCGGG
GTGGAGACCG ACCTGCCGCT CGACGCCGAG GTGACCGGGC TTGCCATACT GCCGGACGGC
CGGCTGGTCG GGGCCGGGTC GGTCCCGCAC GCCGGCGGGC GGCAGCAACA AATCTGGGTG
GCGGATGCGA CCGGCCGCTC GTGGGCACCC CAGGACGCGC TGACCGCAAC GGGCCAGTCG
GGGACCGGGA TCGACATCAC CGGGGTGGCC GTGGCGGGCA CGCTGGTCGC TGCCGGCAGC
ATCGACGGCA CGGACGGACC CGCCGCGGCC TCCTGGTCCG TCACCCTCGA CCAGCCGCGC
TGA
 
Protein sequence
MAYALGIDVG TTFTAGAIWR DGRAEAFGLG THSTAVPSVL FLRDDGVMLV GEAAEQRAVT 
EPSRVAREFK RRFGDDVPVL LSDTWVTATE LFADMIRFVV GKVTERESQA PGYVMLTCPA
TWSDHRRGLM EDAAGLAGLG QVGLVAEPTA AAMYYAAQER LEPGALLGIY DLGGGTFDAT
VLRKTAGGFE LCGDPGGDDE IGGVDVDQAV VDHIARAVGP SWHEQDTSDP ATARALAAVL
AAAVTAKETL SQDLQAEIPV ILPGCNKVVR ITRDDLEDAV RILVLRTVDA FRRTVRAAGV
EVSDLARVLL VGGSSRIPLI ARMIEDDLRV PVAVDTHPKL AVCLGAAIAA GPRVATGALG
AAAPGTAAGP APWTAPPVGT PPSPRPADAP APRPGAPQPA PWPDGTPAGV PTGGPGGEPT
DVPRRVPEPV EATAARRAAD LVAPAPAGPP GGARSEEQVR LDVDLAGAGL AEPSDQPLRP
AVMPTRAVRL ADRDVPLVVR TAGDASYRQA GRRTAAVLGA VAVVAVLAAA AIGVLLGLGG
GSAGPEPPPR TTAPGAASTA AAATARVARL AGAPLPAGGS GGGSVGAALA VAARPGGGLV
AVGAAGSPDP AGRTPSAWWT GDGTTWRLAA VPLPAGTTVG TMSGLASTGG RLVAVGWVGS
GDTTSAAVWV SDDGQAWRAG SVGGAASSSM RDVVARAGGL LAVGQDDGSD PEGDGAVWTS
ADGSDWQRVG ISGADGLGTQ TLHRVVSLAG GGLLATGQEP EGAGTVARVR QSADGSSWTG
VETDLPLDAE VTGLAILPDG RLVGAGSVPH AGGRQQQIWV ADATGRSWAP QDALTATGQS
GTGIDITGVA VAGTLVAAGS IDGTDGPAAA SWSVTLDQPR
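
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustration only: the first two 10-base blocks of the gene sequence above.
example = clean_sequence("ATGGCCTACG CGCTCGGCAT")
print(len(example), gc_fraction(example))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` of that string can then be compared with the reported GC content.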