Gene Franean1_1569 details


Gene Information

Locus tag: Franean1_1569
Symbol:
ID: 5669972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1874776
End bp: 1876767
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 71%
IMG OID: 641240488
Product: chaperone protein DnaK
Protein accession: YP_001505914
Protein GI: 158313406
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
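
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The short Python sketch below shows the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1874776, 1876767)
protein = protein_length_aa(length)
print(length, protein)
```

With the coordinates from this record, the sketch reproduces the listed 1992 bp and 663 aa.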


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.349851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.104674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGAAAGG CAGTTGGAAT AGACCTGGGG ACGACCAATT CGGTCATCGC GGTCGTCGAG 
GGCGGCCAGC CGACGGTCAT CCCCAACGCC GAGGGCTCGC GCACGACGCC GTCGGTGGTC
GCGTTCACCG AGGGTGGCGA GCGGCTGGTG GGGGAGCTGG CCCGACGGCA GTCGATCCTG
AACCCCAAGG GCACGATCAC CTCGGTCAAG CGCTTCGTCG GACGTCGCCA CAGCGAGGTC
GCCAGCGAGC TGCGGACGGT CACCTTCGAT CTCACTCCGG GCAAGGACGA CGCCGTGCGG
ATCACGGTGC GCGGGCGGCA GTACGCGCCC GAGGAGATCT CCGCGATGGT GCTGCGCAAG
CTCGTCGACG ACGCGGCCCG CTTCCTGGGG GAGAAGGTCA CCGAGGCCGT GATCACCGTC
CCGGCCTACT TCAACGACGC CCAGCGCCAG GCGACCAAGG ACGCCGGCCG GATCGCCGGC
CTGGAGGTGC TGCGGATCAT CAACGAGCCG ACCGCCGCGG CGCTCGCCTA CGGGATGGAC
AAGCGCTCGC ACGAGACCGT CCTGGTTTTC GACCTCGGTG GCGGAACGTT CGACGTGTCC
GTTCTCGACG TCGGCGACGG GATTGTCGAG GTGCGGGCGA CCGCCGGTGA CACCCACCTG
GGCGGAAACG ACTGGGACCG CCGCCTGGTC GACTTCCTCG CCGACGACTT CAAGAACCGG
AACGGCATCG ACCTGCGTGA CGATCCGCAG GCGTTGCAGC GGTTGTTCGA GGCCGCGGAG
AAAGCCAAGA TCGAGCTTTC CTCGGTGAGT CAGACTCAGG TCAACCTGCC GTTCATCACC
GCCGACTCGA ACGGCCCGAA GCATCTCAAC ACCACGGTAA CCCGGTCACA GTTCGAAATG
AACACGTCCG ACCTGCTCGA GCGGTGCATG CCGGCGGTGC GGCGGGCGAT GGCCGACGCG
AAGATCGCCG AACCGGACGT CGACGAGGTC ATCCTGGTCG GCGGTGCCAC CCGGATGCCC
GCGGTACAGG CCGCGGTGCG CCGGCTCACC GGCGGCAGGG ACCCGAACAT GACCGTCAAC
CCGGACGAGG TCGTCGCCGT CGGCGCCGCC ATCCAGGCCG CGGTCCTCAA GGGCGAGGTC
TCCGACGTGC TGCTGCTCGA CGTCACCCCG CTCTCCCTCG GGGTGGAGAC CCTCGGCGGC
GTCAGCACGA AGGTGATCGA GCGCAACACG ACGATCCCCG CGCGGCGGAC GGAGACGTTC
TCCACCGCGG AGGACGACCA GTCCGCCGTG GACATCGTCG TCCTGCAGGG CGAGCGCGAG
ATGGCAGCGG ACAACCGGAC CCTCGGCCGG TTCCGGCTGG AGGGCATCCG GCCCGCGCCA
CGCGGCCAGG CCCAGGTGGA CGTCACCTAC GACATCGACG CCAACGGCAT CCTGAACGTC
ACCGCGCGCG ACACCGACAC CGGCGCCGAG CAGCGCATCA CCATCTCGGA CAACACCAAC
CTGCCCGCGG ACGAGATCGA GCGGATGGTG GCCGACGCCG AGCGTAACCG CGCTGAGGAC
ACCCGGCTGC GAGAGAACGC CGACGCCCAA AACCAGCTCG ACACGATCGC CTACCAGGTC
CAGCGTCGGC TGTCCGAGCT CGGCGACGAC GTTCCGGCGC ACGAGCGGGC CCGCGCCGAG
CAGCTCGTCG CCCACGCCCA CCGGGCCCTG GCGGAGAACG CAGGCGCCGA CCGGGTCCGC
CCGATCGTCA ACGATCTCCA GCAGGTGCTG TACGCGCTGC CCGCCCCCGG CGCGCGTGAG
TCGGCGGCCG CGCCGGGCGG CGGTGGACGT TCCGCGGCGG CCGGGCCGGG CACCGGCGGC
CCGGGTGGCG CCGGGCCGGG CGCGAGTGGC ACCGGTTCGG GCGCGAGTGG CACCGGTTCG
GGAGCCGGCG GGCACGGCGT CGGCGGGGAC GATGTGATCG ACGCCGAGTT CACCACCCGC
GATGATGACT GA
 
Protein sequence
MGKAVGIDLG TTNSVIAVVE GGQPTVIPNA EGSRTTPSVV AFTEGGERLV GELARRQSIL 
NPKGTITSVK RFVGRRHSEV ASELRTVTFD LTPGKDDAVR ITVRGRQYAP EEISAMVLRK
LVDDAARFLG EKVTEAVITV PAYFNDAQRQ ATKDAGRIAG LEVLRIINEP TAAALAYGMD
KRSHETVLVF DLGGGTFDVS VLDVGDGIVE VRATAGDTHL GGNDWDRRLV DFLADDFKNR
NGIDLRDDPQ ALQRLFEAAE KAKIELSSVS QTQVNLPFIT ADSNGPKHLN TTVTRSQFEM
NTSDLLERCM PAVRRAMADA KIAEPDVDEV ILVGGATRMP AVQAAVRRLT GGRDPNMTVN
PDEVVAVGAA IQAAVLKGEV SDVLLLDVTP LSLGVETLGG VSTKVIERNT TIPARRTETF
STAEDDQSAV DIVVLQGERE MAADNRTLGR FRLEGIRPAP RGQAQVDVTY DIDANGILNV
TARDTDTGAE QRITISDNTN LPADEIERMV ADAERNRAED TRLRENADAQ NQLDTIAYQV
QRRLSELGDD VPAHERARAE QLVAHAHRAL AENAGADRVR PIVNDLQQVL YALPAPGARE
SAAAPGGGGR SAAAGPGTGG PGGAGPGASG TGSGASGTGS GAGGHGVGGD DVIDAEFTTR
DDD
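
The sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that layout and run basic sanity checks: GC fraction, start codon, and stop codon. It is a minimal illustration with hypothetical helper names. The start-codon set lists only the common bacterial initiators (translation table 11 permits others), and only the first two blocks and the final line of the gene sequence are pasted in as examples.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # common initiators; not the full table 11 set
STOPS = {"TAA", "TAG", "TGA"}


def normalize(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, ignoring any trailing 1-2 bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks and the final line of the gene sequence above.
head = normalize("GTGGGAAAGG CAGTTGGAAT")
tail = normalize("GATGATGACT GA")

starts_ok = codons(head)[0] in COMMON_STARTS  # GTG is read as Met at initiation
stops_ok = codons(tail)[-1] in STOPS          # the gene ends in TGA
print(starts_ok, stops_ok)
```

Passing the full normalized gene sequence to gc_fraction gives a value that can be compared with the GC content reported in the record.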