Gene Franean1_3387 details


Gene Information

Locus tag: Franean1_3387
Symbol:
ID: 5671758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4011695
End bp: 4013626
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 65%
IMG OID: 641242275
Product: heat shock protein 90
Protein accession: YP_001507695
Protein GI: 158315187
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0326] Molecular chaperone, HSP90 family
TIGRFAM ID:
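
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 4011695
end_bp = 4013626
reported_gene_length = 1932      # bp
reported_protein_length = 643    # aa


def gene_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # drop the stop codon


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1932 643
```

Under these assumptions, 1932 bp corresponds to 644 codons, one of which is the stop codon, leaving the 643 residues reported.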


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAATC GGATCGAGAC GCTTGAATTT CAGGCCGAGG CCCGTCAGCT TCTACAACTG 
ATGGTCCATT CGATCTACTC CGACAAAGAT ATCTTCCTGC GCGAGCTGAT CTCCAACGCG
TCCGACGCCC TCGACAAGCT GCGCATCGCC TCACTTGTCG ACAAGGACCT TTCGACTGAC
ACGTCAGACC TGCACATCGC CCTCGAGGTC GACCGGGAGC AGCGCACCCT CACCGTGCGC
GACAACGGCA TCGGCATGTC CCGCGACGAC GTCGTCCAGC TCATCGGCAC CATTGCCAGG
TCGGGAACGG CCGAGATGCT CCGGAAGGCG AAGGAGTCGA AGGACGCCGC CGCGTCGCTG
GACCTCATCG GCCAGTTCGG TGTCGGTTTC TACTCGACCT TCATGGTGGC GGACAAGGTC
ACGCTGCTGA CGCGCCGGGC CGGCGAGAGC GGTGGCACGC GCTGGGAGTC GGGCGGCGAG
GGCACGTACG TCATCGAGGA TGTCGACGAC GCGCCGCAGG GCACCGCCGT CACCCTGCAT
CTCAAGCCGG AGGACAGCGA GGACCACCTC TTCGACTACA CCGCCGAGCG GAAGATCCGG
GAGATCGTCA AGCGGTACTC CGACTTCATC GCGTGGCCCA TCCGAATGGC TACAGAGAAC
CCGGACCAGG CAGCGAGCAC GGACGAAGAC GGCACGGCCA CGACCGACAT CCAGACGCTC
AACTCGATGA AGGCGTTGTG GGCGCGCCCG CCCGCCGAGG TCGACCAGGC CGAGTACAAC
GAGCTGTACC ACCACCTCAG CCATGACTGG ACCGACCCGC TCGAGACCAT CCGCATGAAG
GGTGAAGGCC TCTTCGAGTA CGAGGCGCTG CTCTTCATCC CGTCCCACGC GCCGTTCGAC
CTGTTCGCGC AGGACGGCAA GCGTGGCGTC CAGCTGTACG TGAAGCGCGT GTTCATCATG
GACGACTGCC AGGCGCTCAT GCCGAACTTC CTGCGGTTTG TCAAGGGTGT CGTCGACGCG
CACGACCTGT CGCTGAACAT CTCCCGCGAG ATCCTCCAGC AGGACCGCCA GATCCAGCTG
GTTCGCCGCC GGCTGGTGAA AAAGGTGCTT TCGACGGTCC GCGGCCTGCA GACGGACGAT
GCCGAACGCT ACCGGACCTT CTGGAAGGAG TTCGGGCCCG TCGTCAAGGA GGGGCTGCTG
GACGACGTCG ACAACCAGGA GGCGATCCTG GAGATCGTCT CCTGCGCCTC GACGCGCGAC
CCGGAGCAGA CGACGACCCT GCGGGAGTAC GTCGACCGGA TGAAGGACGG CCAGGACGAC
ATCTACTATA TGACCGGCGA CTCCCGGTTG ATGATCGAGA AGTCGCCACA CATCGAGGCC
TTCCTCGCCA GGGGGTACGA GGTGCTCCTC CTGACGGACC CGGTCGACGA GGTGTGGGTC
GAGCGGGTGG CGCGGTTCGA CGGCAAGCCG CTGCGGTCGA TCGCCAAGGG CCAGGTCGAT
CTCGACACCG CGGCCGAGAA GGACAGCACC GAGCCCGAGC GGGAACAGCA GCGGAAGGAT
TTCGCCGAGC TGCTGTCCTG GCTGACCCGG ACCCTGCACG AGCAGGTGAA GGAGGTCCGC
CTCTCCTCCC GCCTCACCAC GTCGCCGGCC TGCATCGTGG GGGACACCCA CGACATCACG
CCGACCCTGG AGAAGATGTA CCGCTCCATG GGGCAGAACG TGCCGCATTC CAAGCGCATC
CTGGAGCTCA ATCCGACGCA TCCACTGGTG GAGGGGTTGC GGAAGGCCCA CGAGCAGGAC
GCGAGCTCCG ACGTTCTCAG CGAGACGGCG GAGCTGCTGT ACGGGATGGC GCTGCTGGCC
GAGGGCGGGG AGCTCGCCGA CCCGTCCCGC TTCACCCGCA TTCTCGCCGA CCGGCTAGCG
AACACCCTGT AG
 
Protein sequence
MSNRIETLEF QAEARQLLQL MVHSIYSDKD IFLRELISNA SDALDKLRIA SLVDKDLSTD 
TSDLHIALEV DREQRTLTVR DNGIGMSRDD VVQLIGTIAR SGTAEMLRKA KESKDAAASL
DLIGQFGVGF YSTFMVADKV TLLTRRAGES GGTRWESGGE GTYVIEDVDD APQGTAVTLH
LKPEDSEDHL FDYTAERKIR EIVKRYSDFI AWPIRMATEN PDQAASTDED GTATTDIQTL
NSMKALWARP PAEVDQAEYN ELYHHLSHDW TDPLETIRMK GEGLFEYEAL LFIPSHAPFD
LFAQDGKRGV QLYVKRVFIM DDCQALMPNF LRFVKGVVDA HDLSLNISRE ILQQDRQIQL
VRRRLVKKVL STVRGLQTDD AERYRTFWKE FGPVVKEGLL DDVDNQEAIL EIVSCASTRD
PEQTTTLREY VDRMKDGQDD IYYMTGDSRL MIEKSPHIEA FLARGYEVLL LTDPVDEVWV
ERVARFDGKP LRSIAKGQVD LDTAAEKDST EPEREQQRKD FAELLSWLTR TLHEQVKEVR
LSSRLTTSPA CIVGDTHDIT PTLEKMYRSM GQNVPHSKRI LELNPTHPLV EGLRKAHEQD
ASSDVLSETA ELLYGMALLA EGGELADPSR FTRILADRLA NTL
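
The gene sequence starts with GTG while the protein starts with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the start of a CDS, although it codes for valine elsewhere. The sketch below illustrates this on the first and last codons of the sequence above. It is a hypothetical helper written with the standard library only, not the tool that produced this record, and it assumes the first codon passed in is a valid initiator whenever `is_cds_start` is true.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; it differs
# in which codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna, is_cds_start=True):
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks shown in the record
    can be pasted directly. If is_cds_start is true, the first codon
    is emitted as M (assumes it is a valid initiation codon).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if i == 0 and is_cds_start:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_codons = "GTGAGCAATC GGATCGAGAC GCTTGAATTT"
last_codons = "AACACCCTGT AG"

print(translate(first_codons))                      # MSNRIETLEF
print(translate(last_codons, is_cds_start=False))   # NTL
```

The two outputs match the first ten and last three residues of the protein sequence, and the final TAG is the stop codon that is not counted in the protein length.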