Gene Franean1_2384 details


Gene Information

Locus tag: Franean1_2384
Symbol:
ID: 5670780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2838708
End bp: 2842106
Gene Length: 3399 bp
Protein Length: 1132 aa
Translation table: 11
GC content: 72%
IMG OID: 641241301
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001506722
Protein GI: 158314214
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
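
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence given under Sequence does end in TGA). The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 2838708
end_bp = 2842106
gene_length_bp = 3399
protein_length_aa = 1132

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 3399 bp is 1133 codons, and dropping the stop codon leaves 1132 amino acids.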


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.878673
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGGA ATTGGCCCAT CCGCTCGAAG CTAGTGGTGA TTCTCATCGT CCCGCTGGCC 
GCGCTGGCCG TGCTGAGCGC GATACAGGTG CGCGGGGACG TCGACAACGT CCGGGCCGCG
GGCCGCATCG AGGCGCTCGC CGACTTCTCG ATCAAAGGTA ACAACCTGGT CCACGTGCTT
CAGCGGGAGC GCTATGCGAC CAACTCTTAC GTTGGATCGA ACTTCCTCAT CCCGATGCTG
GGCCAGCAGG CTGTGGATGC CCGCAAGCCC GTGGACGACG CCCTCGCCAT CTACCTGGCC
GACGAGCAGG CACTGCCCGC GTCCGCGCGC GACGCCCTCG GCTCGACACT GAGTTCCATC
CGTACGCATC TCGACCAGCT CGGCGAGCAC CGCAAGCAGA TCGACGCGCA CCAGACCGAG
ATCGCGTCGA ACGGGGCCTT CTACAACGAG GCCGTGAACG AGCTGCTGCT GCTCAACGCG
CGGATCGCGG CCGGTAGCTC GGACGCCAAG CTGGTCAACG GCTCTACCAC GCTGGCCGGT
ATCGCCCAGG CCAAGGAACA AGCCTCCCAG CAGCGTGGAT CCGTCGCCCA GATGCTCTCC
ATGGGCAACC CCACGCCGCA GATGCTGCGT GCTCTGCTGG CCTACGCCGG TGCCGAGACC
GCCTGGACCG AGCAGTTCCG CTCCACGGCG ACGCCGGCCG AGCTGGAGCT GTTCACGAAC
ACGGTCGGGC GCTCTGCCAC CGGCGTCAAC CAGATGCGCG ACGACGTGGT CAACAGGTTC
AGCGGGCAGC AGGCTCCCGA CGTCGGCCTC GAGGCCTGGC GGACCGCCGC CGACAGCAAG
ATCGCGCAGA TGCACGACGT CGAGGTGCAG ATCGCCAACG ACCTGGCGGC GACAAGTCGC
GACATCGCCT CGACGGCCTC CCGGGACGCC CTGCTCGGAA GCATCGGCGT GGCGATCATT
CTCGTGCTCT CGGTCGTGAT CTCACTGCTC GTCGCGAGCC CCATGATCGG TCAGCTCCGC
CGGCTGCGGG GTGGCGCGCT CGACGTCGCC AACGACCGGC TGCCCAGCGT CGTCGAGCGG
CTGCACCGGG GTGAGCCCGT CGACATCGAC GCGGAGTCCT TCCCGGTACC CGCGACCAGC
AAGGACGAGA TCGGCCAGCT GGCGGAGGCG TTCGCCACGG TGCACGAGGT GGCGGTGCGC
ACCGCCGTCG AGCAGGCCGC GATGCGCAAG AGCATCGGGG ACACCTTCCT CAACCTCGCG
CGCCGCAGCC AGGCCCTGAT CCACCGCCAG CTGAAGATCA TTGACGCGCT GGAGCGCAAG
GAGACCGACC CGGACGAGCT CGAGGAGCTG TTCCGGCTCG ACCACCTCGC GACCCGTATG
CGGCGCCACG CCGAGGACCT CATCGTCCTC TCCGGCTCCA AGCCCGCCCG TGGCTGGCGC
CGGCCGGTTC CGATCAAGGA CGTCGTCCGC GGTGCCGTCG CCGAGGTCGA GGACTACACC
CGGGTCAAGG TCCTGCCCAT CACCGGTGGC GCCGTCTCCG GCCACGCCGT CGGTGACGTG
ATCCACATGT TCGCCGAGCT GATCGAGAAC GCGACCTCCT TCTCGCCCCC GCACACCCCC
GTGCAGGTCG CCGGGCACCC GGTCTCCAAC GGCTTCGTGG TCGAGATCGA GGACCGCGGG
CTCGGGATGA ACCAGGAGGA GATCGACGCG CTCAACAACC GGCTGGCGAA CCCGCCGCCG
TTCGACCTGT CGACCAGCGA GCGGCTGGGC CTGTTCGTGG TCAGCCGGCT GGCGGAGCGG
CACGACGTCC AGGTACGGCT GCGGCCGTCG CCGTACGGCG GCACGATGGC GATCGTCCTG
CTGCCCGCGA CCCTGCTGCG CGCCTCCGAC GACAAGGGCG AGGACAGCGG CCCGCGCGAG
TTCGCCGCGG TCGGCGCGTC GAGCGTCGGC GCCCAGGGCT CGGACCTGGC CGGCTCGCTC
AACGGCACCA GCGAGCGGGT GGGGGCCGGC GCGTCCGGCC CGGAGAGCAA CGGCCACTGG
GCGGGAAACA TCCCCGGGGA CACCCTGGAC GCGGCGGCGC TGTCGACGGC CGGAGCCGTC
AACGGTGCCG GCCCCAGCAC CGACGAGACG CTGATCGACG ACCTCCCGGT CTTCGCCACG
GCGCGGTCGA GCTGGTTCGT GGCGGAGAAG CCCCGTGGCA GGAACCGGAC AGACGACGAG
GGGGACGACG ACGGGCCGAT CCCGCCGCGC CGCCCCGGCG AGCTGACCGG TCCGACGCGG
CGGGCCGAGC TCCCGCCGTC CCAGGACGGA GCCGGCTACC CGGAGGACTC GGGGGACGCG
GGCTGGTCCG ACGGACGGTC CGACCCGTCG GCCTCACTGT TCGGCCCGCC TCCGGGCGGC
GGCGAGTTCG GCGGCTCGCT GTTCAGCGCT CCGCCTGACA CCGGTCAGTT CAGCGGCTTC
GGTCCGAACG GCTTCGGTGA CGGCGGCGGC TACGGTGACG CGGGCTTCGG TGACAGCGGC
GGCTACGCCG GCGGCTTCGG TGGCGGCGCG ACCGATGTCG GCGCCAACGG CATGGCCGGC
AACGGATCGG GTGGCTTCGC TGACTCGGAT TACCACACCG CCGAGTTCAG CCGGTCCGGT
TTCGGCACCG GCGGCGGCTT CGGGAACGAC AGCGGTCACG GGAACGACGG CGGTCACGGG
AACGACGGCG GTCCCGGGAA CGACGGCGGC CTGGGGCAGG AGACCGGCGA CGGCTTCGGC
TCGGCCGGTG GTTTCGGTTC GGGTGACGGC TTCGGTTCGG CTGACTCCCA GCCCGGGGCC
GGGCTGCCGT CCCGCTCCCG GACCGGGCGG GCCTCGGGCG ACTACGCCCG CGCCGACTAC
ACGCCACCGG GCCAGCAGGG CCCGGCCGGG GACGGGACCG GCTACCGCGA GCCCCGCGCG
GACCGCGGCC AGGAATTCGG CCGTCCCATG CGGGAGAACG GGTACGGCGG GGCGGAACAG
GCCTACCCCG GACGCCAGGA CGGCGGCATC GGCGACGGCG CCTTCCGCCG CTCGGCCCGG
TCGTCGCACC CGGGGCAGTC CACCGGTGGG GGCTCCGCCT ACCCCGGGGC CAACCCGCCG
CCGGCGGCGC CGCCGTTCGA CACGGCAGCC GCCGCGGCGG AGGAGTCCGC CGACGACGGT
CTCGACGAGC TCGGCCTGCC CAAGCGGCGG CGGCGCGCGA ACCTGGCGCC GCAGCTGCGC
CGGGAGAACG CGGACGCGGG CCGTTCGTCG CTCGCCGCGG GCCCTCGTTC ACCCGAGGAG
ATCCGCAGCA TGATGTCGTC CTTCCAGGCC AACTTCGGCC GTGGGATTGA GGACGGAACG
GTGTCCAATG ACGGAGATGA TGTGAGGAAG GTAACCTGA
 
Protein sequence
MLRNWPIRSK LVVILIVPLA ALAVLSAIQV RGDVDNVRAA GRIEALADFS IKGNNLVHVL 
QRERYATNSY VGSNFLIPML GQQAVDARKP VDDALAIYLA DEQALPASAR DALGSTLSSI
RTHLDQLGEH RKQIDAHQTE IASNGAFYNE AVNELLLLNA RIAAGSSDAK LVNGSTTLAG
IAQAKEQASQ QRGSVAQMLS MGNPTPQMLR ALLAYAGAET AWTEQFRSTA TPAELELFTN
TVGRSATGVN QMRDDVVNRF SGQQAPDVGL EAWRTAADSK IAQMHDVEVQ IANDLAATSR
DIASTASRDA LLGSIGVAII LVLSVVISLL VASPMIGQLR RLRGGALDVA NDRLPSVVER
LHRGEPVDID AESFPVPATS KDEIGQLAEA FATVHEVAVR TAVEQAAMRK SIGDTFLNLA
RRSQALIHRQ LKIIDALERK ETDPDELEEL FRLDHLATRM RRHAEDLIVL SGSKPARGWR
RPVPIKDVVR GAVAEVEDYT RVKVLPITGG AVSGHAVGDV IHMFAELIEN ATSFSPPHTP
VQVAGHPVSN GFVVEIEDRG LGMNQEEIDA LNNRLANPPP FDLSTSERLG LFVVSRLAER
HDVQVRLRPS PYGGTMAIVL LPATLLRASD DKGEDSGPRE FAAVGASSVG AQGSDLAGSL
NGTSERVGAG ASGPESNGHW AGNIPGDTLD AAALSTAGAV NGAGPSTDET LIDDLPVFAT
ARSSWFVAEK PRGRNRTDDE GDDDGPIPPR RPGELTGPTR RAELPPSQDG AGYPEDSGDA
GWSDGRSDPS ASLFGPPPGG GEFGGSLFSA PPDTGQFSGF GPNGFGDGGG YGDAGFGDSG
GYAGGFGGGA TDVGANGMAG NGSGGFADSD YHTAEFSRSG FGTGGGFGND SGHGNDGGHG
NDGGPGNDGG LGQETGDGFG SAGGFGSGDG FGSADSQPGA GLPSRSRTGR ASGDYARADY
TPPGQQGPAG DGTGYREPRA DRGQEFGRPM RENGYGGAEQ AYPGRQDGGI GDGAFRRSAR
SSHPGQSTGG GSAYPGANPP PAAPPFDTAA AAAEESADDG LDELGLPKRR RRANLAPQLR
RENADAGRSS LAAGPRSPEE IRSMMSSFQA NFGRGIEDGT VSNDGDDVRK VT