Gene Franean1_3834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3834 
Symbol 
ID5672197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4554835 
End bp4556964 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content73% 
IMG OID641242712 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001508132 
Protein GI158315624 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0585187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTGGG TGTCACGCCC GTCTGACACG ACGGGCCATG GGGTGTCCCG TCGCTGGTCG 
GCCGCGGCCA CCGCCGCCAC CGTGGTGTTC TCTATGGCGG CGGCACCCAT GATCGTCCTC
GCCGCTGCCG GCGGCCACGG CAAGGACGTC GGGGAATGGT TGATCCTGCT CTACACGCCC
GGGTTCGCCG TCCCCGGGTT GATCCTCACC CGTCGCCGGC CAGAGCTCGT GATCGGCTGG
ATGTACCTGG TGGCGGGCCT CACGACCAGT GTCACCGGGC TCGCCGCCGG CTACGCGGGC
GCCGCGCTCG GCCACGGCTG GCCGGGCCCG GACTGGGCGT TGTGGGTCGT CTCCTGGTCA
TGGCAGGCGC ATCTCACCCT GGAGGCAACG GCACTGTTCC TGTTCCCCTC CGGCCGCATC
ACGTCCATGG GCTGGCGTCG CGCGGCGCAG GTGCCCCTGG CTCTCACGGC ACTCGCGATG
GTGGCGTCCG CGCTGCGCGC GGGAGTAATC GTGACGACAC CGGATGAGCC GGGCGGTTCG
CCGACCGGCC TCGTCAATCC CATGGGCCTC GACGTCCCGG CACTGGCGGG CGTGGCGCAG
GCCCTCGGCC TCGTCGGCGA TCTCGCCGGG GTGGCGGCCA TCGCGAGCAT CCTGGTCCGG
TACGTCCGCG CACGCGGAGA TCTGCGGCAG CAACTCAGGT GGGTCGCGGC GACCCAGCTG
CTGGTACCCG TCATCTTGGT GACGGTCCTC GTCGAGCCGT CCTCGATCGG GCCGTTCATC
GCGATCGCGC AGACGCTGCT GCAACAGGTC GCCGTCGTGG CCGCGATCCT GCGCTGGCGG
CTGTACGGGA TCGACGTGGC CGTCCGGCGT TCGGTCCTGG CCGCGACCCT GCTGACCGCG
GCGCTCGGCA CCTACGCGGC GGTGGTACTC GCGGTGGGCG CGCTAATCGG AACCACCGGG
CCGCTCGTGT CGGCGATCGG CGCGGCCGCG GCGGTCTTCG CGTTCGGCCC GATGTCGGTC
AGCATCCGGG CCCGGATCAA CCGGCTGTTC TACGGCCGGC GAGACGACCC GTACGCGGTG
GTCGCGGCGG TCGGCCGGCA GCTTTCCACC GCCCCCGGCC CAGAGGATGG GCTGCACATC
CTCGCCGAGA CGCTGACCCT CGCGTTGCGG ATCCCTTACG CGGGGATCGT CACCGCCGAC
AACCGTGTCG TTGCCGAACA TCACGCCGGC CGGCGCCGCC CGGAGCACGA CGGCGACCCC
CTGCGCCCGA CCGACGACGA GACCGACGCA CTCCCGCTCG GCCACCACGG CCAGCAGGTG
GGGACCCTCC TGATTGGCCG GCGGCGCGGC GAGGACCGCA TGTCCGGTGC CGACCGGGCA
CTGCTAGGCG ACGTCGCCCG GCAGGTCGGC GCCGCCGTGC ACGCGGTGGC GTTGCTGCAC
GACCTCCGCG GTGCCCGGAC CCGGCTGGTA CTGGCCCGCG AGGACGAGCG GCGTCGGCTC
CAACGTGACC TCCACGACGG GCTTGGGCCG CGGCTGACGG CGACCGGGCT GACTCTTGAC
GCGGCCCGCA ACCGGCTCCG CAGCACTCCA GAGCTCAGCG ACGAACTCCT CGCCGATGCC
CGGGCCCAGG TCAACGAAGC GATCGACGAC GTCAGGCGCC TCGTCTACGC CCTCGGCGAC
CCGTCCTTGG AATCGGCCGG GCTCGTCGCC GCGCTGCGGG CTGCCGCGAC GAGGCTGGGC
CGCGGCGGCT TCCCCGGCCG CGGGACCGAG GGGGGCCGCC ACCCGGAGGT GGTCATTCTC
GCGGACGGTC TGGTTCGGCT CCCGGCCGGC ATCGAGACCG CCGCATACCG CATCGTCACG
GAGGCCATCA CGAACACCGT CAGGCACGCC GCGGCGCGCC GGTGCGAGGT CAGCCTGCGG
GCCGGCTCAG CATTGTCGAT CGAGGTCCAC GACGACGGGC GCGGGCTGCC CGACGGCTGG
CAACCAGGCG TCGGCGTGCG GTCGATCAAC GAACGAGCCG CCGACCTAGG CGGGCAGTGC
ACGATCACGT CGCCGCCTGG AGGTGGCACG CTGGTGACCG TGACGATTCC CCTTCCCCGG
CCGGGTGCGG CCTCGCCGGT GGTCTCATGA
 
Protein sequence
MPWVSRPSDT TGHGVSRRWS AAATAATVVF SMAAAPMIVL AAAGGHGKDV GEWLILLYTP 
GFAVPGLILT RRRPELVIGW MYLVAGLTTS VTGLAAGYAG AALGHGWPGP DWALWVVSWS
WQAHLTLEAT ALFLFPSGRI TSMGWRRAAQ VPLALTALAM VASALRAGVI VTTPDEPGGS
PTGLVNPMGL DVPALAGVAQ ALGLVGDLAG VAAIASILVR YVRARGDLRQ QLRWVAATQL
LVPVILVTVL VEPSSIGPFI AIAQTLLQQV AVVAAILRWR LYGIDVAVRR SVLAATLLTA
ALGTYAAVVL AVGALIGTTG PLVSAIGAAA AVFAFGPMSV SIRARINRLF YGRRDDPYAV
VAAVGRQLST APGPEDGLHI LAETLTLALR IPYAGIVTAD NRVVAEHHAG RRRPEHDGDP
LRPTDDETDA LPLGHHGQQV GTLLIGRRRG EDRMSGADRA LLGDVARQVG AAVHAVALLH
DLRGARTRLV LAREDERRRL QRDLHDGLGP RLTATGLTLD AARNRLRSTP ELSDELLADA
RAQVNEAIDD VRRLVYALGD PSLESAGLVA ALRAAATRLG RGGFPGRGTE GGRHPEVVIL
ADGLVRLPAG IETAAYRIVT EAITNTVRHA AARRCEVSLR AGSALSIEVH DDGRGLPDGW
QPGVGVRSIN ERAADLGGQC TITSPPGGGT LVTVTIPLPR PGAASPVVS