Gene Franean1_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4110 
Symbol 
ID5672468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4893568 
End bp4894803 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content72% 
IMG OID641242986 
Productdiguanylate phosphodiesterase with GAF sensor(s) 
Protein accessionYP_001508403 
Protein GI158315895 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC CGGCGTCGCC ACTGGGCCCA CCGATCGAGG TCGTGGAACC GGACCCCGCG 
GTGGCCGACC TGCTCGGGGT GCTGCGCCGC CACCTCGGTA TGGACCTCGC CTGGCTGGGC
CGGGTCGAGG GCGACCTGCT GGTGCTGCAG GTCGTCAACG GCGACGCGGC CGGTTTCGGC
ATCGCGCCGG GCAGCACGAT CCGCCGCGAG GCGGCCCTGT ACGCCCAGGT ACTCGCCCAC
GATCAGCCGG TCATCATCCC CGACACCCAC CGCGACCCGC GGACCGCCGA CGCGGGCACC
GTCCAGGTAC TGGGTATCGG GGCGTACGTG GCCACGCCGG TCTACGACAA CGACAACGAC
ATCTACGGCA TCCTCGGCTG CCTCGCGCAC CAGCCCCGCC CGGAGCTGCG CGAACGCGAC
GGCCGGTTCC TCAGCCTGCT GGCCGCGTTC CTCAGCGACG CCGTCATCGA CCTGCACCGG
GTGTGGGAGA CGCGCAGCCG GGTCTCCCGG GTGATCAACG ACCTCATCGA CGCCGGTGGC
CCGCAGATCG TCTTCCAGCC GGTGGTCGAC CTCTCCGATG GCGGCGTGGT GGGAGTCGAG
GCGCTGTCAC GCTTCCCGGG ATCGACCGAC GACCCGGAGG GCTGGTACGC CGTCGCCAGC
AGCGTCGGGC TCGGCACGGA CCTCGAGCTG ACGGCCATCC GGCGCGCGCT GACCGTGATG
TCCGAGCTGC CCAACTCGGT CACCCTCGCC GTGAACGCCT CACCGGCCAC CATCACCTCC
GGCCTGGTCG GCCTGCTGGC CCCGTTCCCG GCCTGCGAAC GACTGATCGT CGAGATCACC
GAGCACGAGT ACTTCAGCGC CGACCCGGTG GTCATGCGCG GCATCCACGC GCTGCGCGCG
CTCGGCGCCC AGATCGCGGT GGACGACATC GGCACCGGCT ACGCCGGCCT CGAACAGCTC
ATCCACCTGC GGCCCGAGAT CGTGAAACTC GACTACCTGA TCACCCACGG GATGGACGGC
GACCCCGCCC GCCGCGCCGT CGCCGCCGCC ATCGTGGACG TCGCCGCGGA GATCGGCGGC
TGCGTGATCG CCGAGGGCAT CGAGAACACC GCCGAGCTGC GGGTCGCCAT CGACGCCGGC
ATCGACTTCG GCCAGGGCTA TCTGCTCGGC GCACCCGCCC GCACCGCACG CGCCGCCTGC
ACCCCGGCCG TCGCACTCGC CAACCGCCCC GGCTGA
 
Protein sequence
MARPASPLGP PIEVVEPDPA VADLLGVLRR HLGMDLAWLG RVEGDLLVLQ VVNGDAAGFG 
IAPGSTIRRE AALYAQVLAH DQPVIIPDTH RDPRTADAGT VQVLGIGAYV ATPVYDNDND
IYGILGCLAH QPRPELRERD GRFLSLLAAF LSDAVIDLHR VWETRSRVSR VINDLIDAGG
PQIVFQPVVD LSDGGVVGVE ALSRFPGSTD DPEGWYAVAS SVGLGTDLEL TAIRRALTVM
SELPNSVTLA VNASPATITS GLVGLLAPFP ACERLIVEIT EHEYFSADPV VMRGIHALRA
LGAQIAVDDI GTGYAGLEQL IHLRPEIVKL DYLITHGMDG DPARRAVAAA IVDVAAEIGG
CVIAEGIENT AELRVAIDAG IDFGQGYLLG APARTARAAC TPAVALANRP G