Gene Franean1_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1097 
Symbol 
ID5669511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1309865 
End bp1311235 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content72% 
IMG OID641240029 
Productdiguanylate phosphodiesterase 
Protein accessionYP_001505459 
Protein GI158312951 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCCTC AACAGTCGAC CATCAGGCAC CCGGTCGAGG CCGGTCTACC GGGGTCGCGC 
GGACCCGCCG GTCAGGGTCA GGTGGACGAG GCGGGCGGAG CCCGCCCCGC GGACGGACGC
CGCCAGGTTC CCGGCCTGCC CACGCAGCCG GATCTCGACG GCATCCTGGC CGACCCCGAC
ACGCCGTCCC TTGTCTTCCA GCCAATCGTG GACCTGCGCC GCGGCGTCAC GGCCGGCTAC
GAGGCCCTGG CACGCTTCGG CCCGGACCCG CGCAACGCAC CCCATCTCGT GTTCGGCGAG
GCGGACCGCC GCGGCTGCGC GGCCGAGCTC GAGGCCCGGG TCCTGCGCCG GGCACTCGCC
GCCCGCGATC ATCTTCCCGA CCGTTGTTTT CTCGCCGTCA ACGTGTTACC GCATCTTCTT
TCCTCCCCCG AGGTCGCGGC GGTCTGGCGG AGCGCCGATC TCTCCCGCAT CGTTCTCGAG
CTGAACGAGG CCGTCGACAT CGAGCGCGCG ACCGGTCTGA CGGCGACCTC GCAGGAGCTG
CGGGACCACG GCGCGTTCCT CGCCATGGAC GATGTCGGTT CGGGATATGC CGGCCTGCGC
CAGCTCACCC ACATCCGGCC CGATTTCGTG AAACTCGACG CGTCACTGGT CTCGAACATC
GACGACGACC AGGTGAAGAT CGCACTCACC GAGCTGGTCG GCGGATTCGC CAGCCGCCTC
AACGGCTGGG TCATCGCCGA GGGTGTGGAG CGCGTCCAGG AGCTGACCAT GCTGGTCGCC
CTCGGCGTCC CCCTCGGGCA GGGCTTCCTG CTCGGACGGC CGTCCGCCCG CTGGCAGGAG
CTTGACCCGG CCGTGGCCAG GCGGATCAGG CTGCTCTCCG CGCGCTCCGA CCGTTCCTCG
CGCATCGTGA GCCTGATGGA GCCGGTCCGG ATCTCGACCG GCGACTACGG GCGCTGCGGC
CAGGTCCCGG GCTGCCCGCC CTGCGTGCAC TCGCCCGCGC CGGGCGACGA CGCCGGCTCG
CCGTCCCCGG GCGGACGTCC CGAGGAACAC GCCGAAGAAC ACGCCGAGGA TCACGAAACG
CTCAGGGGCG ACGGTTCCGG TGGCGGGACT GCCATCATCG TCAGCAACCG GTGCCGGCCG
GTCGCCGTGC GGCTGGCCGG CGGCGCGGGT GGGCGGGAGC CGCAGCGGAT TCCCACGTCG
CTGTTCGCGC TCCCGGACGA GCCCGTCACG GAGGTCGCCC GCCGGGCGAT GACCCGGCCC
GCCGGCTGCC GGTTCGACCC GGTGATAATC GTGACCGAGA TGGGACGGCC TCTCGGCCTG
GTACGGATGG AGCGCCTGAT GCTGCGTCTC GCGGATCTGT CGGCCACATG A
 
Protein sequence
MVPQQSTIRH PVEAGLPGSR GPAGQGQVDE AGGARPADGR RQVPGLPTQP DLDGILADPD 
TPSLVFQPIV DLRRGVTAGY EALARFGPDP RNAPHLVFGE ADRRGCAAEL EARVLRRALA
ARDHLPDRCF LAVNVLPHLL SSPEVAAVWR SADLSRIVLE LNEAVDIERA TGLTATSQEL
RDHGAFLAMD DVGSGYAGLR QLTHIRPDFV KLDASLVSNI DDDQVKIALT ELVGGFASRL
NGWVIAEGVE RVQELTMLVA LGVPLGQGFL LGRPSARWQE LDPAVARRIR LLSARSDRSS
RIVSLMEPVR ISTGDYGRCG QVPGCPPCVH SPAPGDDAGS PSPGGRPEEH AEEHAEDHET
LRGDGSGGGT AIIVSNRCRP VAVRLAGGAG GREPQRIPTS LFALPDEPVT EVARRAMTRP
AGCRFDPVII VTEMGRPLGL VRMERLMLRL ADLSAT