Gene Francci3_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2421 
Symbol 
ID3906404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2811667 
End bp2812965 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content71% 
IMG OID637879751 
Productdiguanylate phosphodiesterase 
Protein accessionYP_481517 
Protein GI86741117 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0356024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0553373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACA TGGAGGTGGT CCGGATGACG GATGCGCTGT CGACGTTGGG CCCGCCGGCA 
GGCGTCACAC AGCCGGACAC AGTAGTGACG GATCTCCTCG AGATCCTGCG CCGGCATCTG
CGGATGGACG TCGCGGGGCT GGCCCGGATC GACGGGGATC TGCTTGTCCT GCAGGTTCTC
AGCGGGGACG CACGGGGGTT CAGGTTGGCG CCGGGTTCGA CGATCCGCCG CGAGCATGCC
CTGCTCGGGC GGGTGCTGTC CGGCGAGATA CCGGAGATCG TCGCGGACAC GCGCACCGAT
CCGCGCACCG CGGACGCGGG CGTCGTCCGC GAGTTGGGCG TCGGCGCGTA CGCCGCGGCG
CCGGTCGCCG ACAACGACGG CGAGGTGTAC GGCATAGTCG GCTGCCTCGC CCACGACGCG
CTCCCCCACC CACGCGACCG CGACGTGCAC TTCCTGCACC TGTTGGCGGC CTTCCTGAGC
GACGCGGTCC TCGACCTGCA CCGCCTGTGG GAACAGCGAC GCCGCATCTG GCAGGAGGTG
AGCGACCTCA TCGACGCGGG CGGCCCAAAG ATGATCTTCC AGCCGATCTT CAGGCTCACG
GACGAGCGGA TCGTCGGGGT CGAGGCGCTG TCCCGCTTCC CCCACACGAC CGGCGATGCG
CAGCAGTGGT ACAACGACGC CGCAACCGTC GGCCTGAGCA TCGAACTGGA ACTCATGGCG
ATCCGCCACG CGCTGACCGT CCTGCCGACG CTCCCGTCCG ACCTCACCCT CGCCGTCAAC
GCCTCGCCGT CCACCATCAC CTCCGGCCTG GTCGACATCC TTCCCGACCG GGGGGCCGAT
CGCCTCATCG TGGAGATCAC CGAGCACGAG CACATCGGCG ACGACTCGGA GCTGCTGGTC
GCCGTCGGCG TGCTGCGCCG CCGCGGGGTC CGCATCGCGA TCGACGACGT CGGCACCGGC
TACGCGGGCC TGGAACAGCT CATCCACCTG CGCCCGGAGA TCATCAAACT GGATCGGGTC
CTCACCCACG GGATCGACAC CGATCCGGCC AGGCGCGCCA TCGCGACGGG ACTGGTGCAG
GTCGCCGGCG AGATCGGCGG CTGCGTCATC GCCGAGGGCA TCGAAACCCC GATGGAGCTC
GACACGGCGA TGGCGGCCGG GATCCCCTAC GGCCAAGGCT ACCTGCTCGG CCATCCCACC
ACGACCGCCG GGGCCGCCTG GGTGGAGCAC TCCGCGCACC GGCCGACGGC CGCCGAACCG
GAACCGGTGC CGGCCACCTC CTCCCGGCGG CTCGGCTGA
 
Protein sequence
MPNMEVVRMT DALSTLGPPA GVTQPDTVVT DLLEILRRHL RMDVAGLARI DGDLLVLQVL 
SGDARGFRLA PGSTIRREHA LLGRVLSGEI PEIVADTRTD PRTADAGVVR ELGVGAYAAA
PVADNDGEVY GIVGCLAHDA LPHPRDRDVH FLHLLAAFLS DAVLDLHRLW EQRRRIWQEV
SDLIDAGGPK MIFQPIFRLT DERIVGVEAL SRFPHTTGDA QQWYNDAATV GLSIELELMA
IRHALTVLPT LPSDLTLAVN ASPSTITSGL VDILPDRGAD RLIVEITEHE HIGDDSELLV
AVGVLRRRGV RIAIDDVGTG YAGLEQLIHL RPEIIKLDRV LTHGIDTDPA RRAIATGLVQ
VAGEIGGCVI AEGIETPMEL DTAMAAGIPY GQGYLLGHPT TTAGAAWVEH SAHRPTAAEP
EPVPATSSRR LG