Gene Francci3_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1004 
Symbol 
ID3906691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1195983 
End bp1198160 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content70% 
IMG OID637878337 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_480116 
Protein GI86739716 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGTCCC TGCCCCAGAC AGCCCGCCAG CAGGGCGTCG CGCGCTTCGC CCAGGCCTGG 
GCCCGAGAGA TCCTCGGCGC CAGCTATCCG GCGATCGGCC TGATCGACGT CGAGCAGCGG
TTACGGGCGT TGAGCGACCG GCTCGTCGAC GCCCTGCTCG CCGAACCGTT CACGGCCGAC
CCTGCGGCCG CGGTCGGGGC TGCCGTCGTC GAGTACACCT TCGGCGCCGC GGACGCGTTG
GCCGGAACCA TCGAGATCTG TGGCACCAGA CTGCTCACCG ACCTACGACT CGGGACGGCC
GACGGGTACG GCGCCCGGCT GGCGGCGTTG CAGGGCGGGC TGGCCCAGGG ATACGCCCAT
GCGCTGCGGG AGCGGATCCT CGCCGGGCAG GACTCGATCC ATCGCGCGGC TCTGCACGCG
AGCGATGCCC GCTTCCGGAT GATCTTCACC GGCTCCGCGA TCGGAATCCT GATCAATGAT
GGAGGCGGAC GGATCATCGA CGCCAACGAC GCTTTTCTGC GGATGCTCGG AGTCGGCCCG
GGTCAGGTGC AGGGCAGGTC AATCGACGAG TTCGCGCATC CCGAGGACGC GAGGGAGCTC
CGGGAGGCCT ATGCCGCCCA GCGGACCGCC GGACCGGAGC ACATCCGGAT GGAGTGGCGG
TTCACCGGGA TCGGCGGGTC GGTGGTGTGG GCCGAGCTGA CCACGTCGTG TCTGCGCGAC
GACGACGGGC GGTGCCGTGC GCAGCTCTCC ATGATCGTGG ATGTGACTGA CCGGCATCTG
TTGCAGAACC AGCTGCGCCG CCAGGCGCGG CACGACCCGC TCACCGGCCT GCCGAACCGG
ACGGTACTCA TCGAACGGGT CCACGAACTG ATGGCCACGG GACCGGAACG TTGCATCGGG
CTGTGCTTCA TCGATCTGGA CGGATTCAAG ATGGTGAACG ACAGCCTCGG CCACGACGTC
GGCGACCGGT TGCTCGTCGC CGTCACGGAC CGGCTGCGTC GGGTGCTGTC CCCGGAGCAG
CTCCTGGTCA GGATGGGTGG TGACGAGTTC GTGATCCTTG TCGCCGACAC AGCCGGCACG
GCGGACGCCG TCGCCGTCGC CGAGACCGTG CTGGCGGCGC TGGCGAGCCC GCTGCGGGTG
GGTGACCACG CGCTGTCCAT CGGAGCGAGC ATCGGCATCG TCGAACGGCC CGTCGCCGGC
GGGGACTACG CCGATCTGCT ACGGGCCGCC GACATCACGC TCTACCGGGC GAAGGCGGAG
GGGAAGGGTC GCTGGGCCCT GTTCGAGGCG GAGCGAGGCG CCCGGCAGGT GACCCGGCAC
ACTCTGTCGA CCATGCTGCC CCGCGCCCTC GAACAGGACC AGTTCGTCGT CGAGTACCAG
CCGATCATCG CCCTCGGCGA CGGTGCGGGT GCCAGGATGG CCGCCGGGGC GATGGTCGGG
GTGGAGGCGC TGGTCCGCTG GCGGCATCCT GGCCTCGGTC TGCTGCCCCC CGATCTGTTT
ATCGGGATGG CGGAGGAGAC CGGGCAGATC GTGCCCCTCG GCCGGCGCGT GCTGGAGCTC
GCCTGTCGGC AGGCGCGGCA GTGGCAGGAG TGTCATCCGC ACGCCCTGTT CGTCAGCGTC
AACCTGGCCG CCCGGCAGAC CCGCGAGGCC GGCCTGGTGG AACAGGTCCG ACGCATTCTC
GACGAGACCG GGCTTTCCCC GGACCTGCTC CAGCTGGAAC TTACCGAGAG CGCCTTCATG
GGCACGGCCG ACGAGCCGCT GCGGGTCCTG CGCGGGCTCG CCGAGATGGG CGTGCGGATC
GCTATCGACG ACTTCGGCAC CGGTTATTCC AACCTCGCCT ACCTGCGTCG GCTGCCGGTG
CACACGCTCA AGCTCGCCGG ACCGTTCGTC GAGGGCTTCC ACTCGGCGCG GGACGCCGAC
CCGGGGGATG AGGAGATCGT GCAGACGCTG GTCACGTTGG CCCACACCCT GCGCCTGACG
GTGACCGCCG AGGGCGTCGA GACCCCGATG CAGGCCGAGC GGCTGCGCGC GGCCGCCTGC
GACAGCGCGC AGGGCTTTCT CTTCGCGCCG CCGCTGACGG CTGAGGAGAT CACCCAGAGG
ATCCTCACCC GGTCCGCTGG CTCGGACCGA ACATCTTCGA GCCCCACGCA GGCAGCGCGG
GCGGCATGGG CGCGCTGA
 
Protein sequence
MRSLPQTARQ QGVARFAQAW AREILGASYP AIGLIDVEQR LRALSDRLVD ALLAEPFTAD 
PAAAVGAAVV EYTFGAADAL AGTIEICGTR LLTDLRLGTA DGYGARLAAL QGGLAQGYAH
ALRERILAGQ DSIHRAALHA SDARFRMIFT GSAIGILIND GGGRIIDAND AFLRMLGVGP
GQVQGRSIDE FAHPEDAREL REAYAAQRTA GPEHIRMEWR FTGIGGSVVW AELTTSCLRD
DDGRCRAQLS MIVDVTDRHL LQNQLRRQAR HDPLTGLPNR TVLIERVHEL MATGPERCIG
LCFIDLDGFK MVNDSLGHDV GDRLLVAVTD RLRRVLSPEQ LLVRMGGDEF VILVADTAGT
ADAVAVAETV LAALASPLRV GDHALSIGAS IGIVERPVAG GDYADLLRAA DITLYRAKAE
GKGRWALFEA ERGARQVTRH TLSTMLPRAL EQDQFVVEYQ PIIALGDGAG ARMAAGAMVG
VEALVRWRHP GLGLLPPDLF IGMAEETGQI VPLGRRVLEL ACRQARQWQE CHPHALFVSV
NLAARQTREA GLVEQVRRIL DETGLSPDLL QLELTESAFM GTADEPLRVL RGLAEMGVRI
AIDDFGTGYS NLAYLRRLPV HTLKLAGPFV EGFHSARDAD PGDEEIVQTL VTLAHTLRLT
VTAEGVETPM QAERLRAAAC DSAQGFLFAP PLTAEEITQR ILTRSAGSDR TSSSPTQAAR
AAWAR