Gene Franean1_4444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4444 
Symbol 
ID5672796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5308711 
End bp5310381 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID641243313 
Productdiguanylate cyclase with PAS/PAC sensor 
Protein accessionYP_001508729 
Protein GI158316221 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGGCC GGGACAGCGC AGAGCTTCTT GGCACCACGT GGACGGCTCC TGACGTCCGG 
TTCAGCGGTG GGAACTGGCG GGAGCTGGGT TCCTTACTGG GCTGCGGCAG CCCGGTGTCG
CGTCGTTTGA TACGGCTCGT GCGCCCTGGT GGGAGTGTCA TTTACCTGCT GGTCAGCGCG
AGCGTGACGG ACGAGGTGGA ATGCGGCGGG CTGACGGAGC GGTGCTGCCT TGGTCAGCTT
ATCGACGTGA CCGTCGGGCA GACCGCGCGT GAACAACTCG CGCGGATTCT GGAAGACAGC
CCAGTCTCCA CGGCACTTGT CAACCGGGCC GGGCGAATAG CCGTCCTCGG GGGAGGCGTT
ATTCCAGGAA TGGCCGAGGA TCTCAAGGCA GCCGAGCATA AATCCGTGTT CGAAGTATTC
AGCTACCTGC CGGATGCGAC CGCGATGATC CGGCGCGCCA TGGCGGGAGA GCCGAGCATC
GGGCTGATGC AGGCGTTTGG CCGTTGGATC GACCTGCACA TACGCCCGGC TCTCGCCGAG
TCCGGCGAGG TCGACTGCAT CGCCGCCATC GCCATCGACG TCACGGAACG CGAACAGGCC
CAGGCGGACC AGCGCCTGCT GGCGGATCTT GCCGGCCAGG CGTTGCGCAG CATTGCACCG
GAGGAGCTCT GGGATGAGGC CGCGCAGGTG GTCGCCCGCC GCTTCGACGC CATGGTCGCC
GTGCACGTCG TCGTTCCCGA CCGCGAGCTC CGCCTGGCTG CCATGGCCGG ACCGGCGCCG
CTCCCCGCGG TGGCCGTGAA GCTGTGGGAG AGCGCCAGGT CCGTTCTCGG CCCTCGGACC
GTCGGGCCTG AAGGGAGGCC GCGGGAGAGG GACGATCCCG GCGAGGGGCG CGGCAAGCCC
GTCGCGACCG CCACCGCCAC CGCGGGCGGG GCTGGACGTG GCGTCCACCT GCTGTCGGTG
CCGCTGGGCC AGCCCAGCGC GCCGAGTGCC GTGCTCACCC TGTACCGCGG GGACCAGCCG
GCGGCCTCGA CCCGCGGCCG GGCTGACGAA CGGACTGTCG GAAAGTCCGC GACGGGGCCG
TGGTCGACGC AGGAGCGGGA GTTCGCCGAC GCGGTTGCCG GTGTGCTTGG CGCGGCGGCG
GTGCGGTTCG CGATGGAACG GCTGGCCCGC TACCGGGCCT CGCACGACGA GCTGACGGAC
CTGCCGAACC GGGCCGCCCT CTTCGAGCGG TTATACGATG ACCTCCGACA CGGATTCGAC
GAGGGTGTGC AGACCGGGGT GATCTTCATC GACCTGGACA ACTTCAAAAC GATTAATGAT
TCTCGCGGTC ATCTGGCTGG CGATGAGGTG CTCCGGCAGG TTGCCCGCCG GCTTCGGGCC
GCGGTTCGCG CGAGCGATGT GGTGGCGCGG CTCGCCGGCG ACGAGTTCGC CGTCGTCTGT
CCGCGGGTGA CCGGCGCGAC CGAGGTCGAA CGGGTGGCAC GTCGGGTGCT GGCGAGCCTC
GACGGACCGA TCGCCCTGCA CGGCGGGCGG GTGAACGTGA CCGCCAGCGC CGGTGTCGCG
GTGTCGGGAT GCGACCTCAC CGATGCCGAC CGTCTGCTGA ACGCCGCCGA CATCGCCATG
TACACCGCGA AGCGGGCAGG CCCGGGTCGG TGCTTCGTCC AACATGGCTG A
 
Protein sequence
MLGRDSAELL GTTWTAPDVR FSGGNWRELG SLLGCGSPVS RRLIRLVRPG GSVIYLLVSA 
SVTDEVECGG LTERCCLGQL IDVTVGQTAR EQLARILEDS PVSTALVNRA GRIAVLGGGV
IPGMAEDLKA AEHKSVFEVF SYLPDATAMI RRAMAGEPSI GLMQAFGRWI DLHIRPALAE
SGEVDCIAAI AIDVTEREQA QADQRLLADL AGQALRSIAP EELWDEAAQV VARRFDAMVA
VHVVVPDREL RLAAMAGPAP LPAVAVKLWE SARSVLGPRT VGPEGRPRER DDPGEGRGKP
VATATATAGG AGRGVHLLSV PLGQPSAPSA VLTLYRGDQP AASTRGRADE RTVGKSATGP
WSTQEREFAD AVAGVLGAAA VRFAMERLAR YRASHDELTD LPNRAALFER LYDDLRHGFD
EGVQTGVIFI DLDNFKTIND SRGHLAGDEV LRQVARRLRA AVRASDVVAR LAGDEFAVVC
PRVTGATEVE RVARRVLASL DGPIALHGGR VNVTASAGVA VSGCDLTDAD RLLNAADIAM
YTAKRAGPGR CFVQHG