Gene Franean1_3142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3142 
Symbol 
ID5671519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3696450 
End bp3698150 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content62% 
IMG OID641242037 
Productdiguanylate cyclase 
Protein accessionYP_001507457 
Protein GI158314949 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.429492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG CCGCTGAGCA GATTGAGCAG GTCCCCTCAC AGACAGAACG AGAGTTGGTT 
AGTAGAACTC TGTTCAGGTG GGCCATCCTG GCCTGTACCG GCTATGGGGC GATCCTGCTC
GTCGTGGCGG TCGTCCTCCC CGACCGGGAC GCCGCCGCAC TCATAAAGCT GTCCCGGGCT
CTCGCGCTCC TGCTCGCCGG CCCGTGCTGC CTGTGGTGCG GCTGGCGTTC CCGCGGACCG
GAGCGTCTCT GGCGCGTACT CATGGGAGTC GCCTGCATCG GCGCCACCGC GGGCGTCGGC
GCCGCAGTCC GTTTCTCCCT GGAACACTAC CGGTACTCCG CCTGGGTTCC GCCGGTGAGC
TGGATACACA TGCTCTACCT GCTACCCTAT GCGGCGGTCC TGGCAGCGCT CTTAGTGTTC
CCCTCGACAC CGCTGACGGG TACTGCAGAC CGCACCTCGC ACAATGGCAG ATACTGGTAT
GTGGTCAGCA TTCTGGACAG CCTGCTCGTC GTCGGATCGC TCGGCGTCCT CGTGTGGGAG
ATCGGGCTCG GCGAGTTGTT CCAGGACAGG GGCGATCCGA CAGGACTGAT CGCCTTCGCG
GCCAGCAACG CTATCGTCAG CGCGACCGTC GTCGTCCTGC TCGTCCTGAT CGCTACGTTT
CGCCGGCCAC GGTCCCTGGC TGCGGCCACA CTACTCGCCA CGGGCCTGTT GGGAACGACC
TTCAGTACCG GGGTGTACCT GGTGGCGCGG GCGAACCACA TACAGCAGTT CGACCCGATA
TTACTTGTGG TTCCCGCGAC CGCTCCACTG ATAGTCGCGT TGTCCTGCCT TGTGCCGCCA
CCGCGCCGGG CCGGGCCCGC CCGCCGCGTG ATCTCACGCA CCATCTGGAT ACACACGGTG
CTGCCATACA TGTCGCTCGC GACGGTCGTT TTAATATTCA TTTTTCTCTG GGTTCTAGGC
AGCCACCAGA TCGGATTCGG ACAGATATGC ACACTTTCTG TGCTGCTGGT GCTGGCGCTG
GTCCGGCAGA TGATGACCAT GGCGGACAAC ACCCGTCTCA TTCTACAACT CCAGAACAGG
GAGCGCCTAT TGCACCATCA GGCATTCCAC GACCCCCTAA CCGGCCTGGC GAACCGGCTG
CTATTCACCG ACCGGCTCCA ACTGGCGCTC ACCCACTGTC TGCGCGACGG CCTACCATTC
GCTCTACTCT TCTGCGACAT CGACCGTTTC AAATGGATCA ACGATAACTT CGGACATGCT
GCGGGTGATG AACTGCTGAG AATCACAGCT ACACGCATGA CGAAATGCAC CCGGGTCTCC
GATACGGTCG CCCGACTCGG TGGTGATGAG TTCGCTATTC TCCTGACCGG TGGAGGAGTC
GGGCCGGAGA CGGTGAGCCG GCGCATCGTC GAGGCGATCC ACGCACCATG TACCATCGCC
GGGCAGCCCT ACCGGGTCAA AACCAGCGCC GGCCTGGTCA TCGCCACCGA AGCAGACGGA
CCGATGACGG CGGACATGCT GTTGTTTCAG GCCGACCTCG CCATGTATGA GGCCAAACGA
CAAGCATCGG AAAAGCTCGT AGTATACCAG CCAGAAATGT CGCTTGAAAC CGGGTATATT
TCGGCAGAAT GCCAGCAAAT GGCGGGCAGT CGCAGAAATG GCGCAAGCAG CATCGAACCG
CGACGCTATC GCACCTCCTG A
 
Protein sequence
MTDAAEQIEQ VPSQTERELV SRTLFRWAIL ACTGYGAILL VVAVVLPDRD AAALIKLSRA 
LALLLAGPCC LWCGWRSRGP ERLWRVLMGV ACIGATAGVG AAVRFSLEHY RYSAWVPPVS
WIHMLYLLPY AAVLAALLVF PSTPLTGTAD RTSHNGRYWY VVSILDSLLV VGSLGVLVWE
IGLGELFQDR GDPTGLIAFA ASNAIVSATV VVLLVLIATF RRPRSLAAAT LLATGLLGTT
FSTGVYLVAR ANHIQQFDPI LLVVPATAPL IVALSCLVPP PRRAGPARRV ISRTIWIHTV
LPYMSLATVV LIFIFLWVLG SHQIGFGQIC TLSVLLVLAL VRQMMTMADN TRLILQLQNR
ERLLHHQAFH DPLTGLANRL LFTDRLQLAL THCLRDGLPF ALLFCDIDRF KWINDNFGHA
AGDELLRITA TRMTKCTRVS DTVARLGGDE FAILLTGGGV GPETVSRRIV EAIHAPCTIA
GQPYRVKTSA GLVIATEADG PMTADMLLFQ ADLAMYEAKR QASEKLVVYQ PEMSLETGYI
SAECQQMAGS RRNGASSIEP RRYRTS