Gene Franean1_2686 details

Gene Information

Locus tag: Franean1_2686
Symbol:
ID: 5671077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3178548
End bp: 3180218
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 69%
IMG OID: 641241598
Product: diguanylate cyclase
Protein accession: YP_001507018
Protein GI: 158314510
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
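
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names gene_length_bp and protein_length_aa are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3178548, 3180218)
print(length, protein_length_aa(length))  # prints "1671 556"
```

Both results agree with the Gene Length and Protein Length fields of this record.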


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCGGGGC TCCGCGTAGT CGGGGGCGTT GTGCCAGGCC TTCTGCGTTC GGCTGACTGG 
TCCCACGTGC CACGGTTGTT CCGGTACGCC TTGGCGGGCT GCCTGCTGTA CGTCGGGCTG
GCGGCTGTGT TCGCGGCGCT GCTGGCCCCG GGGCCGGCCG GGATCGCCAC GACACTCGTC
GCGTTCGCCT CCGAGGTCAC CGCCGCAGTG GCCTGCTTCT GGAGCGCTCG GCACGCCGCA
ACGGATGATC GACGATGGCG GGTGCTCATC GGCATGTTCG TGGTTGGCCT CGCGGGAGGC
GCCCTCATAA CTGCGGTGAC GTTGCTGAAG GGCGACCCGA TCACGTCCGC TGTGACCTCG
GAGTATCTGG GCCTGATTGT CTTCTACGGA CTGGCGCTGG CAGGGCTGTT ATGCCTGCCG
ACCTACCCGG TTGAGGGCCG GGGCGTGCGT GGGCGGGGAG GTGACCTGAG CCGTTGGCAT
GCGATCGTCG CGCTCGACAG CGTACTGATC GTCGGCTCGG TCCTTCTCCT GGAATGGGGG
ACGTCGCTGG AGGCGATCGC ACGGGCAAGC GGGCCCGACC CTGCGCAGCT CCTCGGCGCG
CTCGTCCACC AGCTGTCGGT GCTGATCCTC GCGGCGACTG TGCTGCTGAT CGCGACGTTC
CGCCGACCGC GGTCCCCGGC GACGTTGGCG CTGCTGGGCA GTGGCCTGCT GGCGTACGCC
CTCATGAACA TCATCGTCGT CTACCGCTTC GCCCACGGCC ACTACGACCT TCCGGCGTGG
AGCCTGATGC CGCTCGTCGT CTCCCTCCAG TTGATAGCCC TCGCCGCGCT GGCACCGGTT
CGTGGCCCGG TGGATCGGGA CAGTGCGGCC GCGCCCGGTC CGCGGGCGAT GTGGGCGCAT
GCCGCCTTGC CGTATGCCGT GCTCGGCGTG ACCAGCCTGT TGCCTCTTGG CAAGCTGGTG
GCGGGCACGC CGCTCGACCG GATCGAGGCG TATGGCGCGG TGTCGCTCCT GGCCTTGGCG
TTCACAAGGC AGATGATCAC CATTGCCGAG AACACCCATC TGCTCACCGC GGTGAGGGAA
CGCGAGAAAC AGCTGCACTA TCAGGCGTTT CATGACCCCT TGACCGGTCT GGCGAACCGG
GCGCTGTTCG CCCGACGCCT GCAGCGCGAA GTCGACCATG GCATCGAGCC GAGGAACGAC
GGCGCACCCA CTGGCGGACA GGCCGCTGTC TCCGTTCTGT TCCTAGACCT GGACCAGTTC
AAACGGGTCA ACGACACGTT CGGGCACGCC ACCGGCGACG AGCTTCTCAA GATCATCGCA
GAGAGGCTGC GGGCCGGAAC CCGCGCCAAC GACACGGTCG CCCGCCTCGG TGGCGACGAG
TTCGCGGTCA TCCTCGACGG CGCCGGCCCG GACAAACCAG TCCAGATGGC CGAGCGCCTC
GCGGCCGCGG TACAGACGCC TTGCCAGCTC GCGGGCCAGA CCTACCTCCC ACGCGCCAGT
CTCGGCCTTG TCACCCTCGA CCCCGACGCG CGACCAGCAA GCCCCGACAG CCTGCTCCAC
CAGGCCGACC TGGCGATGTA CGCAGCGAAA CGCGCCCAGA CGAGCAGACT TGTCGTCTAC
GACCGCCACC TGACGGTCCG CCGCGGCCGC GATCAACCGT ACCGTCACTA G
 
Protein sequence
MAGLRVVGGV VPGLLRSADW SHVPRLFRYA LAGCLLYVGL AAVFAALLAP GPAGIATTLV 
AFASEVTAAV ACFWSARHAA TDDRRWRVLI GMFVVGLAGG ALITAVTLLK GDPITSAVTS
EYLGLIVFYG LALAGLLCLP TYPVEGRGVR GRGGDLSRWH AIVALDSVLI VGSVLLLEWG
TSLEAIARAS GPDPAQLLGA LVHQLSVLIL AATVLLIATF RRPRSPATLA LLGSGLLAYA
LMNIIVVYRF AHGHYDLPAW SLMPLVVSLQ LIALAALAPV RGPVDRDSAA APGPRAMWAH
AALPYAVLGV TSLLPLGKLV AGTPLDRIEA YGAVSLLALA FTRQMITIAE NTHLLTAVRE
REKQLHYQAF HDPLTGLANR ALFARRLQRE VDHGIEPRND GAPTGGQAAV SVLFLDLDQF
KRVNDTFGHA TGDELLKIIA ERLRAGTRAN DTVARLGGDE FAVILDGAGP DKPVQMAERL
AAAVQTPCQL AGQTYLPRAS LGLVTLDPDA RPASPDSLLH QADLAMYAAK RAQTSRLVVY
DRHLTVRRGR DQPYRH
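
The gene and protein sequences are related through translation table 11, which uses the standard codon assignments and allows alternative start codons such as GTG. The sketch below shows one way to check that relationship and to compute GC content from the 10-base blocks used in this record. The helper names (clean, gc_content, translate_cds) are hypothetical. The code assumes the first codon is a valid start codon and therefore reports it as M.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; it differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        # Assumption: the first codon is a start codon, read as methionine.
        residues[0] = "M"
    return "".join(residues)


# First line of the gene sequence, first three blocks only.
print(translate_cds("GTGGCGGGGC TCCGCGTAGT CGGGGGCGTT"))  # prints "MAGLRVVGGV"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to gc_content and translate_cds would allow the same comparison against the GC content and Protein sequence fields.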