Gene Franean1_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4199 
Symbol 
ID5672554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4999899 
End bp5001554 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content74% 
IMG OID641243072 
Productdiguanylate cyclase 
Protein accessionYP_001508489 
Protein GI158315981 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.57955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.854948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACG GACGGGAAGC GACACCGGGC GGGGCCCGCG GCCGCGCCGA GCCGGACGGG 
CTGCCCGCGC GTCCGGTGTC CCGCGGCGCG TTCCGCGCGA TGGCCGCCAT CGGGCTGGCC
TTCGTCGTGC TGGCCGCGGC AGTCGCGATG GCGCTGCCCG AGCACATCGG CAAACAGGCG
GCGCAGGCCG CGGTGACGGC TGCCGCGCTC ACCTCCGCGA TCTGCTGCCT CGTCACCGCG
CGACGGGTGC CCCGGCACGA GCGCCGGTGG CGGCGGTTCG AGGCGGCGGC GATCACGTCA
AGCTTCGTGG CGGGAGTGGC CGTCTCAATG GTGGCCAGCG GGGAGACCGT CCCCGATCTC
GCCGCGACCG GGGCCGCGCC GCTGCTCGGC TATGCCCTGG GCCTGGCCGG GCTGCTGACC
TTCCCCACCG AGCGGGCCCG CGCCATCCCG ATCAGCACGA ACAAACGTGG CGGGATGGCC
TGGTACCTCG AGGCCGTCCT CGATGGGCTG CTCGTCGTCG GATCACTGCT GCTGCTCGTC
TACGCGATCC TGGTCGTCCC GCTCGTGCGG TCGACGCAGA TCAACCAGGC GTCGCTGGCG
TTCGGGGTGG CCGGCGGGGC GGGCAAGCTC ATCCTGCTCT CCGCCGTCAT CTTCATCCTG
ATCTTCCGAC GGCCGACGGG CTCGGGATCG CTCAGCCTGC TCAGCGCGAG CCTGCTCCTG
TTCGCCGTGA CCGACGGCGC CGCGCTGAAC GCCTACGCGA AGGCCACCGA CGGCCCGGAC
GCGGTCGTGC TGGTCGGCTT CGCCGGCGGC AGCGTGCTGA CCGCGCTGGC CGCCGCCGCC
TCCCACGACG GCATGATCAT CCGGACGAGG CGCAGCCCGC GCGGGGTCTG GGCCCGCGTC
GCCGTGCCCT ACCTGCCCCT CGGCTGCGTC GGCGCCCTGC TCGCGGCCCA GATCATCACG
CACGCCGACA TCCCGGTCGC GCAGATCATC GGCATGCTCG CGCTGATGCT GCTCGCGCTC
GCCCGCCAGC TCGTGACCAC GATCGACAAC ACGCTGCTGC TGGCCCGCTA CGAGGACAGC
CGGGCCCGGC TGCACCACCA GGCCTTCCAC GACCCGCTGA CCGGCCTGGC GAACCGCACG
CTTTTCTTTC GCCGGCTGCG CGCGGCCATC GACCGGCACG AGCGCACCGG CCACCCGGTG
GCGTTGCTGT TCTGCGACCT GGACGACTTC AAGGTCGTCA ACGACAACCT CGGCCACGCC
GCCGGCGACC GGGTCCTGCG CACGGCGGCC CACCGGCTGG CCCAGGCGGC GGGCCCGACC
GACACGGTGG CCCGCCTCGG CGGCGACGAG TTCGCCGTCC TGTTCGACAC CGGGGCCGCG
GACCGGGCGG GCGGGCGCAC CGAGGAGCTG CGCACGGCCG GCGACCGGAT CCGCGCGGCG
CTGCGGATCG CCGTGGCCAT CGAGGACCGC CGGCACCTGG TGCGCGCCAG CCTCGGCCTG
GTGATCGTGG GCACGGAGGC GGGACCGGTC TGCCCCGACG AAGTGCTCCA GCACGCCGAC
CATGCCATGT ACGCGGCGAA ACGGCTGGGG AAGGGCAACC TCGTCGTCTA CTCCCCGGAG
ATCGACGACT CCCGGCAGGC CGGGATGGCC GGCTGA
 
Protein sequence
MPDGREATPG GARGRAEPDG LPARPVSRGA FRAMAAIGLA FVVLAAAVAM ALPEHIGKQA 
AQAAVTAAAL TSAICCLVTA RRVPRHERRW RRFEAAAITS SFVAGVAVSM VASGETVPDL
AATGAAPLLG YALGLAGLLT FPTERARAIP ISTNKRGGMA WYLEAVLDGL LVVGSLLLLV
YAILVVPLVR STQINQASLA FGVAGGAGKL ILLSAVIFIL IFRRPTGSGS LSLLSASLLL
FAVTDGAALN AYAKATDGPD AVVLVGFAGG SVLTALAAAA SHDGMIIRTR RSPRGVWARV
AVPYLPLGCV GALLAAQIIT HADIPVAQII GMLALMLLAL ARQLVTTIDN TLLLARYEDS
RARLHHQAFH DPLTGLANRT LFFRRLRAAI DRHERTGHPV ALLFCDLDDF KVVNDNLGHA
AGDRVLRTAA HRLAQAAGPT DTVARLGGDE FAVLFDTGAA DRAGGRTEEL RTAGDRIRAA
LRIAVAIEDR RHLVRASLGL VIVGTEAGPV CPDEVLQHAD HAMYAAKRLG KGNLVVYSPE
IDDSRQAGMA G