Gene Francci3_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1613 
Symbol 
ID3903748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1934288 
End bp1936054 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content70% 
IMG OID637878950 
Productdiguanylate cyclase 
Protein accessionYP_480718 
Protein GI86740318 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.560816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCG TATACCCGAT CACCCACGAT AGGTGGTACA CGCTGGCGAG GACGCCCGCC 
GTCGAGTTCT TCCAGACGAG GACTCGGGAA CTTGCCCCGC AGCGGCCAGC CGAATCCGGC
AAGACCTGGG GAAAACTGGT CGTAAATCCG GAGACCCCCC GAATCGCGTC GATTTTCCTC
GACGCCGCTG CCCAACTGGC CACTGCGGCC GCCTGCTTCC GGACCGCCCG GCAGGTTCGC
GGGACGGAGC GCCGCCGGCG GTTCGCCCGC CTCGCCGCTG GCGGCGTGCT GGTCGCCAAC
CTCGCGACGG CGGCGACGTT CCTCGCCGGC GGCTTCCTGA CTGGCCGCAT GTCCCCGAGC
TATGCCATCT TCCTCGTTCT CTACGGGGTG GCCCTGGCCG GGCTGCTGTC CCTACCCACC
GCACCGGTGG AAGGCCAGGG CGAGAACACC AGGACCGCGC GACACAGCGG ATACCGCTGG
CATACGATCA CCCTGCTGGA CTGTGTGCTG ATCGTCGGCT CGATCATCCT GCTGGCGTGG
GGGACGACTC TCGCCGCGCT CGTCCCGGCC GGCCCGCCCG ACCCCATCCA GCTCTTGGCC
GCCCTGCTCC ACCCGGTCGC CAGCCTGATC CTCGCCACGG CGGTGCTGCT GATCACCTGC
TTCCGCCGGC CGCGTTCCCC GGCGGCGTCG GTACTGCTCA GCACGGGCCT GCTGATCGAC
GGACTCACCA GCAACATCTC CGTCTACAGC ACCGCGCCGG ACAGCCACCA CCTTCCCCCC
TGGGCCATGA TCGGATTCAC CCTCGCCTTT ATGATGATCT TCCTTGCCGC TCTGCTACCG
GCGCGCACGC ACCCGGACAA CCCCACACCC GACAGCCTCA CGACCGACAG CCTCACGACC
GACAGCCTCA CCCCGGGCGG CCCCCGGGCG GTGTGGGCGC ACGCCGTGCT GCCTTACGCC
GTGCTCGGTG CGGCCGGCCT GCTGATCCTG GGCAAACTGT CGACCGGTGC GCAGCTCGAC
CGGTTCGAGG CGTACGGCAT GGTCGGACTG CTGCTGGTGG CGCTGGTCCG GCAGATGGTC
ACCCTGGCGG AGAGTAACCG ACTGTTGGCC GAGGTACGCC AGCGCGCACG GCAGCTTCAC
CATCAGGCGT TCCACGACCC GTTGACCGGC CTGGCGAACC GGGCGCTGTT CACTCGGCGG
CTGCAACGAG CCCTCACCCA CGACCCCGAC AACCCCGACA ACCCCGACGC CTGCCACGCC
GGCGACTCCG GGACGGTGTC CGTCCTGTTT TTGGATCTCG ACGGGTTCAA GCTGGTGAAC
GACACGCTCG GCCATGCCGC CGGGGACGAA CTCCTCAAAA TCAGCGCGGA GCGACTGCGG
GCGGACACCC GCGCGGTGGA CACCGTCGCC CGGCTCGGCG GGGACGAGTT CGCCATCATT
CTCGGCAGCG GCGGCGTCGA CGCCCGGAAG GTGGGTGAGC GGCTCGCGAT GGCAGTCCAG
GAGCCGTGCC TGCTGGCCGG GCAGATCTAC ACCCCGCGGG CCAGCTTCGG CCTAGTCACG
CTCGACGGCT CCACCACCCG GCCGGCCAGT CCCGACAGCC TGCTGCACCA GGCCGACCTG
GCCATGTACG CGGCCAAGCG GGAACGCGCG GGCAGGCTGG TCGTCTACCG GCCGGAGCTG
TCCGCCCTCC CCGAGCACCG CCACCCGGAC CCACCCAGCC AGGACGACCC GGTGGGCGCC
TACGGCCGAG TAGTAGGCCT GATGTGA
 
Protein sequence
MTPVYPITHD RWYTLARTPA VEFFQTRTRE LAPQRPAESG KTWGKLVVNP ETPRIASIFL 
DAAAQLATAA ACFRTARQVR GTERRRRFAR LAAGGVLVAN LATAATFLAG GFLTGRMSPS
YAIFLVLYGV ALAGLLSLPT APVEGQGENT RTARHSGYRW HTITLLDCVL IVGSIILLAW
GTTLAALVPA GPPDPIQLLA ALLHPVASLI LATAVLLITC FRRPRSPAAS VLLSTGLLID
GLTSNISVYS TAPDSHHLPP WAMIGFTLAF MMIFLAALLP ARTHPDNPTP DSLTTDSLTT
DSLTPGGPRA VWAHAVLPYA VLGAAGLLIL GKLSTGAQLD RFEAYGMVGL LLVALVRQMV
TLAESNRLLA EVRQRARQLH HQAFHDPLTG LANRALFTRR LQRALTHDPD NPDNPDACHA
GDSGTVSVLF LDLDGFKLVN DTLGHAAGDE LLKISAERLR ADTRAVDTVA RLGGDEFAII
LGSGGVDARK VGERLAMAVQ EPCLLAGQIY TPRASFGLVT LDGSTTRPAS PDSLLHQADL
AMYAAKRERA GRLVVYRPEL SALPEHRHPD PPSQDDPVGA YGRVVGLM