Gene Franean1_6440 details


Gene Information

Locus tag: Franean1_6440
Symbol:
ID: 5674755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7831104
End bp: 7832495
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 76%
IMG OID: 641245288
Product: adenylate/guanylate cyclase
Protein accession: YP_001510683
Protein GI: 158318175
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain); [COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain
TIGRFAM ID:
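
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 7831104
END_BP = 7832495
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the span covers 1392 bp, which is 464 codons, or 463 amino acids plus the stop codon.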


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.084335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0956308
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGCTCG CCGACGACAA CCTGATCGTC CGGGAGGGCG TGCGGGCGCT GCTCTCCCGC 
GTCGCCGACA TCGCGGTCAT CGGTGTCGCC GCTGACCACG ACGAGCTGGT CCGCGCCGCC
GCGCGGCTGC GCCCCAGGGT CGTCGTCACC GACATCCGGA TGCCGCCGGA CTTCACCGAC
GAGGGGGTCC GGGCGGCCCA GCGCATCCGG GCGGCGAGCC CGGCGACAGG CATCGTGCTG
CTCTCCCAGT ACGACGAGCC CGACTACGCG ATCACACTGC TGCGTGACGG CCTGTCCGGC
TACGCCTACC TGCTCAAGGA GCGGGTCGCC GACCCCGACC TGCTGGCGCG CGCCATCCGC
GAGGTCGCCG GCGGCGGCTC CATGCTGGAC CCGGACATCG TCGCCGCGGT GATAACGCCC
GTCCGCGAGG CCGGGACGCT GACCCGCGAG CAGGAGGAGC TGCTGCGCGA CGTCGCCCAC
GGGCGCCCCG TGGCCGCCAT CGCCGCCGAC CACGGCGACA CCCCGGCCGC CGTCAACGCG
GCGATCGACG CGCTGCTGCT GCGGCTCGCG CAGGGCGCGA GCTCGGGCCA GGCCGGCGCG
CTGCGCCGCC TGCGCATGCT GCACGAGGCG CTGATGAAGC GTGAGGAACA GGACGTCGCC
CTGACCAGGC TGCTCCCCAG CGGCCTGGCG CAGAAGCTGC GCGACGATGC CACCGCCGCG
AGCCGCGCCG AACGCCTTGA TGTCACCGTG CTCATGTCGG ACGTCCGCGG CTACTCGGGC
ATCGCGCAGC GCACCGACCC GATGGTGCTG GCGCACCAGC TCGACGAGCA CCGCCGGGAG
ATGAACGCGG CGATCCTCGC CGAGGGCGGC ACCGTCCTGC AGTACGCCGG GGACGCGGTG
ATCGCTGTCT TCGGCGCGCC GGACCCGACC AGTGACCACC GCGAGCGTGC GGTGCGCGCC
GCGGCCGCGA TGCACCGGCG CCAACGCGGC CTGGACGAGC GGTGGACCAG GCGCCGGCTG
GAGCCGTTCG GGCTGGGCAT CGGGGTCTCC ACCGGCGAGG TCGCGGCGGC CCTGCTGGGC
TGCGACGCGC ACCGGGAGTA CACGCTGGTC GGGGACACGA TGAACCTCGC GCAGCGGCTC
CAGGCGATGG CGGAGGCGGG TGTCACGATC GTGAGCGCCG CGACCGCGGC GGCGCTGCCC
GCCGACGTCG TCGTCCCCCT CGACCCGCGG CCGGTGAAGG GACGGGTCGG GGACGTCCGG
GCCTACCGCG TCATCGCCGG CGGCGCCTCC GCCCCCATCC CCGTCCCCAC TCCCGTCGTT
CCGGCGCCTG CCCTCACCGC GCCGGTCCCG CGGCCGCGCC GCCGCCGTCT GCTGGCCCGC
TCCGCCGCCT GA
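
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs cleaning before any computation. The sketch below shows one way to strip that layout and compute a GC percentage that could be compared with the GC content field; the helper names are hypothetical, and how the database rounds its reported value is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, for illustration.
    first_two_blocks = "GTGCTGCTCG CCGACGACAA"
    print(gc_percent(first_two_blocks))
```

Passing the full gene sequence text (all lines, spaces included) to `gc_percent` gives the value for the whole gene.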
 
Protein sequence
MLLADDNLIV REGVRALLSR VADIAVIGVA ADHDELVRAA ARLRPRVVVT DIRMPPDFTD 
EGVRAAQRIR AASPATGIVL LSQYDEPDYA ITLLRDGLSG YAYLLKERVA DPDLLARAIR
EVAGGGSMLD PDIVAAVITP VREAGTLTRE QEELLRDVAH GRPVAAIAAD HGDTPAAVNA
AIDALLLRLA QGASSGQAGA LRRLRMLHEA LMKREEQDVA LTRLLPSGLA QKLRDDATAA
SRAERLDVTV LMSDVRGYSG IAQRTDPMVL AHQLDEHRRE MNAAILAEGG TVLQYAGDAV
IAVFGAPDPT SDHRERAVRA AAAMHRRQRG LDERWTRRRL EPFGLGIGVS TGEVAAALLG
CDAHREYTLV GDTMNLAQRL QAMAEAGVTI VSAATAAALP ADVVVPLDPR PVKGRVGDVR
AYRVIAGGAS APIPVPTPVV PAPALTAPVP RPRRRRLLAR SAA
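
The protein sequence corresponds to the gene sequence translated with translation table 11, under which an alternative start codon such as the leading GTG is read as methionine. The following is a minimal sketch of that translation, assuming the NCBI layout of the genetic code (bases ordered T, C, A, G) and the table 11 start codon set; `translate_cds` is a hypothetical helper, not a function of any particular library, and it does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate_cds("GTGCTGCTCG CCGACGACAA CCTGATCGTC"))
```

For the first three blocks of the gene sequence this yields MLLADDNLIV, matching the first block of the protein sequence.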