Gene Franean1_0329 details

Gene Information

Locus tag: Franean1_0329
Symbol:
ID: 5668753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 392138
End bp: 395995
Gene Length: 3858 bp
Protein Length: 1285 aa
Translation table: 11
GC content: 76%
IMG OID: 641239260
Product: TPR repeat-containing adenylate/guanylate cyclase
Protein accession: YP_001504701
Protein GI: 158312193
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
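
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(392138, 395995)
print(length, protein_length_aa(length))
```

With the values listed here this gives 3858 bp and 1285 aa, which agrees with the Gene Length and Protein Length fields.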


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.902018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.090123
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGATCA GCTCCCCCAC GACCGGTCCC CTGGTCTGCC CCAGATGCGG CACCCCGACG 
GTGCCCGGCG GGCGGTTCTG CTTCAACTGC GGGCTCCGGT TCACCCAGAC CGACACGCCG
CGCGCCGAGG CCGCCGAGCG CCGGGTCGTC ACCGTCCTGT TCGGTGACCT GTCCGACTTC
ACCGCCTGGG CGGAGGACCG CGACCCCGAA CGGGTCGGCG AGGTCACCGA CCGGGTCCTC
GCCGCGCTCG CGCGCGAGGT CGACGAGGTC GGCGGGCGGG TCGACAACCT CAGCGGTGAC
GGAATCATGG CCGTCTTCGG CGCCCCGACG GCGCACGAGG ACGACCCAGA ACGCGCCGTG
CGCGCCGCGG CGGCGATGCA GGAGACGGCC CGTCGCCTGA TCAAGGACGA GGGCGGCGAC
AGCAGCCCGC TCGGCCTGCG GGTCGGCGTC AACACCGGCG AGGTGCTCGC CGGTGTGCAG
GCGGCGCTGT CGTACACGGT CATCGGGGAC ACGGTGAACA CCGCCGCGCG GTTGTCCGGC
GCGGCCGCGG TGGGCACCGT CTACGCCGGC CGGGACACGG TCACCGCCAC CCGCTCCGTC
GCCACCTGGC GCGATCTGCC GCCCCTGGTG CTCAAGGGCA AGCGCGAGCC GGTGCCCGCC
TACGAGCTGG TCAGCCTGCG TCCGTCCAAC ATCGCCCGGC CCGGCCTGGC CGACGACGCG
ACGTTCATCG GGCGCGAGCA GGAGCTGGCC GAGCTCAGCC GCTACTTCGC CGGCGTGGTC
GACGCCGGCC AGCCCGACAT GGCGGTGGTC ATCGGCGAGG CCGGCATCGG CAAGACCCGG
CTGGCCCGGG AGCTCGCCGA GACCGCCGAG GCGACCGCGG GCGCGACCGT GCTGTGGGGG
CGCACCGTGC GCCACGGCGA CGTCCGCGAC CTCGCCCCGC TGATCGACCT CATCCGCGGG
GCGTGCGGCA TCACCGACGG CGACACCCTC GACCGGGTCG CCGACCGGGT ACGCCGCACG
GTCGCCGGCC TCACCCATCC GATCTTCGAA ACGAAGCTGC CGAGCGGCAT CGGAGACCGC
CTGCTCGGGA TGCTCGGCGT GCCGCCGGAC CGCCGTCCCA GCGCCGGGGC GACCACCCCG
CCGGGTGACC CGGTCGGCCG GGCCTCCCAC GGCGAGGCGG TCCAGCGGGC CGCCGACGAC
GCCGTCGAGA CGGTCGGCCT CCTGCTGAGC GCGCTGGCGG TCCGCAACCC GGTGCTGGTC
GTCGTCGACG ATCTGGAGTG GGCGACCAGC CGGCTGACGA CGGCCCTGCG GCGGCTGACG
ACGACCCTCA GCGGCCCGGT GATGCTCGTC CTGCTCGACC GGGAGAGCCC GCAGTTCTCC
GATCTCGTCG GCGCGCGCCG CATCGCGCTC AGTCCGCTGC CGGAGAGCGC GGCGCGTGCC
CTGCTCAAGG ACTACCTGGG CGAGCCCGGC CTGTCCCGGG CGACCCAGGC CGAGCTGCTG
GCCCGGGTGC ACGGGAACCC GTTCTTCCTG ACCGAGCTGC TCAACCTGCT GGTGGACCGC
GGTCTGCTGC ACCAGGTGCC CGACCCTGGC GGCGAGGGGA TGCGCTGGGT GCTGGAGGGA
TCGCTGGCCG CGACCGTCCT GCCCGCCGGT GTCCAGTCGG TGCTGGCAGC CAGGATCGAC
GACCTGGACG CGGCCGCCAA GGCGGCGCTG CGCGCGGCGG CCGTGCTCGG CCCGCGCTTC
CCCGCCGAGG CGCTCCAGGT GGTCGGCGAG CGCCCGGCGG CCGAGGTCGA ACGCGCGCTG
TCGGTGCTCA CCGAGCGCCA GCTCGTCCGC CCACCGCGGC AGGGCGAGAC CCTGTGGCGG
TTCGTCCACC CGATGGCGCG CGACGTCGCC TACTCGGGCC TACCGAAGGT GGAGCGGGCC
CGCCGGCACG CCACGGCGGC GCGCTGGGGC GTCACGGCAA TGATCGGTTC GTCCCGCGAG
GTGTCGACCT TCGTCGCCAC CCACGCACTG CGCGCCTACG ACCTGGCCGC CTCGATGGCG
CTGCCCGCGG GCGACCCGGC CTGGTCGGCG CGGGAGGCCG GCTTCCACGC GCACGTCCGG
CTGGCCCGCT CGGCGCTCGC CCGCGACGAC CACCGGGCCG CCGCGGACCT GCTCGCCGAC
GCCCGCCGGC TCGGGCGGGG CGTCATCGAC GGCGACGACG ACATCAACGT GCGGATCCTG
CACGCCGAGG CGCTGGTCTG GCTGCGCCGG CTCGACGAGG CGGAGCGGAC GCTGCGCCCG
GCGCTGCGGG TGAACGTCCC GAGCCGGCGG GCCGCCGCCT ACGCGGTGCT CGGTGAGCTA
CGGCAGAAGC AGGGCCGTGG CGAGGAGGCC CGCCAGTCGC TGATCACGGC GCTGGAGGCT
GCGCACCGGG CCGGCGACGA CCGGGCGGTG GCCGCGGCGC TGCGCCGGCT CGGCCTGCTC
GAATACGCCG CCGGCCGCAT CCAGGCCGCG GAGGAGCGCT ACCGCGAAGC ACTCTCGCTC
GCCCGCCGCG TCGACGACCC CCGCGGGGTC GGCTGGGCGC TGCAGCACCT GGCGTGGAGC
GCCACCACCC GCGGGGACTA CGCGAAGGCC GAGCGCACGC TGCGCGAGGC GTCCGCGGTG
TTCGAGCGGC TGGAGGACGC CGGCGGGCTC GGCTGGTGCT CGGGCACCGA GGCCCTCGTC
CTGCTGCTCT CGGGTCAGCT CACCCGGACC CGCAAGGTCT CCCGCGTGCT CATCAACCTC
GCCGAGTCGA TGCGCGAGCG CTGGGGGCAG GCGGTCTGCC TGACGATCGA CGCGGTCGCC
GCGGCCGAGC TCGGCGACAT CGAGACCGCC GAGAGCGAGG CGGCCCGCGC GGCGGAGCTG
TTCACCGAGA CCGGCGACAG TTGGGGGCGC ACCCTTACCC TCATCGCGCG CGGGCTGGCC
GCCCGTGGGG CTGGCCGCCC CCGGCGCGGT GCGGACCTGC TGACCGAGGC GTGCGCGGAG
GCGGCGTCCA CCGGGCATGT GCTCATCGGC GCCTTCGCCC AGGTGCTGCT CGGCCTGACC
AGGCTGGACG CCGGCGAGGT GGATGCCGCG GAGGAGGCCG CCCGGCGCGC CCTGGCCGAC
CTGGACCGGC TGGAACTGCG CCCGCACGCC CAGCTCGGGG CGAAGGTGCT CGTCGCGCAG
ATCGCGCGGG CGCGCGGCCG CCTCGACGAG GCGATCACCG AGCTGCGGGG GGCCCTGTCC
GCCAGCGAGC CGGCGACGCT GATGTTCCCG CGCCGGCAGG CCTACGCCCA CCTCGCAGGC
ACCCTGCTCG ACGCGGGCGA GCCCGACGAG GCCCTGCGGG TGGCACGCCA GGCGGTCGGA
GTGGACGCCG AGGACGTCCG GGCACAGGTA CTGGCATACC GGGCGCTGGG GACGGTCCTG
GCGGCGCACG GTGACGTCGA GGGCAGCCGC GAGGCCTACG AGCAGGCCCT GGCCGCCGCG
ACGGCCACCG GCGCGATCAG CGAGGCGCCG CAGACCCGGC GGCTGCTCGC GGCCCTGCCG
GGATCCGCCC CGGACGCCCT CGACGCACCG GACGGGCTCG ACGCAGCAGA CGGACTCGGC
GCAGCAGACG GACTGGGCGT GGCGGACGGG CTGGGCACAC CGCACCAGGA CGGGCTGGCC
GCCCAGGATG ACCTGAGCGC ACAGGACGGG CTGGGCGCGG CGGCCGCGCT GGCGAAGGTG
AGCGTGAACG GGACGAGGGC GCGGGCCAGG ACGGCGGCGC GTCTGCCGGG ATCACCGGTG
GTCGTGCCGG TGGCCGACCC GTCCGGTGAG GCCGTGCCCT CGGTGGAGAT GCCGCCCGCC
GAGGGCACCA GCCGGTAG
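
A GC content figure like the one in the Gene Information fields can be recomputed from the sequence text. The sketch below is one possible way to do that in Python; it assumes GC content means the fraction of G and C among the A/C/G/T characters, and the function name is hypothetical. How the database itself computes or rounds the value is not stated on this page.

```python
def gc_content(seq: str) -> float:
    """Fraction of G or C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    10-base blocks can be pasted as they appear in the record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence, used only as an example input.
first_row = "ATGGTGATCA GCTCCCCCAC GACCGGTCCC CTGGTCTGCC CCAGATGCGG CACCCCGACG"
print(f"{gc_content(first_row):.1%}")
```

Passing the full gene sequence instead of the first row would give the figure to compare against the GC content field.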
 
Protein sequence
MVISSPTTGP LVCPRCGTPT VPGGRFCFNC GLRFTQTDTP RAEAAERRVV TVLFGDLSDF 
TAWAEDRDPE RVGEVTDRVL AALAREVDEV GGRVDNLSGD GIMAVFGAPT AHEDDPERAV
RAAAAMQETA RRLIKDEGGD SSPLGLRVGV NTGEVLAGVQ AALSYTVIGD TVNTAARLSG
AAAVGTVYAG RDTVTATRSV ATWRDLPPLV LKGKREPVPA YELVSLRPSN IARPGLADDA
TFIGREQELA ELSRYFAGVV DAGQPDMAVV IGEAGIGKTR LARELAETAE ATAGATVLWG
RTVRHGDVRD LAPLIDLIRG ACGITDGDTL DRVADRVRRT VAGLTHPIFE TKLPSGIGDR
LLGMLGVPPD RRPSAGATTP PGDPVGRASH GEAVQRAADD AVETVGLLLS ALAVRNPVLV
VVDDLEWATS RLTTALRRLT TTLSGPVMLV LLDRESPQFS DLVGARRIAL SPLPESAARA
LLKDYLGEPG LSRATQAELL ARVHGNPFFL TELLNLLVDR GLLHQVPDPG GEGMRWVLEG
SLAATVLPAG VQSVLAARID DLDAAAKAAL RAAAVLGPRF PAEALQVVGE RPAAEVERAL
SVLTERQLVR PPRQGETLWR FVHPMARDVA YSGLPKVERA RRHATAARWG VTAMIGSSRE
VSTFVATHAL RAYDLAASMA LPAGDPAWSA REAGFHAHVR LARSALARDD HRAAADLLAD
ARRLGRGVID GDDDINVRIL HAEALVWLRR LDEAERTLRP ALRVNVPSRR AAAYAVLGEL
RQKQGRGEEA RQSLITALEA AHRAGDDRAV AAALRRLGLL EYAAGRIQAA EERYREALSL
ARRVDDPRGV GWALQHLAWS ATTRGDYAKA ERTLREASAV FERLEDAGGL GWCSGTEALV
LLLSGQLTRT RKVSRVLINL AESMRERWGQ AVCLTIDAVA AAELGDIETA ESEAARAAEL
FTETGDSWGR TLTLIARGLA ARGAGRPRRG ADLLTEACAE AASTGHVLIG AFAQVLLGLT
RLDAGEVDAA EEAARRALAD LDRLELRPHA QLGAKVLVAQ IARARGRLDE AITELRGALS
ASEPATLMFP RRQAYAHLAG TLLDAGEPDE ALRVARQAVG VDAEDVRAQV LAYRALGTVL
AAHGDVEGSR EAYEQALAAA TATGAISEAP QTRRLLAALP GSAPDALDAP DGLDAADGLG
AADGLGVADG LGTPHQDGLA AQDDLSAQDG LGAAAALAKV SVNGTRARAR TAARLPGSPV
VVPVADPSGE AVPSVEMPPA EGTSR
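
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, standard-library-only illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted; alternative start codons are not handled here, which is an assumption that holds for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGTGATCA GCTCCCCCAC GACCGGTCCC"))  # MVISSPTTGP
```

The output matches the first block of the protein sequence, and translating the final row of the gene sequence (`GAGGGCACCA GCCGGTAG`) yields `EGTSR`, the last five residues listed.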