Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7278 |
Symbol | |
ID | 5675579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8885727 |
End bp | 8889011 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641246115 |
Product | adenylate/guanylate cyclase |
Protein accession | YP_001511503 |
Protein GI | 158318995 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.137597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGTGGC ATAGGTCACT GCCGGCGCGG CGCGTGTCCC CATCCCCTGC CGCGCCGCAC CGCGATCGGC CTACGATCCC GCACATGGTG CCGTGCCCGA CATGCGCGGA GGAGAACCCG AAGCGAGCGC GCTTCTGCTC CGAGTGCGGG AACCCGCTGC CCGTCGCGGG CAATGGGTCG CGCAAGACAG TGACCATCAT GTTCGTCGAC ATAACCGGTT CGACGGACAT CGGTGAGAAG ATCGACTCCG AGCCGCTGCA GCAGGTCATG TGGCGGTTCT TCACGACCGT GCGGGAGGTC ATCCACGCCC ATGGCGGCAC GGTCGAGAAG TTCATCGGGG ACGCGGTGTT CGCCGTGTTC GGTATCCCGG TGCTGCACGA GGACGACGCG CTGCGCGCCG TGCGGGCCAG CCTGGACATC CGCGCGGCGA TGGAGGAGCT CAACGCCGAC CTCCAGCGCG AGTGGGGGCT GCGGCTGCGG GTGCGCATCG GCATCAACAC CGGCGAGGTG ACCGTCGCCG GCGGCGGCGC GACCGGCGAC CCGGTGAACG TCGCCTCCCG GCTGGAGGAG GCGGCGCCAC CCGACGAGAT CCTCATCGGT GACAGCACGT ACCGGTTCAT CCGGCACAGC GTCACCGTCA GCTCGGTGGG CCCGCTGGTC GTGCAGGGCA AGCGGGAGCC GGTGCGGGCG CACCGCCTGA TCGGGCTGGT CGACCCGACC GCGGGCCCGG GCAGCCGGGC CCCCACCCAG GGCTTCGCGG CGCCGGTGAT CGGCCGAAAC CGGGAGCGCC GCCGCCTCCA GGACGCCTTC GAGGCCGTCG TCGAGGAGCG GACCTGCCAC CTGTTCACCG TGCTCGGCCC GGCCGGCATC GGAAAGTCGC GGATGGTCGG CGAGTTCTGC CAGGCCACCG CGTCGCGCGC CACCGTGCTC ACCGGCCGCT GCCTGTCCTA CGGCGAGGGC ATCGCCTACT GGCCGCTGAT GGAGATGGTC CGCCAGGCCA CCGGCATGTC CGCCGACGAG AGCGCGCCGG ACGGCCGCCT GCGCCTCAAG GAACTGCTGG CGGACGTCCC GCAGGCCGCC GAGGTGGTCG ACGCGCTCGC CCCGCTGGTC GGACTCGGCG GCGCGGAGCT GGGCACCCAG GAGAGCTTCT GGGCGGTCCG CACCTTCTTC CAGGCCCTGG CCGAGAAGCG CCCGCTGGTG CTGTGGTTCG ACGACGTCCA CTGGGCCGAG CCGACCCTGC TGGACCTGAT CGAGCACGTC GCCGACTGGT CGCGGGACGC GCCGATCCTG CTGCTGTGCC TGTCCCGCCC GGAGCTGCTG GAGGACCGGC GCGAGTGGGG CGGCGGCAAG CTGAACGCCA CGTCGATGCT GCTGGCGCCG CTCACCGAGG CACGCTGCCA CCGGCTGATC CGCACCCTGA TGGGCTCGGA CGACCTCGAC CCCGCGCTCG TCTCGCGGAT CACCACCTCG GCGGCGGGCA ACCCGCTGTT CGTCGAGCAG ATGGTCGCCG CGCTGGTCGA CGACGGCCTG CTGCGCCGGG AGGGCTCCCG CTGGATCGCC ACCGGGAACC TGCGCGGGGT CACGGTGCCG CCGACGATCT CCGCGCTGCT GGCCGCCCGA CTCGACCGCC TCGAGCCCCC GGAGCGCCGG GTGCTCGAGC GCGCCGCCGT GGTCGGCGAG CGGTTCTACC TGGACGCCGT CGCCGAGCTC TCCGACCCGG CCGAGCGCCC GTACGTCACG GAGCACTGCC TGTCACTGGT CCGCAAGGAG CTCGTCCACC CGGACCGCTC GGACCTGCCC GGCGTCGACG CCTACCGGTT CCTGCACGTC CTGCTGCGCG ACTGCGCGTA CCAGGCGACG TCCAAGCGCC AGCGCGCCGA CCTGCACCAG CGTTTCGCCG GCTGGTTGCA GGCGCGGATG GTCGGCGGGC CGGGTGAGCA CGACGAGCTG GTGGGCTACC ACCTCGAGCA GGCCTACCGT TACCGGGTCG AACTCGGGCA GCGCGACAAC GAGATGATCG AACTCGGCCG GGCGGCGGCC ACCTGCCTGA TCGAGGCGGC CGACCGCGTC CGCCAGGGTG ACGAGACCGG CGCCGCCCAG CTCCTCAAAC GCGCGATCAC GCTGCTGCCG GAGACCGACC CGCTGCGGCT GCGGGCCGAG ATCGACCTCG GCTGGGCGCT GTACTCCTTC GGCCGCCTCT CCGACGCCGA GCGCATCCTG CGGCAGGTCA CCGAGCGGGC GCGGCGCGCC GGCGAGGACG GCCTGCGGGC GCACGCCCGG CTCGCCCAGC TGCGGGTGCT GTTCGCGACC GACCCCGAGG GCCTGGTCGC CAGCACGCTC ACCGAGGCCA CGGCGGCGCT GGCCGCGTTC GTCCGGGACG GTGACGACGT CGGCGCGGCG CTGGCCTGCC GCAGCCAGTC GGCGGCGTAC GGCGCGGCCG GCCAGTTCGC GGCCGCGGAG AAGGCCATGG AGACCGCGGT GCGCCACGCC GAGGAGTCCG GGGTGCCCAG GGCCGCGCAC TCGCTGCGGC GCGAGCAGGT CCTGCTGCTG AGCCTCGTGC CGCGCCCGGT GCCCTACGCG ATCGCGAAGG CGACCAGGGC GCTGGAGGCG GCCGGCGACG ACCGCGGCCT GCGTCGGGCG GTGCTCGCGC AGCTCGCGCT GCTCAACGCC ATGTCCGGTG ACCTGGAGGC GGCGCGCACC CACCTCAAGG ACGCCGAGGA GATCGTCGGG GACATCCGCG GCGCCCGCAC GGACCCGTTC CACAGCGGCT TCGTGGTGGC CCGCGTGGCG CTGCTCGCCG ACGAGCCGGA GATCGCCGAG CGCGAGCTGC GCCGCAGCTG CCGCCAGCTC TCCCGGATGG GCGAGCGCGC GCACCTGGCC ACCCGGGCCG CCCTGCTGTC GGACGTGCTG GTCCGGCGCG GCAAGCTGGA GGAGGCGCAG AAGTACGTCA ACCGGTGCCG GGACGCCGCC GCCGGCGACC AGCTCCCCGC CCAGGCGGGC TGGTGCTCGG TGCATGCCAA GCTGCTGGCG ATCTCCGGGC GGGACGCCGA GGCGCTGCGC TTCGCCGACA CCGCGTGCGA CCTGGCCGGC CGCACCGACG ACATCGACGG GCAGGGCAAC GCGCTGCTCG CCCGCGCCGA GGTGCTCTAC CGGGCCGGGC GCAAGGACGA CGCCGCCGAG AGCCTGGAGG CGGCCAGCGC CCGGTTCCTG CAGAAGGGGA ACCGGGCGGC GGCGGCCACC AGCCGACGGC TGTTCGAGAC CCTCGACGCC GAGCGCTTCG GCTGA
|
Protein sequence | MMWHRSLPAR RVSPSPAAPH RDRPTIPHMV PCPTCAEENP KRARFCSECG NPLPVAGNGS RKTVTIMFVD ITGSTDIGEK IDSEPLQQVM WRFFTTVREV IHAHGGTVEK FIGDAVFAVF GIPVLHEDDA LRAVRASLDI RAAMEELNAD LQREWGLRLR VRIGINTGEV TVAGGGATGD PVNVASRLEE AAPPDEILIG DSTYRFIRHS VTVSSVGPLV VQGKREPVRA HRLIGLVDPT AGPGSRAPTQ GFAAPVIGRN RERRRLQDAF EAVVEERTCH LFTVLGPAGI GKSRMVGEFC QATASRATVL TGRCLSYGEG IAYWPLMEMV RQATGMSADE SAPDGRLRLK ELLADVPQAA EVVDALAPLV GLGGAELGTQ ESFWAVRTFF QALAEKRPLV LWFDDVHWAE PTLLDLIEHV ADWSRDAPIL LLCLSRPELL EDRREWGGGK LNATSMLLAP LTEARCHRLI RTLMGSDDLD PALVSRITTS AAGNPLFVEQ MVAALVDDGL LRREGSRWIA TGNLRGVTVP PTISALLAAR LDRLEPPERR VLERAAVVGE RFYLDAVAEL SDPAERPYVT EHCLSLVRKE LVHPDRSDLP GVDAYRFLHV LLRDCAYQAT SKRQRADLHQ RFAGWLQARM VGGPGEHDEL VGYHLEQAYR YRVELGQRDN EMIELGRAAA TCLIEAADRV RQGDETGAAQ LLKRAITLLP ETDPLRLRAE IDLGWALYSF GRLSDAERIL RQVTERARRA GEDGLRAHAR LAQLRVLFAT DPEGLVASTL TEATAALAAF VRDGDDVGAA LACRSQSAAY GAAGQFAAAE KAMETAVRHA EESGVPRAAH SLRREQVLLL SLVPRPVPYA IAKATRALEA AGDDRGLRRA VLAQLALLNA MSGDLEAART HLKDAEEIVG DIRGARTDPF HSGFVVARVA LLADEPEIAE RELRRSCRQL SRMGERAHLA TRAALLSDVL VRRGKLEEAQ KYVNRCRDAA AGDQLPAQAG WCSVHAKLLA ISGRDAEALR FADTACDLAG RTDDIDGQGN ALLARAEVLY RAGRKDDAAE SLEAASARFL QKGNRAAAAT SRRLFETLDA ERFG
|
| |