Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3013 |
Symbol | |
ID | 3904366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3574965 |
End bp | 3579521 |
Gene Length | 4557 bp |
Protein Length | 1518 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880333 |
Product | glutamate synthase (NADH) large subunit |
Protein accession | YP_482099 |
Protein GI | 86741699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0069] Glutamate synthase domain 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0945186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGTG CGCAAGGTCT TTACGACCCC ACCTTCGAAC ACGATGCCTG TGGCGTCGGC TTCGTCGTCG ATGTACACGG CAGGCGCAGT CACGAGCTGG TCGAGCAGGG CCTGACCGTG CTGCGTAACC TCGACCACCG GGGTGCCTCG GGCAGCGACC CGGACACCGG CGACGGAGCC GGCATCCTGG TGCAGGTCCC CGACCTGTTC CTGCGTGACG TCGTCGACTT CACGCTGCCC GCCCCGGGGC GTTACGCCGT CGGGATCGCC TTCCTGCCGC AGGTCTCGGG GGAGCGGGAC GAAGCCGTGC GCACCATCAG CCGCATCGTC CGCCAGGAGG GGCTGCGGGT CCTCGGCTGG CGTGAGGTGC CGGTCGTCAG CCACATCGTC GGGCACGCGG CCCACGAGGT TGAGCCGCGG ATGCGGCAGC TGTTCCTCGC ATTGCCCGGC AGTCTGCCCG CCGCCGGTCC GGTTGAGGGC GGCGCGGGGA ACGGGTTCGA CCAGGCCGAC CTCGAACGTC GGGCGTTCTG CGCCCGCAAG CGGATCCGCC GGGAGACCGG CGTCTACCTG GCGTCGCTGT CGTCGCGGAC CCTGGTCTAC AAGGGGATGC TCACCACCCA CCAGCTCTCG GCCTACTTTC CCGACCTCGA CGACCCCCGG TTCACCAGTG CCATCGCGCT GGTGCACAGC CGGTTCTCCA CGAACACCTT CCCGAGCTGG CCGCTGGCGC ACCCGTACCG GTTCGTCGCC CACAACGGGG AGATCAACAC TGTCCGCGGG AACCGGAACT GGATGCGGGC CCGCGAGGCG CTGCTCGCCA GCGACCTGAT CCCCGGTGAC CTCTCCCGGC TGTTCCCCGT CTGCGCGGAC GGGGCGAGCG ACTCGGCGAG CTTCGACGAG GTCCTGGAAC TGCTGTACCT GGGCGGTCGC AGCCTGCCGC ACGCGGTGCT GATGATGATC CCCGAGGCGT GGGAGAACCA CACCGAGATG GACCCGGCGC TGCGCGCCTT CTACCAGTTC CACTCCACGC TGATGGAGCC GTGGGACGGC CCGGCCTCCA TCGCGTTCAC CGACGGGACG GTCATCGGGG CGGTGCTCGA CCGTAACGGG CTGCGGCCCT CGCGGTACTG GGTCACCGAC GACGGCCTGG TCGTGATGGC CTCCGAGGTG GGCGTGCTCG ACATCCCGCC GCACAAGGTG GTGCAGAAAG GCCGGCTCCA GCCGGGGCGG ATGTTCCTGG TGGATACCGC CGAGGGGCGC ATCGTCAGCG ACGAGGAGAT CAAGTCCGAG CTCGCGAACG CGGCGCCGTA CGCGGAGTGG TTGCACGCCG GTCTGATCGA GCTGGACGAC CTGCCCGAGC GGGAGCAGCT CCTCTACGGG CACTCCTCGG TGCTGCGTCG CCAGCAGGTG TTCGGCTACA CGCTGGAGGA GCTGCGCGTG CTCGTCGCGC CGATGGCGCG CACGGGCGCG GAGCCGATCG GCTCGATGGG CACCGACACG CCGGTGGCGG TGCTCTCCAG CCGTCCCCGG CTGCTGTTCG ACTACTTCAC CCAGTTGTTC GCGCAGGTCA CCAACCCGCC GCTGGACGCG ATCCGCGAGG AGTTGGTGAC GAGCCTGGGC CGGGTGCTCG GCCCGGAGGG CAACCTGCTC GCGCCATCCC CGGCGTCCTG CCGGATGGTG CACCTGCCCT ATCCGGTGAT CAGCAGCAGC CAGCTTTCGA AGATCATCGG CATCAACGAC GACGGTGACA TGCCCGGCTT CGCCTCGGTG ACGGTCCGTG GCCTGTATGA CGTCGACGGG GGCGGCGCGG CGCTCGCCGC CCGGCTGGCG GAAATCTGCG CCGGGGTCAG CGAGGCCATC GCCGACGGTG CCCGCATCGT CGTGCTCTCC GACCGGGACT CCGACACGCG CAAGGCCCCG ATCCCGTCGC TGCTGCTGAC CGCCGCCGTG CACCACCACC TGATCCGGGA GCGGACCAGG ACGAAGGTCG GCCTCATCGT CGAGTGCGGC GACGCCCGTG AGGTGCACCA CATCGCGCTG CTCACCGGCT ACGGCGCCGC GGCGGTCAAC CCGTACCTGG CATTCGAGTC GATCGACAGC CTCATCGCCG AGGGCGAGAT CGTCGGGGTG TCCCGGGAGC AGGCCGAGAA GAACATGATC AAGGCGCTCG GCAAGGGCGT GCTGAAGGTG ATGTCCAAGA TGGGCATCTC GACGGTGGCG AGCTACACCG GTGCCCAGGT CTTCGAGGCG ATCGGGCTGT CCCAGGAACT GATCGACGCC TATTTCGCGG GTACCCCGTC GCGCCTCGAC GGCATCGGAC TCGACGTCAT CGCGGCGGAG GTGGCAGCGC GGCACCGGCG GGCCTACCCG CCGATCGCCT CCGAGCACGC GCACCGCACG CTGGAGGTCG GCGGCGAGTA CCAGTGGCGC CGCGAGGGTG AGCTGCACCT GTTCAACCCG GAGACCGTCT TCCTGCTCCA GCACGCCACC CGCACCCGGC AGTACGAGAC CTTCCAGAAG TACACCGCCC GGGTCGACGG TCTGTCCCGG GAGAACGCGA CGCTGCGGGG TCTGTTCGAG CTGCGTACCG GGGTGCGCAC GCCGGTGCCG ATCGAGGAGG TCGAGCCGGC CTCGGAGATC GTCAAGCGGT TCGCGACCGG TGCCATGTCC TACGGCTCGA TCAGCGCCGA GGCGCACGAG ACGCTGGCTA TCGCCATGAA CCGGCTCGGC GGGAAGTCGA ACACCGGCGA GGGCGGCGAG GACGCGGAGC GGTTCACGCC GGACGCGAAC GGGGACCTGC GCCGCTCCGC GGTCAAGCAG GTCGCCAGCG GCCGGTTCGG GGTGACCAGC GAGTACCTCG CCAACGCCGA CGACATCCAG ATCAAGATGG CCCAGGGGGC GAAGCCCGGG GAGGGGGGGC AGCTTCCTGG CCACAAGGTC TACCCGTGGA TCGCGAAGAC GCGGCACTCG ACGCCCGGTG TCGGGCTCAT CTCCCCGCCG CCGCACCATG ACATCTACTC GATCGAGGAC CTCGCCCAGC TCATCCACGA CCTGAAGAAC GCCAATCCCA AGGCCCGGGT GCACGTCAAG CTGGTCGCCG AGGTCGGCGT CGGCACGGTC GCCGCCGGGG TCTCCAAGGC GCACGCCGAT GTGGTGCTGA TCTCCGGGCA CGACGGCGGG ACGGGCGCCT CGCCGCTGAC CTCGCTGAAG CACGCGGGTG CCCCCTGGGA GCTGGGGCTG GCCGAGACCC AGCAGACGCT GCTGCTCAAC GGCCTGCGCG ACCGCATCGT CGTGCAGGTC GACGGGCAGC TGAAGACCGG CCGCGACGTC ATCGTCGGGG CGCTGCTGGG GGCCGAGGAG TTCGGTTTCG CCACCGCCCC GCTGGTGGTT GCCGGCTGCG TGATGATGCG CGTCTGCCAT CTCGACACCT GTCCGGTCGG TGTGGCGACC CAGAACCCGG AACTGCGCAG GCGGTTCACC GGCAGGCCCG AGTTCGTCGA GGCCTTCTTC ACCTTCATCG CCGAGGAGGT GCGCACCTAC CTCGCCGCGC TGGGCTTCCG CAGCCTGCAG GAGGCGGTCG GCCGGGTCGA CCTGCTCGAC GCCCGCGCCG CCGTCGACCA CTGGAAGGCC TCCGGCCTCG ACATCACCCC GTTGCTGCAC ACCCCGGAGC GGCCCTTCGG CGGCTCGCTG AACTGCACGT CCAGCCAGGA CCACGGGTTG GACAAGGCGC TGGACAACTC GCTGATCCAG CTGTGCGAGG GAGCGCTCGA CGACGGCCGG CCGGTCTGGC TGGAGATGCC GATCCGCAAC GTCAACCGGA CCGTGGGCAC CATGCTCGGC TACGAGGTGA CGAAGCGCTT CGGCGCGGCG GGGCTGCCCG ACGACACGAT CCAGCTGCGG TTCACCGGCT CCGCGGGGCA GAGCTTCGGC GCCTTCGCGC CCCGCGGCAT GACACTGACC CTCGAGGGCG ACGCGAATGA CTACGCCGGC AAGGGCCTGT CCGGCGGCAA GATCTTCGTC TTCCCGCCGA AGGAGTCCCC GCTGCGGGCC GAGGAGAACA TCGTCGCCGG TAACGTCCTG CTCTACGGGG CGACGGGCGG CGAGGCGTTC TTCCGCGGAA TCGTCGGCGA GCGGTTCTGC GTGCGCAACT CCGGGGCGAC CGCGGTGGTC GAGGGGGTCG GCGACCACGG CTGCGAGTAC ATGACCGGCG GCACCGTGCT GGTGCTCGGG GCCATCGGGC GCAACTTCGC CGCGGGCATG AGCGGCGGCG TGGCCTACCT GTACGACCCG GTCGAGGCGC GGATCAACAC CGAGATGGTG GACGTCGAGG CGCTGGACGA CGCCGACGAG ACGATCGTGC GGGACCTGCT CGTCCGGCAC CGCCGGGAGA CCGGTTCGAC GGTGGCCGCC CGGCTTTACA CCGACTGGGA CACCGTGCGC GGGTCGTTCC GTAAGGTGAT GCCACGGGAC TACAAGCGGG TGCTGACCGC CATCCGGCAG GCGCAGGAGC AGGGGCTCGT CGCCGACGAG GTCATCATGG CGGCGGCTCG TGGCTGA
|
Protein sequence | MPSAQGLYDP TFEHDACGVG FVVDVHGRRS HELVEQGLTV LRNLDHRGAS GSDPDTGDGA GILVQVPDLF LRDVVDFTLP APGRYAVGIA FLPQVSGERD EAVRTISRIV RQEGLRVLGW REVPVVSHIV GHAAHEVEPR MRQLFLALPG SLPAAGPVEG GAGNGFDQAD LERRAFCARK RIRRETGVYL ASLSSRTLVY KGMLTTHQLS AYFPDLDDPR FTSAIALVHS RFSTNTFPSW PLAHPYRFVA HNGEINTVRG NRNWMRAREA LLASDLIPGD LSRLFPVCAD GASDSASFDE VLELLYLGGR SLPHAVLMMI PEAWENHTEM DPALRAFYQF HSTLMEPWDG PASIAFTDGT VIGAVLDRNG LRPSRYWVTD DGLVVMASEV GVLDIPPHKV VQKGRLQPGR MFLVDTAEGR IVSDEEIKSE LANAAPYAEW LHAGLIELDD LPEREQLLYG HSSVLRRQQV FGYTLEELRV LVAPMARTGA EPIGSMGTDT PVAVLSSRPR LLFDYFTQLF AQVTNPPLDA IREELVTSLG RVLGPEGNLL APSPASCRMV HLPYPVISSS QLSKIIGIND DGDMPGFASV TVRGLYDVDG GGAALAARLA EICAGVSEAI ADGARIVVLS DRDSDTRKAP IPSLLLTAAV HHHLIRERTR TKVGLIVECG DAREVHHIAL LTGYGAAAVN PYLAFESIDS LIAEGEIVGV SREQAEKNMI KALGKGVLKV MSKMGISTVA SYTGAQVFEA IGLSQELIDA YFAGTPSRLD GIGLDVIAAE VAARHRRAYP PIASEHAHRT LEVGGEYQWR REGELHLFNP ETVFLLQHAT RTRQYETFQK YTARVDGLSR ENATLRGLFE LRTGVRTPVP IEEVEPASEI VKRFATGAMS YGSISAEAHE TLAIAMNRLG GKSNTGEGGE DAERFTPDAN GDLRRSAVKQ VASGRFGVTS EYLANADDIQ IKMAQGAKPG EGGQLPGHKV YPWIAKTRHS TPGVGLISPP PHHDIYSIED LAQLIHDLKN ANPKARVHVK LVAEVGVGTV AAGVSKAHAD VVLISGHDGG TGASPLTSLK HAGAPWELGL AETQQTLLLN GLRDRIVVQV DGQLKTGRDV IVGALLGAEE FGFATAPLVV AGCVMMRVCH LDTCPVGVAT QNPELRRRFT GRPEFVEAFF TFIAEEVRTY LAALGFRSLQ EAVGRVDLLD ARAAVDHWKA SGLDITPLLH TPERPFGGSL NCTSSQDHGL DKALDNSLIQ LCEGALDDGR PVWLEMPIRN VNRTVGTMLG YEVTKRFGAA GLPDDTIQLR FTGSAGQSFG AFAPRGMTLT LEGDANDYAG KGLSGGKIFV FPPKESPLRA EENIVAGNVL LYGATGGEAF FRGIVGERFC VRNSGATAVV EGVGDHGCEY MTGGTVLVLG AIGRNFAAGM SGGVAYLYDP VEARINTEMV DVEALDDADE TIVRDLLVRH RRETGSTVAA RLYTDWDTVR GSFRKVMPRD YKRVLTAIRQ AQEQGLVADE VIMAAARG
|
| |