Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4065 |
Symbol | |
ID | 3907026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4864790 |
End bp | 4868008 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881394 |
Product | glycine dehydrogenase |
Protein accession | YP_483144 |
Protein GI | 86742744 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGACG TCACCCAGTA CGACGACACC CCCTATGCCG ATGGACCGGC CACGGCTGCC GATGGACCGG CCACGACCGC CACGACCGCC ACCATCAGTC CCGCGTCGTC GCACGCGCGG TCGGGGGCGC CGCGACAGGG CAGTGCCGCG GCGGTGGGCA GGAACGGTGC GAGGCGGCTG CCCGCCGCGG CCATCCCGCG GTTCGCCGAT CGGCACATCG GCCCGGACCC GTCATCCCAA CGGGAGATGC TGGATGCGCT GCGGGTGGAA TCACTGGCCG CGCTCACCGA CGCGGCCGTC CCGGCAAGCA TCCGTGACCA TGATCTGGAT CTTCCCGCGG CGCTGAGCGA GCCCGCGGTG CTCGCGGCCC TACGCGCGTT CGGTAGCAGG AACCGACCGG TTTCCTCGAT GATCGGGCTG GGATTCCATC CCGCGGTGAT GCCGGGAGTC ATCCAGCGCA ACGTCCTGGA GAACCCCGCG TGGTATACGG CGTACACGCC GTACCAGCCG GAGATCTCCC AGGGCCGCCT GGAAGCGTTG CTCAACTTCC AGACGATGAT CACTGATTTG ACCGGCCTTG CGGTGGCCGG AGCCTCCCTG CTGGACGAGC CGACCGCGGC GGCCGAGGCG ATGCAGATCG CGTTTCGGAC GGCGAAGGGT TCCCGCGCGA CGTTCCTCAT CGATGCCGAC ACCCTCCCGC AGACCGTCTC GGTGGTCGCG ACCAGGGCGG AGGCGCTGGG GATCAATGTG GTGGTCGCGG ATCTCGCAGC CGGACTCACG GCCGTGGGCC CGGCGCAGCT GGATGCGGCC TTCGGCCTGC TGCTGTCCTA TCCCGGGCCG GCGGGCGTCC TGCGGGATGT GCGTGCTGTG ATCGCATCGG CGAGGGAGCG CGGCATCGTC GTCACGATCG CCGCCGATCC GCTGGCGCTG ACCCTGCTGC GGGCGCCTGG CGACCTCGGC GCGGACATCG CGGTGGGGAG CACGCAGCGG TTCGGACTAC CGCTGTCCTT CGGAGGCCCG CACGCCGGCT ACCTCGCAGT GCGCAAGGGG CTGGAACGGT CCCTGCCGGG GCGGCTCGTC GGGGTGTCGG TCGATGCGGA CGGCGCACCC GCGTACCGGC TCACCCTGCA GACCCGTGAG CAGCACATCC GACGGGAGAA GGCGACGAGC AACATCTGCA CGGCCCAGGT GCTGCCCGCG GTACTCGCCT CCATGTACGC GGTCTACCAC GGCCCGGAGG GCCTGGCCGG GATCGCGCGT CGCATCCACG GGCACGCGGT GCGCCTCGCC GAAGGCCTGC GCGCCGCCGG TGTGACGGTG GTGCACGACG CGTTCTTCGA CACCGTCCTC GCCGCGGTGC CGGGCCGGGC CACCCAGGTG GTTGCGGACG CGTTGGCGCG GGGCGTGAAC CTGCGGCTGG TCGACGACGA CCACGTGGGT ATCTCCTGCA ACGAGACGAC GGGCCAGGCC GAGCTGGAGG CCGTCCGGTC GGCCTTCGGG GTGGGCCCGG AGGCGACGGC GGGCATCACT TGGACCGGGA CCGGGACCGG GACCGAGATC CGGACCGGGG CCTGGACCGG GTCCGTCGCG GCGGCGAACG ATCATCTCGT GGCCCGCGCG GACCAGGCGG ACCAGGCGGA CCAGGCGGAC CAAGCGGCGG ACGAGCCGGT GGCGCTGCCC GCCGAACTGG TGCGTACGGA TCCCTACCTG CAGCATCCGG TGTTCCACGA CCATCGGTCG GAGACCGCGA TGCTCCGCTA TCTGCGTCGA CTGTCCGATC TTGATCTGGC TCTTGACCGG GGCATGATCC CCCTGGGGTC GTGCACGATG AAACTCAACG CGACCACTGA GATGGCCGCG GTCACCTGGC CGGAATTCGC TGACATCCAT CCATTTGCCC CGTTGGACCA GGCCGCCGGA TACCTCGCGA TGATTCAGGA CCTGGAGCGT TGGCTCGCGC AGATCACCGG ATATGCGGGG GTCTCCCTCC AGCCGAATGC CGGCAGCCAG GGTGAGCTCG CCGGGCTGCT CGCCATCCGG GCCTATCACC GCGATCATGC TGTTCCCGGA TCCGTGGTGC GAAACATCTG TCTCATTCCC TCCTCGGCGC ACGGGACGAA TGCGGCGAGC GCCGCGATGG CGGGAATGCG GGTGGTCGTC GTCTCCTGTG ACGACGACGG AAACGTCGAC CTGAACGATC TGGCCCGCAA GGCCCGCGCG AACGCGGACG CCTTGGCCGC GCTGATGGTC ACCTACCCGT CGACCCATGG TGTGTACGAG GAGGGCATCG GGCAGGCGTG CGCGATCGTG CATGAGGCCG GCGGTCTGGT GTACGTCGAC GGAGCGAATC TCAACGCTCT GGTGGGGCTC GCCAGGCCGG GGCAGTTCGG GGCCGACGTG AGCCACCTGA ACCTGCACAA GACGTTCTGC ATCCCGCATG GGGGCGGGGG TCCGGGGGTC GGCCCGGTGG CCGTGGTCGA GAAACTCCTG CCCTATCTGC CGAACCACCC GCTGCGGCCG GAGGCAGGAC CGGCCACCGG AGTGGGCCCG ATCTCGGGAT CCCCGTGGGG CTCGGCCGGA ATTCTTATGA TTCCGTGGGC CTACATTCGG ATGATGGGGG CGGACGGCCT GCGCCGGGCA ACCTCGGTGG CCGTCCTGAA CGCCAACTAC ATTGCCCACC GGCTGCATCC GTACTATCCG GTGCTCTACG CGGGCCGGGA CGGGCTGGTC GCCCATGAGT GCATTCTGGA CCTACGGCCG TTGACGAAGC TGACCGGCGT CACCGTGGAC GACGTGGCGA AGCGTCTCAT CGACTACGGT TTCCATGCTC CGACCATGTC ATTCCCGGTT GCCGGAACAC TGATGGTCGA GCCGACGGAG AGTGAGGATC TCGGCGAGAT CGATCGTTTC TGCGACGCAA TGATCTCCAT TCGGGCCGAG GCGGACAAGG TCGGAGACGG TATCTGGCCG CGGACCGACA ATCCCCTGCA TAATGCTCCG CATACTGCAC AGATGGTTAC CGCCAACGAA TGGTCACACG CTTATCCGCG ATCGGTGGCG GCCTATCCCG TCGCTTCGCT GCGGGCCGCC AAGTACTGGC CCCCGGTACG TCGGATCGAC GGCGCCTATG GTGATCGCAA CCTGGTCTGC ACCTGCCCAC CGGTGGGATC CTTCGCCGCG GAGCCGGTCG ACGAGCAGAT TCTCGCCGGA GCCCGCTGA
|
Protein sequence | MPDVTQYDDT PYADGPATAA DGPATTATTA TISPASSHAR SGAPRQGSAA AVGRNGARRL PAAAIPRFAD RHIGPDPSSQ REMLDALRVE SLAALTDAAV PASIRDHDLD LPAALSEPAV LAALRAFGSR NRPVSSMIGL GFHPAVMPGV IQRNVLENPA WYTAYTPYQP EISQGRLEAL LNFQTMITDL TGLAVAGASL LDEPTAAAEA MQIAFRTAKG SRATFLIDAD TLPQTVSVVA TRAEALGINV VVADLAAGLT AVGPAQLDAA FGLLLSYPGP AGVLRDVRAV IASARERGIV VTIAADPLAL TLLRAPGDLG ADIAVGSTQR FGLPLSFGGP HAGYLAVRKG LERSLPGRLV GVSVDADGAP AYRLTLQTRE QHIRREKATS NICTAQVLPA VLASMYAVYH GPEGLAGIAR RIHGHAVRLA EGLRAAGVTV VHDAFFDTVL AAVPGRATQV VADALARGVN LRLVDDDHVG ISCNETTGQA ELEAVRSAFG VGPEATAGIT WTGTGTGTEI RTGAWTGSVA AANDHLVARA DQADQADQAD QAADEPVALP AELVRTDPYL QHPVFHDHRS ETAMLRYLRR LSDLDLALDR GMIPLGSCTM KLNATTEMAA VTWPEFADIH PFAPLDQAAG YLAMIQDLER WLAQITGYAG VSLQPNAGSQ GELAGLLAIR AYHRDHAVPG SVVRNICLIP SSAHGTNAAS AAMAGMRVVV VSCDDDGNVD LNDLARKARA NADALAALMV TYPSTHGVYE EGIGQACAIV HEAGGLVYVD GANLNALVGL ARPGQFGADV SHLNLHKTFC IPHGGGGPGV GPVAVVEKLL PYLPNHPLRP EAGPATGVGP ISGSPWGSAG ILMIPWAYIR MMGADGLRRA TSVAVLNANY IAHRLHPYYP VLYAGRDGLV AHECILDLRP LTKLTGVTVD DVAKRLIDYG FHAPTMSFPV AGTLMVEPTE SEDLGEIDRF CDAMISIRAE ADKVGDGIWP RTDNPLHNAP HTAQMVTANE WSHAYPRSVA AYPVASLRAA KYWPPVRRID GAYGDRNLVC TCPPVGSFAA EPVDEQILAG AR
|
| |