Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1754 |
Symbol | pyrG |
ID | 5670156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2104159 |
End bp | 2106117 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240675 |
Product | CTP synthetase |
Protein accession | YP_001506098 |
Protein GI | 158313590 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0504] CTP synthase (UTP-ammonia lyase) |
TIGRFAM ID | [TIGR00337] CTP synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0206256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00103098 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCGCAGG CGCATGTGAC CAAGCACGTG TTCGTGACGG GAGGTGTGGC GTCCAGTCTC GGTAAGGGAC TGACGGCGTC CAGCCTCGGC CGTCTGCTCA AGGCCCGCGG TCTGCGGGTC ACGATGCAGA AGCTCGACCC GTATCTGAAT GTTGATCCGG GAACGATGAA CCCGTTCCAG CACGGCGAGG TGTTCGTCAC CAACGACGGC GCCGAGACCG ACCTGGACAT CGGGCACTAC GAACGCTTCC TCGACGTCGA CCTCGACGGC AGCGCGAACG TGACCACCGG GCAGGTGTAC TCGGCGGTCA TCGCCCGCGA GCGGCGCGGC GGGTACCTCG GTGAGACGGT GCAGGTCGTC CCGCACATCA CCGACGAGAT CAAGGGCCGG ATCCGCCGGC TGGCCACCGA TTCGGTGGAC GTGGTGATCA CCGAGGTGGG TGGCACCGTC GGCGACATCG AGTCGCTGCC CTACCTGGAG GCGATCCGCC AGGTCCGGCA CGAGGTCGGC CGGGACAACG CGCTGACGGT GCACGTCAGC CTCGTCCCGT ACCTCGCGCC GTCGGGTGAG CTGAAGACCA AGCCGACCCA GCACTCGGTC GCGGCGCTGC GCAGCATCGG CCTGCAGCCG GACGCGGTGG TCTGCCGCTC CGACCGGCCG TTGCCCGACG CGCTCAAGCG CAAGATCGCC ATGATGTGCG ACGTCGACGA CGAGGCGGTC GTCGGCGCAC CGGACGCCGG CTCGATCTAT GACATCCCGA AGGTGCTGCA CAAGGAGGGC CTCGACGCCT ACGTCGTGCG CCGGCTCGGC CTCAGCTTCC GGGACGTCGA CTGGACGGAG TGGGACACCC TGCTCCGCCG TGTGCACCAT CCCTCGAACA CCGCCACGGT GGCGATCGTC GGCAAGTACG TCGACCTGCC CGACGCCTAC CTGTCGGTGA CCGAGGCGCT GCGCGCCGGC GCGTTCGCCG TCGACGCCCG CGCGGACATC CGCTGGATCG GGTCGGACGA GTTCACCTCG CCCGCCGCCG CCGCGCACGC GCTGGAAGGC GTGGACGGGA TCATCGTCCC GGGCGGGTTC GGGATCCGCG GCATCGAGGG CAAGATCGAG GCGCTGCGGC ACGCCCGCGA ACACGGCATT CCCACGCTGG GGATCTGCCT GGGCCTGCAG TGCATGGTGA TCGAGGCCGG CCGTTCGCTG GCCGGGCTGG CCGGGTCGAA CTCCACCGAG TTCGACCCGG AGACCGCCCA CCCGGTGATC TCCACGATGG CCGACCAGCA CGACGTGGTC GCCGGCAAAC GGGATATGGG CGGCACGATG CGGCTCGGCC TGTACCCGTG CGCGCTGGGC GCGGGGACCG TCGCCCGCCG GGAGTACGGC GAGCCGGAGG TGCTCGAGCG CCACCGGCAC CGCTACGAGG TCAACAACTC CTACCGGGAG CGGCTGACCG CCGCCGGGCT GGTGTTCTCC GGCACCTCGC CGGACGGCCG CCTGGTCGAG GTCGTCGAGC TGCCGGCCGA CGTCCACCCG TTCTACGTGG GGACGCAGGC GCACCCGGAG TTCCGGTCGC GGCCGACGCG CGCGCACCCG CTGTTCCGCG GGCTCGCCGC CGCTGCCGTG GCCCACGCCG ACCGGCGCCG GGGCCGGCTG CCCGTCGACA TCCCGGACGG TGGTGTCGCG GGCGGTGGTG TCGCGGGCGG CGGGCCCGCC GATGGTGACA TGATCGGGTC GGGTGACGGT GACGCGGCCG GTGCGCGGGC GGGCAACGGT GCCCGGTCGG CAGGTGGGCG GGCCCGCCGG GGCCGCGCGG AGCAGAGCGC GGCGGCCGGC GCCGGCACGA CCGCCGGTGT CAGCGGTGCC GCGGGTACCG GTGGTGCCGT GGGTGCGGCA TTGCCCGGTG GGTCCACGGG GCCGGCCGCG GCGGCCGGTG TGAGCGACGG AGAGCTGGTT TCCCCATGA
|
Protein sequence | MAQAHVTKHV FVTGGVASSL GKGLTASSLG RLLKARGLRV TMQKLDPYLN VDPGTMNPFQ HGEVFVTNDG AETDLDIGHY ERFLDVDLDG SANVTTGQVY SAVIARERRG GYLGETVQVV PHITDEIKGR IRRLATDSVD VVITEVGGTV GDIESLPYLE AIRQVRHEVG RDNALTVHVS LVPYLAPSGE LKTKPTQHSV AALRSIGLQP DAVVCRSDRP LPDALKRKIA MMCDVDDEAV VGAPDAGSIY DIPKVLHKEG LDAYVVRRLG LSFRDVDWTE WDTLLRRVHH PSNTATVAIV GKYVDLPDAY LSVTEALRAG AFAVDARADI RWIGSDEFTS PAAAAHALEG VDGIIVPGGF GIRGIEGKIE ALRHAREHGI PTLGICLGLQ CMVIEAGRSL AGLAGSNSTE FDPETAHPVI STMADQHDVV AGKRDMGGTM RLGLYPCALG AGTVARREYG EPEVLERHRH RYEVNNSYRE RLTAAGLVFS GTSPDGRLVE VVELPADVHP FYVGTQAHPE FRSRPTRAHP LFRGLAAAAV AHADRRRGRL PVDIPDGGVA GGGVAGGGPA DGDMIGSGDG DAAGARAGNG ARSAGGRARR GRAEQSAAAG AGTTAGVSGA AGTGGAVGAA LPGGSTGPAA AAGVSDGELV SP
|
| |