Gene Information |
Locus tag | Franean1_3698 |
Symbol | |
ID | 5672064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4378183 |
End bp | 4379889 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242581 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001508001 |
Protein GI | 158315493 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
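The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4378183
end_bp = 4379889
gene_length_bp = 1707
protein_length_aa = 568

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. Assumption: the last
# codon is a stop codon and contributes no amino acid to the protein.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # gene length matches the coordinates
print(remainder == 0)                   # length is a multiple of three
print(codons - 1 == protein_length_aa)  # protein length matches codon count
```

Under these assumptions all three checks agree with the listed gene length of 1707 bp and protein length of 568 aa.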
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTAA AAGTCTACGA GCGCATCCTC CAGTTGTTCG AAGCCGAGGG CATCAAGACG ATCTTCGGTA TTCCCGACCC GAACTTCGTG CACATGTTCC ACCTCGCCGA GGAACGCGGC TGGACCGTGG TCTCACCGCA CCATGAGGAG TCCGCCGGGT TCATGGCGGA GGCGGTGTCC CGGATGACCG GCAAGGCCGC GGTGGCGATC GGCACCCTGG GCCCGGGTGT CGCGAACCTG GCCGGGGCGA TGATGTGCGC CAAGGTCGAG AACTCGCCGG TCATCTTCCT CGGCGGGCAG CGTGCCCGGA TCACCGAGCA GCGGGTGCGG CGCGGCCGGA TCCAGTTCGT GCGGCAGGCG GGCCTGTTCG AGCCGTCGGT GAAGTACTGC GCCAGCATCG AGTACGCCGA CCAGACCGAC GAGGTCATCC GCGAGGGCCT GCGCAAGGCC CTGTCCGGCA CCCCGGGCCC GGTCTACATC GAGTACCCCT CCCACATCAT CCAGGAGGAG CTCGACGTTC CCCCGCCGCT GCCGCCGGCG GCGTACCGGC TGGTGAACCA GACCGCCGGC CCCGACAGGA TCGCCGAAGC CGTGCAGTAC ATCCGCGCGG CCAAGCAGCC GGTCCTGCTC GTCGGGCACG GTGTGCACAC CTCCCGCGCG GGAGAGTCCG TCCGCGCCCT CGCGGAGGCG ATGGCCTGCC CGGTCATCCA GACCTCCGGC GGCACCTCCT TCATCAAGGG CCTGGAGGAC CGCACCTTCC CCTACGGCTT CTCCGCGGCG TCGATCGACG CGGTGGTCAA GTCCGACCTG TGTCTGGCGA TCGGCACCGA GCTCGGGGAG CCGGTCCACT ACGGCCGCGG CCGGCACTGG GTCGCGAACG AGGCCAACCG CAAGTGGATC CTCATCGAGC AGGACCCCGA GGCCATCGGG GTGAACCGGT CGATCGACGT GCCCCTCGTC GGTGACCTGC GCGCGGTCGT CCCGCAGCTC GTCGACGCCC TCAAGGACAC CCCGCGCACG CCGACCCCCG AGCTCGACGG CTGGATCCGC CAGGACGCCG CCCAGCTGGC GGAGCTCGCG GAGACTGCCC CGGCGGGCAT GTCACCCGTG CACCCGGCGC GCCTGATCGT CGAGGCCACC AAGGTCTTCC CGCCGGACGG CATCATGGTG CGCGACGGCG GCGCCACCAC GATCTTCGGC TGGACGTACT CACAGGCCAA GCCGCACGAC GTGATGTGGA ACCAGAACTT CGGCCACCTG GGCACCGGGC TGCCCTACGC GGTCGGCGCC GGGGTGGCGG ACGGCGGCAA GCGCCCGATG ATGCTGATCA CCGGGGACTC GTCCTTCCAG TTCCACATCG CCGAGCTGGA GACCGCCGCC CGGCTGAACC TCCCGCTGGT CTGCGTCGTG GCCGTCGACT ACGCCTGGGG CCTCGAGGTC GGCGTCTACA AGCGCACCTT CGGCCAGGGC TCGCTGGAGA CCGGCGTCCA CTGGAGCAAG AACACCCGCC TGGACAAGGT CGCCGAGGGC TTCGGCTGTT ACGGCGAGTA CGTCGAGCGC GACGAGGACA TCGCCCCCGC CATCAAGCGC GCCTACGCCA GCGGAAAGAC CGCCGTCATC CACGTCGCGG TCGACCCGAA GGCCAACTCG GAGGAAATGC CGAGCTACGA CGAGTTCCGG ACCTGGTACG CCGAGGGAAC GCAGTGA
|
Protein sequence | MAVKVYERIL QLFEAEGIKT IFGIPDPNFV HMFHLAEERG WTVVSPHHEE SAGFMAEAVS RMTGKAAVAI GTLGPGVANL AGAMMCAKVE NSPVIFLGGQ RARITEQRVR RGRIQFVRQA GLFEPSVKYC ASIEYADQTD EVIREGLRKA LSGTPGPVYI EYPSHIIQEE LDVPPPLPPA AYRLVNQTAG PDRIAEAVQY IRAAKQPVLL VGHGVHTSRA GESVRALAEA MACPVIQTSG GTSFIKGLED RTFPYGFSAA SIDAVVKSDL CLAIGTELGE PVHYGRGRHW VANEANRKWI LIEQDPEAIG VNRSIDVPLV GDLRAVVPQL VDALKDTPRT PTPELDGWIR QDAAQLAELA ETAPAGMSPV HPARLIVEAT KVFPPDGIMV RDGGATTIFG WTYSQAKPHD VMWNQNFGHL GTGLPYAVGA GVADGGKRPM MLITGDSSFQ FHIAELETAA RLNLPLVCVV AVDYAWGLEV GVYKRTFGQG SLETGVHWSK NTRLDKVAEG FGCYGEYVER DEDIAPAIKR AYASGKTAVI HVAVDPKANS EEMPSYDEFR TWYAEGTQ
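The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses the amino acid assignments of translation table 11 (the table named in the record) and only the first 30 bases of the gene sequence; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced here for illustration only.

```python
from itertools import product

# Amino acid assignments for translation table 11, listed in the
# conventional T, C, A, G codon order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGGCGGTAA AAGTCTACGA GCGCATCCTC"
print(translate(gene_prefix))  # MAVKVYERIL
print(gc_fraction(gene_prefix))
```

The translated prefix matches the first ten residues of the listed protein sequence. The GC fraction of a 30-base prefix need not match the 70% reported for the whole gene; passing the full gene sequence to `gc_fraction` would give the gene-wide value.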
|
| |