Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2497 |
Symbol | |
ID | 6160676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 2718370 |
End bp | 2719899 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641665267 |
Product | thymidine phosphorylase |
Protein accession | YP_001791527 |
Protein GI | 171059178 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000364151 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCCGC CTGACCTGCG CATCCGCCGC GTCGCCATCG ACACCTGGCG CGAGAACGTC GCCTACCTGC ACCGCGACTG CCCGGTGGTG CGGGCCTCGG GTTTCCAGGC GCTGTCGAAG GTGACGGTGC ACGCCAACGG CACCACCATC AGCGCGGTGC TCAACGTGGT CGACGACGAG CGCATCGTGC AGGCCTGCGA GCTGGGCCTG TCCGAAGACG CCTTCGCACG CATGAACGTG CCCGAGGGTC ACGCCGCCCA TGTGGCGCCG GCCGAGCCGC CTGCCTCGAT CGGCGCGCTG CACCGCAAGA TCGGCGGCGA GCGGCTGTCG CGCGACGACC TGCACGCGAT CGTGCGCGAC ATCGCCCAGG CGCGTTATTC CAAGATCGAG CTGGCCGCCT TCGTGGTCGC CACCAACGGC TACGACCTCG ACCGCGACGA GGTGCTGCAC CTCACCGAGG CCATGATCGC CGCCGGCCGC CGGCTCGACT GGCAGCACCA GGTGCGCGGC GGGCCGGTGG TCGACAAGCA CTGCATCGGC GGCATCCCGG GCAACCGCAC CTCGATGCTG GTGGTGCCGA TCGTCACGGC GCACGGCATG CTGTGCCCCA AGACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGACACGATG GAGGTGCTGG CCGAGGTCGA GCTGCCCTTC GAGCGGCTGA CGCAGATCGT GCGCGACACC AACGGCTGCC TGGTGTGGGG TGGCCGCGCC GGGCTGTCGC CGGCCGACGA CATCCTGATC TCGGTCGAGC GGCCGCTGGC GATCGACTCG CCCGGCCAGA TGGTGGCCTC GATCCTGTCC AAGAAGATCG CCGCCGGCTC GACCCACCTG GTGCTCGACA TCCCCGTCGG CCCGACCGCC AAGGTGCGCT CGATGCCGGC CGCGCAGCAG CTGCGCAAGC TGTTCGAGTA CGTCGCCGAC CGGCTCGGCC TGCACCTCGA GGTGGTGATC ACCGACGGCA GCCAGCCGAT CGGCAGCGGC ATCGGCCCGG TGCTGGAGGC GCGCGACGTG ATGCGCGTGC TGCGCAACGA CCCGGCGGCG CCGGCCGACC TGCGCGACAA GTCGCTGCGC CTGGCCGGCC GCGTGATCGA GTTCGACCCC GACGTGCGCG GCGGCGACGG CTGGCGCATC GCCCGCGACA TCCTCGAATC GGGCCGTGCG CTGGCGCAGA TGAACGCCCT GATCGACGCC CAGGGCCGCC GCCTGCAGCC GCCCGCGCTC GGCGAGATGA CGCACGAGGT GATCGCGCCG GCCGACGGCG CGGTCAGCGC GATCGACAAC CTGCAGCTCG CCCGCATCGC CCGCCTGGCC GGCGCCCCGC AGGTGATCGG CGCCGGCGTC GACCTGTTGC GCAAGCAGGG CGATGCGGTG CAGGCCGGCC AGCCGCTCTA CCGCATCCAC GCCTGCTACG CCGCCGACCT GGGGTTCTCG CGCGACTTGG CCGCGCGCGA CAGCGGCTAC ACCGTCGCCG CCACCACCCG CCCGTCATGA
|
Protein sequence | MNPPDLRIRR VAIDTWRENV AYLHRDCPVV RASGFQALSK VTVHANGTTI SAVLNVVDDE RIVQACELGL SEDAFARMNV PEGHAAHVAP AEPPASIGAL HRKIGGERLS RDDLHAIVRD IAQARYSKIE LAAFVVATNG YDLDRDEVLH LTEAMIAAGR RLDWQHQVRG GPVVDKHCIG GIPGNRTSML VVPIVTAHGM LCPKTSSRAI TSPAGTADTM EVLAEVELPF ERLTQIVRDT NGCLVWGGRA GLSPADDILI SVERPLAIDS PGQMVASILS KKIAAGSTHL VLDIPVGPTA KVRSMPAAQQ LRKLFEYVAD RLGLHLEVVI TDGSQPIGSG IGPVLEARDV MRVLRNDPAA PADLRDKSLR LAGRVIEFDP DVRGGDGWRI ARDILESGRA LAQMNALIDA QGRRLQPPAL GEMTHEVIAP ADGAVSAIDN LQLARIARLA GAPQVIGAGV DLLRKQGDAV QAGQPLYRIH ACYAADLGFS RDLAARDSGY TVAATTRPS
|
| |