Gene Lcho_2497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2497 
Symbol 
ID6160676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2718370 
End bp2719899 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content72% 
IMG OID641665267 
Productthymidine phosphorylase 
Protein accessionYP_001791527 
Protein GI171059178 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000364151 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCCGC CTGACCTGCG CATCCGCCGC GTCGCCATCG ACACCTGGCG CGAGAACGTC 
GCCTACCTGC ACCGCGACTG CCCGGTGGTG CGGGCCTCGG GTTTCCAGGC GCTGTCGAAG
GTGACGGTGC ACGCCAACGG CACCACCATC AGCGCGGTGC TCAACGTGGT CGACGACGAG
CGCATCGTGC AGGCCTGCGA GCTGGGCCTG TCCGAAGACG CCTTCGCACG CATGAACGTG
CCCGAGGGTC ACGCCGCCCA TGTGGCGCCG GCCGAGCCGC CTGCCTCGAT CGGCGCGCTG
CACCGCAAGA TCGGCGGCGA GCGGCTGTCG CGCGACGACC TGCACGCGAT CGTGCGCGAC
ATCGCCCAGG CGCGTTATTC CAAGATCGAG CTGGCCGCCT TCGTGGTCGC CACCAACGGC
TACGACCTCG ACCGCGACGA GGTGCTGCAC CTCACCGAGG CCATGATCGC CGCCGGCCGC
CGGCTCGACT GGCAGCACCA GGTGCGCGGC GGGCCGGTGG TCGACAAGCA CTGCATCGGC
GGCATCCCGG GCAACCGCAC CTCGATGCTG GTGGTGCCGA TCGTCACGGC GCACGGCATG
CTGTGCCCCA AGACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGACACGATG
GAGGTGCTGG CCGAGGTCGA GCTGCCCTTC GAGCGGCTGA CGCAGATCGT GCGCGACACC
AACGGCTGCC TGGTGTGGGG TGGCCGCGCC GGGCTGTCGC CGGCCGACGA CATCCTGATC
TCGGTCGAGC GGCCGCTGGC GATCGACTCG CCCGGCCAGA TGGTGGCCTC GATCCTGTCC
AAGAAGATCG CCGCCGGCTC GACCCACCTG GTGCTCGACA TCCCCGTCGG CCCGACCGCC
AAGGTGCGCT CGATGCCGGC CGCGCAGCAG CTGCGCAAGC TGTTCGAGTA CGTCGCCGAC
CGGCTCGGCC TGCACCTCGA GGTGGTGATC ACCGACGGCA GCCAGCCGAT CGGCAGCGGC
ATCGGCCCGG TGCTGGAGGC GCGCGACGTG ATGCGCGTGC TGCGCAACGA CCCGGCGGCG
CCGGCCGACC TGCGCGACAA GTCGCTGCGC CTGGCCGGCC GCGTGATCGA GTTCGACCCC
GACGTGCGCG GCGGCGACGG CTGGCGCATC GCCCGCGACA TCCTCGAATC GGGCCGTGCG
CTGGCGCAGA TGAACGCCCT GATCGACGCC CAGGGCCGCC GCCTGCAGCC GCCCGCGCTC
GGCGAGATGA CGCACGAGGT GATCGCGCCG GCCGACGGCG CGGTCAGCGC GATCGACAAC
CTGCAGCTCG CCCGCATCGC CCGCCTGGCC GGCGCCCCGC AGGTGATCGG CGCCGGCGTC
GACCTGTTGC GCAAGCAGGG CGATGCGGTG CAGGCCGGCC AGCCGCTCTA CCGCATCCAC
GCCTGCTACG CCGCCGACCT GGGGTTCTCG CGCGACTTGG CCGCGCGCGA CAGCGGCTAC
ACCGTCGCCG CCACCACCCG CCCGTCATGA
 
Protein sequence
MNPPDLRIRR VAIDTWRENV AYLHRDCPVV RASGFQALSK VTVHANGTTI SAVLNVVDDE 
RIVQACELGL SEDAFARMNV PEGHAAHVAP AEPPASIGAL HRKIGGERLS RDDLHAIVRD
IAQARYSKIE LAAFVVATNG YDLDRDEVLH LTEAMIAAGR RLDWQHQVRG GPVVDKHCIG
GIPGNRTSML VVPIVTAHGM LCPKTSSRAI TSPAGTADTM EVLAEVELPF ERLTQIVRDT
NGCLVWGGRA GLSPADDILI SVERPLAIDS PGQMVASILS KKIAAGSTHL VLDIPVGPTA
KVRSMPAAQQ LRKLFEYVAD RLGLHLEVVI TDGSQPIGSG IGPVLEARDV MRVLRNDPAA
PADLRDKSLR LAGRVIEFDP DVRGGDGWRI ARDILESGRA LAQMNALIDA QGRRLQPPAL
GEMTHEVIAP ADGAVSAIDN LQLARIARLA GAPQVIGAGV DLLRKQGDAV QAGQPLYRIH
ACYAADLGFS RDLAARDSGY TVAATTRPS