Gene Lcho_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1140 
Symbol 
ID6163813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1220088 
End bp1221452 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content70% 
IMG OID641663894 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_001790174 
Protein GI171057825 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC TGTTGATCCA CAACGCGCGC CTGCTGGTGA CGATGGACGC CCAGCGGCGC 
GAGATCGCCG ACGGCGCCGT CTTCGCCCGC GACGGCGTGA TCGAGGCGGT CGGCGCCAGC
GCCGAACTGC CGCAGACCGC CGACGAGGTG ATCGACGCAC GCGATCAGGT CGTCATCCCC
GGCCTGGTGA ACACGCACCA CCACATGTAC CAGACGCTCA CGCGCGTGAT CCGCCCAGCG
CAGGATTGCG AGCTGTTCGG CTGGCTGCAG ACGCTCTACC CGATCTGGTC GCACCTGACG
CCCGAAATGG TGCACGTGTC GACCCAGACC GCGATGGCCG AGCTGCTGCT GTCGGGCTGC
ACCACCAGCA GCGACCACCT CTACATCTTC CCCAACAGCG TGCGGCTCGA CGACAGCATC
GAAGCCGCCG CGCAAATCGG CATGCGTTTC CACGCCGCAC GCGGCTCGAT GAGCGTGGGC
CAGTCGCAAG GCGGCCTGCC GCCCGACGGC GTGGTCGAGA GCGAGCCCGC CATCCTGCGC
GAGACCCAGC GCCTGATCGA GCGCTGGCAC GACCCGGCGC GCCACGCGAT GCAGCGCATC
GTGGTGGCGC CGTGCTCGCC GTTTTCGGTC AGCCGCGAGC TGATGCGCGA TGCGGCGGTG
CTGGCGCGCG AACACGGTGT CTCGCTGCAC ACCCACCTGG CCGAAAACGA CAACGACATC
GCCTACACGC GTGAGAAGTT CAACTGCACG CCGGCCGAAT ATGCCGAGCA GCTCGGCTGG
GTCGGCCGCG ACGTCTGGCA CGCCCACTGC GTCAAGCTCG ACGAAGCCGG CATCGCCCTG
TTTGCGCGCA CCGGCACGGG GGTGTCGCAC TGCCCGGGAT CCAACATGCG ACTCGCCTCG
GGCATCGCGC CGATCCGTGC CATGCGCGAT GCGGGCGTGC CGGTGTCGAT CGCGGTCGAC
GGCTCGGCCA GCAACGACAG CGGCCACATG CTCGGCGAGG CGCGGCTCGC GCTGCTGCTG
CAACGCGTGG CGCACGGCCC GGTCAAGGGA CCGAGTGCAT TGACCGCGCG CGAGGTGCTC
GAGATCGCCA CGCGGGGCGG CGCCGCGGTG CTCAACCGCG ACGACATCGG CGCGCTCGCG
CCGGGCATGA GCGCCGACAT CGTGACGATC CCGCTCGACG ACATCGGCCT GGCCGGTGCG
CACCACGACC CACTGGCCGC GCTGTTCTTC TGCCACGTGC CGCGCGTGAA GCACAGCATC
GTCAACGGCC GCGTGGTGGT GCGCGACGGG CGCATCACGA CGCTGGAACT GCCGGTGCTG
ATCGAGCGGC ACAACCGGCT GGCGGCGGAG CTGGTCAACG CCTGA
 
Protein sequence
MTTLLIHNAR LLVTMDAQRR EIADGAVFAR DGVIEAVGAS AELPQTADEV IDARDQVVIP 
GLVNTHHHMY QTLTRVIRPA QDCELFGWLQ TLYPIWSHLT PEMVHVSTQT AMAELLLSGC
TTSSDHLYIF PNSVRLDDSI EAAAQIGMRF HAARGSMSVG QSQGGLPPDG VVESEPAILR
ETQRLIERWH DPARHAMQRI VVAPCSPFSV SRELMRDAAV LAREHGVSLH THLAENDNDI
AYTREKFNCT PAEYAEQLGW VGRDVWHAHC VKLDEAGIAL FARTGTGVSH CPGSNMRLAS
GIAPIRAMRD AGVPVSIAVD GSASNDSGHM LGEARLALLL QRVAHGPVKG PSALTAREVL
EIATRGGAAV LNRDDIGALA PGMSADIVTI PLDDIGLAGA HHDPLAALFF CHVPRVKHSI
VNGRVVVRDG RITTLELPVL IERHNRLAAE LVNA