Gene Lcho_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2110 
Symbol 
ID6161348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2305795 
End bp2306946 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID641664879 
Producthypothetical protein 
Protein accessionYP_001791142 
Protein GI171058793 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.498409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACAA TGCCGGTCAT GGCTGACGCA TTCAAGAACC TGATCAACCC CGCCACCGTC 
GCGGCGATGG CGCAGCACCT CGGGCGGGTC AGCGCGCATT TTGATCACGA CGCCTTCGTC
GCGCGCGCCT TGCCCGATCT CGACGGCCTC GAATTCAAGG CCCGCGCGAT GCAGCTGGCC
GACGCGCTCG AGCAGGCGCT GCCGGCCGAT TTCAGCCTGG CGTCGCAGGC GCTGGTGAAT
GCGCTCGGGC CGCCGGGGCA GGGCGACGAT CTGAGCGGGC TGCGGACCGG CGATCAGGGC
CTGGCCGGCT GGGCGCTGTG GCCGATGGGC GAGTTCATCG CCCGCCACGG CCTGGCCGAT
CCCGCGCGCG GGCTGCAGGC GCTGCACGCG ATGACACAGC GTTTCAGCGC CGAGTTCGCG
ATCCGGCCGT TCATCCTGGC GCATCCGCAG CTGACGTTCG AGACCCTGGC GCGCTGGGTG
CACGACCCGA GCGCACATGT GCGCCGACTG GTCAGCGAGG GCAGCCGCCC GCGCCTGCCG
TGGGGCCTGC AGCTCAAGCC GCTGATCGCC GACCCGAGCC CGACGCTTGC GCTGCTGGCC
GCGCTGCAGG ACGACCCCAG CGCCTACGTG CGCCGCTCGG TCGCCAATCA TCTCAACGAC
ATCGCCAAGG ATCATCCCGC GCGCGTGGCC GAGTGGCTGC AGCGCCATCT GCCCGATGCG
TCCGACAACC GCCGGGCGCT GCTGCGCCAC GCCAGCCGCA CGCTGATCAA GCAGGGTGAC
GCCGCGGTGC TGACGGCCTG GGGCCTGGGC GCCGAACTGC ACGGCCAGGC GGCGCTGCGC
ATCGGGCCGG CGCGCATCCG GCTGGGCGAA GCCGTCGAGC TGAGCCTGAC GCTGCGTTCG
ACCGCAGCCG CCGCGCAGGC GCTGGTGGTC GACTACGTGG TGCACCACGT CAAGGCCGGC
GGCAGCACGT CGCCCAAGGT CTTCAAAGGC TGGCGCGTGC AACTGGCGGC GGGCGAGCAG
CGGCTGCTCA GCCGCCGCCA CAAGGTGACG CCGATCACCA CCCGCACCTA TCACGCGGGT
TGGCATCGCG TGCAGGCGCA GGTCAACGGA CGGGTGGTGG CCGAAGCGGG GTTCGAGCTG
GGTGTCGATT GA
 
Protein sequence
MATMPVMADA FKNLINPATV AAMAQHLGRV SAHFDHDAFV ARALPDLDGL EFKARAMQLA 
DALEQALPAD FSLASQALVN ALGPPGQGDD LSGLRTGDQG LAGWALWPMG EFIARHGLAD
PARGLQALHA MTQRFSAEFA IRPFILAHPQ LTFETLARWV HDPSAHVRRL VSEGSRPRLP
WGLQLKPLIA DPSPTLALLA ALQDDPSAYV RRSVANHLND IAKDHPARVA EWLQRHLPDA
SDNRRALLRH ASRTLIKQGD AAVLTAWGLG AELHGQAALR IGPARIRLGE AVELSLTLRS
TAAAAQALVV DYVVHHVKAG GSTSPKVFKG WRVQLAAGEQ RLLSRRHKVT PITTRTYHAG
WHRVQAQVNG RVVAEAGFEL GVD