Gene Lcho_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3670 
Symbol 
ID6160418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4104862 
End bp4105917 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content68% 
IMG OID641666443 
Productcupin 2 domain-containing protein 
Protein accessionYP_001792689 
Protein GI171060340 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGGG AACACACCAT GCAAGAACTT GGACGCCTCG AAGACCTGCC CGCCGACTAC 
GTGCAGGCAC TGCGTGACCT GAACCTCGTG CCGCTGTGGC CGAGCCTGCG CGGCGTGCTG
CCGCCGGGCA AGCCGCGGCC CAACACCCGC GCCACCGCCT GGGCCTACGA ATCGATCAAG
CCGCTGCTGC TGAAAGCCGG CGAACTGACG CCGATCGAGA AGGCCGAGCG CCGCGTGCTG
GTGCTCGCCA ACCCCGGCCA CGGCCTGGAG AAGATGCAGG CCAGCGCCGC GATGTACCTC
GGCATGCAGT TGCTGCTGCC GGGTGAGTGG GCGCCGTCGC ACCGCCACAC GCCCAACGCG
GTGCGCATGA TCGTCGAGGG TGAAGGCGCC TACACCACGG TCGACGGCGA GAAGTGCCCG
ATGTCGCGTG GCGACCTGAT CCTGACACCC ACCGGCCTGT GGCACGAACA CGGCCACGAC
GGCAGCGAGC CGGTGGTCTG GCTCGACGTG CTCGATCTGC CGCTGGTCTA TTACATGGAG
GCCTCGTATC ACATCAACGG CGAGCGCCAG ACCGTCAAGC CCGGCCAGGG TGACCGCGCC
TATGCACGCG GCGGCGTGGC GCCGACGGTG ATGTTCGATC GCTCGGACAA GCGCTACCCG
ATGCTGCGCT ACCCGTGGGT CGACGCACGC GCCGCGCTGG TGTCGCTGGC CGCCGACCGG
CCGGATCTGG ACGCGGTGCA GGTCACCTAC GTCAACCCCG AGACCGGCGC CGACGTCGAG
AACATCCTCG GTTTCTACGC GCTGATGCTG CGCCCGGGCC AGACGCTGCG CCTGCCGGTG
CGCTCGCCGG CGATGGTGTT CCACGTCATC GAAGGTGGTG CCGAGGTGAA GGTCGAAGAC
CAGCGTTTCA CGCTCACCGA GGCCGACACC TGCTGCGCGC CCGGCTACAC CGAGGTGAGC
CTCGTCAACC GCTCGGCCGA CACGCCCACC TTCGTCTTCA TCGCCGACGA ATCGCCGCTG
CACCGCAAGC TCGGCGTGTT CGAGAACCGC GGCTGA
 
Protein sequence
MIGEHTMQEL GRLEDLPADY VQALRDLNLV PLWPSLRGVL PPGKPRPNTR ATAWAYESIK 
PLLLKAGELT PIEKAERRVL VLANPGHGLE KMQASAAMYL GMQLLLPGEW APSHRHTPNA
VRMIVEGEGA YTTVDGEKCP MSRGDLILTP TGLWHEHGHD GSEPVVWLDV LDLPLVYYME
ASYHINGERQ TVKPGQGDRA YARGGVAPTV MFDRSDKRYP MLRYPWVDAR AALVSLAADR
PDLDAVQVTY VNPETGADVE NILGFYALML RPGQTLRLPV RSPAMVFHVI EGGAEVKVED
QRFTLTEADT CCAPGYTEVS LVNRSADTPT FVFIADESPL HRKLGVFENR G