Gene Lcho_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3335 
Symbol 
ID6160769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3713767 
End bp3715191 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content74% 
IMG OID641666110 
Productpeptidase M48 Ste24p 
Protein accessionYP_001792358 
Protein GI171060009 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCCC TCTTCAAGCC CGCCCGACCC GCCTTGCGCG CCTTCGCAGC GCGCCTGCAC 
GACGGCCGAC CGAACCGGCG CGACATCGGC CTGCTGTTCG GCAGCGCGGC GGGCTTGCTC
GGCGGCTGCG CCAGCCCGGG CGGCGTGCCG CCGACCGGCG ACACGGCCGC ACCGGCTGCG
CGGGCGACCC CGTCGGCATC GCCCGCAGCC CGGCCGCCGC CCAAGCCGGC CCCGATCGAC
GACGCCGGCG CCCGCGCGCT CGACGCGCAG CAGGCGCCGC AGCAGTTCTC GCTCGACCTG
GGCGCGCTGC AGGACGTGGC CATCAACACC TACGTCGGCG AGATCGGCTT CGCCATCCAG
GCCCAGGCGC CGCGGCGTGG CCTGCCCTAC AGCTACCGCG CGCTCAACGC CCACCACCTG
AACGCCTACG CCTTCCCGGC CGGCGGGCTG GGCATCACGC GCGGCCTGCT GATCGAGCTG
CAGGACGAGG CCGAGCTCGC CGCGCTGATC GGCCAGCAGC TCGGCCACGT CAACGCCCGC
CACGCACTGA GCCGCCAGCG CACCGATTCG GTGGCGCAGG CGGTGGTCAC GAACACCGTG
GCGGCCAGCC AGGAATCGGC CTGGACGCCG CCGATCGGGC TGGCCGGACA GATCGGCGCC
AGCGCGCTGA TCCCGACCTA CTCGGCCGAA CAGATGCGCG AGGCCGACGC GGCGGGGCTG
CAGTACCTGG TCGGCGCGGG CTATCCGGGG CTCGGCATGG TGACGCTGCA GCAACGCCTG
GCCGAAGCCG GGCAGCAGCG CCCGGCCCTG CTGGCGGCGA TGGCGGCGGC GCAGCCGACC
AGCCCCGAGC GGCGTGACGC GGTGCGCCGC AACGTCGAGA CCCTGCACGC CGGCAGCCGC
AACAGCAGCA CGCGCCGCGA GCGTTTCATG GACCGCACCG CCAGCCTGCG GCACATGCGC
GCGCTGATCG AGGCCTGCAA GAACGGCGAA CTGGCGCTGG CCCGCAAGGA CCTGACCGAA
GCGCACGCGC AGTTCAAGTC GGCGCTCGAG ATGGCGTCGC AGGACTACGC CGCCAACCTG
CGCATGGCGC AGTGCCTGCA GGCGATGGGG CAGGTGCGCG AGGCGCGTGC ATTCGCGATC
GCCGCGCGCG ACGCCTATCC GCAGGAGGCC CAGGCGCACA AGCTGGCCGC CACGCTGGCA
CTGGCGCAGC GCGACGCCGC CGCCGCCTGG CAGGACCTCG AGGCGCATGA CCGCCTGCTG
TCGGGCGACC CCGGCGTGGT GTTCCTGAAA GGCGTCACAC TTGAGCTCAT GGGACAAAGC
AAACGCGCCG CCGAACACTA CCGCGCCTAC CTCGGCTACA CCGAACAGGG CCAGGCCGCC
CAATACGCCG CCACCCGCCT GAAGCTGCTC GGTCATGACC GCTGA
 
Protein sequence
MDPLFKPARP ALRAFAARLH DGRPNRRDIG LLFGSAAGLL GGCASPGGVP PTGDTAAPAA 
RATPSASPAA RPPPKPAPID DAGARALDAQ QAPQQFSLDL GALQDVAINT YVGEIGFAIQ
AQAPRRGLPY SYRALNAHHL NAYAFPAGGL GITRGLLIEL QDEAELAALI GQQLGHVNAR
HALSRQRTDS VAQAVVTNTV AASQESAWTP PIGLAGQIGA SALIPTYSAE QMREADAAGL
QYLVGAGYPG LGMVTLQQRL AEAGQQRPAL LAAMAAAQPT SPERRDAVRR NVETLHAGSR
NSSTRRERFM DRTASLRHMR ALIEACKNGE LALARKDLTE AHAQFKSALE MASQDYAANL
RMAQCLQAMG QVREARAFAI AARDAYPQEA QAHKLAATLA LAQRDAAAAW QDLEAHDRLL
SGDPGVVFLK GVTLELMGQS KRAAEHYRAY LGYTEQGQAA QYAATRLKLL GHDR