Gene Lcho_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2988 
Symbol 
ID6162642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3290684 
End bp3292486 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content67% 
IMG OID641665765 
Productpeptidase U35 phage prohead HK97 
Protein accessionYP_001792015 
Protein GI171059666 
COG category 
COG ID 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA CTGAGCTGGT CAACCGCTCT GCCACCTTCG AGCGGGCGCT GGTCGACCTG 
GCCGCGCGGA CCGTGCCGGT TTCGTTGTCC AGCGAACGGC CAGTGATGCG TTACGGCGAG
TTGGAGGTTC TGCGCCACGA ACCCCAGTGC ATCGACCTGA GCCGCGCCAT CGGGGGCTTG
CCGCTGCTGT ACTGCCACGA CCACGCCCAG CCTGTCGGCG TGGTCGAGAA CGTGCGCCTC
GATGGCCGGC GCCTGGTCGG CACCGCACGC TTCGGCCAGT CGGAGAAGGC CCAGGAGGTC
TTCCAAGACG TGCGCGACGG CTTGTTGCGC GGCATCTCGG TGGGCTACCG CATCAACGCC
ACCGAACCCA TTCTCGGCGG CATTGCCGCC ACCTCGTGGA CGCCCTACGA GGCGTCCGTC
TTGGCCGTTC CTGAAGATTC AACCGTCGGT ATCGGCCGCA GCGCTGGCCA ACTCCAAACC
CCTGAAGGGA ATTCCCCCAT GCCCGCAAAC ACCCTCGACC TGCACACCCG CGCAGCTGCC
CCCAACGAAG TGCGCGAGCT GGTCAAGCTC CACGGCCTCA ACGCCAGCGT GGCCGATGGC
CTGATCCAGC GCGGCGCCAC CCTGGACGCG GTGCGCGCCC ACGTCCTGGA CGCCCTGGCG
TCGAGCGACC GGGCCTCGGG TGGCCACCTG AACACGACCT CCAACGGCAT GGAGTACCAC
GGCCGCAGCC TGGCCAGCAT CGGCCACGAA ATCCACGGTC CGCAGGTTGA GCAGATGCAA
GAGGCCCTGG TGGCTCGGAT GGGCGGCCCG GCCGCCAAGA CCGGCAACCA GTACCGCCAC
GCCCGCATGG CCGACATGGC GCGCGACCTG CTGGAGCATC GCGGCCTTCG CACAACGTCG
ATGGCACCGC GCGAGCTGGT CGAGCGGGCG TTGCATACCA CGAGCGACTT TGCGGGCCTG
TTGCAGGGCG CTGGAAATCG GCTGCTGCGC CAGGGCTACG AGTCGGCCCC CAGCATCAAG
CGGGTGTTCA AGGCCAGCAC CGTGGCCGAT TTCCGCGCCA AGCAGAAGCT GAACTTGGGC
GAAGCGCCAG CCCTGCTGAA GGTCAACGAA CACGGGGAGT TCAAGAGCGG CTCCATGGCC
GACACGACCT CGAGCTACAG CCTGGCCACA TTCGGCCGCA TCTTCGGCAT CTCGCGCCAG
GCACTCGTGA ACGACGACCT GAACGCCTTC GGCGACATGT CCGTGCGCTT GGGTAAGGCG
TCGGCCGAAT TCGAAAATCA GTTCCTCGTG GATCTGCTGA CCAGCAATCC CTCGATGTAC
GACGGCACCG CGCTGTTCCA CGCCGCCCAC GGGAACCTGG CCACCGGCGC AGGCTCTGCG
CTACAGCTGT CCGCTCTGAC GGTGGCCCGC CAGGCGATGC GACTGCAGAA AGGCCTGGAC
GGCAAAACGC CGATCGATGC TTCCCCGCGT TACCTGGTGG TGCCGGCTGC ACTGGAAACG
ACCGCCGAGC AGTTGGTGAG CGCCATCACA CCGAACCAGT CTTCCAGCGT CAATCCGTTC
GCCGGCCGGC TGGAGTTGGT GGTAGATCCG CGCCTCGATG CGGTGTCTCC GACGGCCTGG
TATCTGGCCG CCGATTCGGC CGTGATCGAG ACGATCGAGT ACGGCTACCT GGACTCGGCC
AACGGCCCGG AGATCTTCAC CGAAGAAGGC TTCGAAATCG ACGGCCTGCA CATGAAGGTT
CGCCTCGACT TCGGCGGCGG TGTGATCGAC TGGCGCGGCC TCTACAAGTC CGTAGGCGCC
TGA
 
Protein sequence
MSRTELVNRS ATFERALVDL AARTVPVSLS SERPVMRYGE LEVLRHEPQC IDLSRAIGGL 
PLLYCHDHAQ PVGVVENVRL DGRRLVGTAR FGQSEKAQEV FQDVRDGLLR GISVGYRINA
TEPILGGIAA TSWTPYEASV LAVPEDSTVG IGRSAGQLQT PEGNSPMPAN TLDLHTRAAA
PNEVRELVKL HGLNASVADG LIQRGATLDA VRAHVLDALA SSDRASGGHL NTTSNGMEYH
GRSLASIGHE IHGPQVEQMQ EALVARMGGP AAKTGNQYRH ARMADMARDL LEHRGLRTTS
MAPRELVERA LHTTSDFAGL LQGAGNRLLR QGYESAPSIK RVFKASTVAD FRAKQKLNLG
EAPALLKVNE HGEFKSGSMA DTTSSYSLAT FGRIFGISRQ ALVNDDLNAF GDMSVRLGKA
SAEFENQFLV DLLTSNPSMY DGTALFHAAH GNLATGAGSA LQLSALTVAR QAMRLQKGLD
GKTPIDASPR YLVVPAALET TAEQLVSAIT PNQSSSVNPF AGRLELVVDP RLDAVSPTAW
YLAADSAVIE TIEYGYLDSA NGPEIFTEEG FEIDGLHMKV RLDFGGGVID WRGLYKSVGA