Gene Lcho_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3036 
Symbol 
ID6162108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3353799 
End bp3355589 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content71% 
IMG OID641665811 
Producttail sheath protein 
Protein accessionYP_001792061 
Protein GI171059712 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00018204 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAAT ACCTGGCACC CGGTGTCTAC GTCGAAGAGA CGAGCTTTCG CGCCAAGTCC 
ATCGAGGGCG TGGGCACCAG CACCACTGCC TTCGTCGGCC CGACCCGGCG CGGCCCGGTC
GGCACCCGCT CGGAGCTGAT CACCAGCCTG CCGGACTTCG AGCGCTACTA CGGCAGTCTC
GAGAACCTCG CGCTCAGCGA CGTCAGCGCG GCGGCGAATC GGCTCAACTT CCTGGCCCAC
GCGGTGCGGG CGTATTACGA CAACGGCGGC TCGCGGCTCT ACGTGGTGCG CACCGCCAAC
GGCGCCGGCA GCGCCAGCGC GCCGCTGATC CTGACCGGCG CCCCGCCGGC TGAACCCGCC
ACCGAGGCCG ACCGCGTCAC CGCCCGCGCC CGTTTCCCCG GCGCCAGCGG CAACGGCCGC
CTGCTGCTGC GCGAAGTGCC GCGCCCGGCT ACCGCCCGCA CGCTCGACAC CGCACCCGCC
GGCAGTCTGG CGCGTGTCAC CGTGAGCAGC GGCGCGCCAC CGGTCGACAC CGTCACGCTC
TACCAGAAGG TCGGCAACCT CTGGCAGGAC AGCGCCACGC CCGCCGGCAC GCTGACCGTG
GCCGACCCGA TCCGCGCCGA GCTGCTGACG CTGGCCGCCG AGTTCATCGA CGCCGACGGC
TACGTGCAAG GCTGGGACGA GCTCGGCCTG GCCGCCGGCC ACCCGCGCGC CATCGGCACC
GTGCTGGCCG AACACCCGAG CACCCGCAGC GACGACCTGC AGAACCTGGT GTGGCTGCAG
ATCGGCAGCG GTGTCAACGC CTTCGAGCTG CAGGCCGCGA TCGCCGCGCT GCCCGCGCTG
GCCGGCGACA CCGAAGGCCG ACGCTTCGCC AACCTGAGTG GCGGCAGCGA CGGCGCCGCA
CCCGGCGTGG GCGCCGCCAC CAGCGTCGGC AGCTACGCCC ACGCGCTGGC GCAACTGCTC
GCGCTCGAAG ACGTCGCCAT CGTCGCCGCC CCCGGCAGCA GCGCCTACGC CGACGCGCAG
GCGGTGCAGA ACGCGCTGAT CGGCCACGCC GAAACGCGCC GCGCCTACCG CATCGCCGTG
CTCGACACGC CGCCGCTGCA GACCCCCGGC CAGGTGCGCG ATGCGCGTGG CCGCATCGAC
AGCAAATACG CCGCGCTCTA CTACCCGTGG GTGGTCACGC CCAACCCGCT GGCACGGCCC
GGCCGCGACG ACATCCCGCG CGAGATCACG CTGCCGCCGT CGGGCTTCGT GGCCGGCGTC
TACGCCCGCA ACGACATCGA ACGCGGCGTC TACAAGGCGC CGGCCAACGA GGTGGTGCGC
GGCGCGCTGC GTTTCGAGAC CGACATCAAC TTCGCCCAGC AGGAGGTGCT CAACCCGATC
GGCATCAACT GCCTGCGCTA CCTGAGCGGG CGCGGCTACC GGGTGTGGGG TGCACGGCTG
GCGTCGAGCG ATCCGGAATG GAAGTACGTC TCCGACCGGC GCTACTTCAA CTACCTCGAA
TCGTCGATCG ACCGCGGCAC GCAGTGGGCG GTGTTCGAGC CCAACGGCGA GCGCCTGTGG
GCCAACGTGC GGCAGACCAT CTCCGACTTT TTGTACAACG AATGGCGCGG CGGCGCGCTG
CTCGGCGGCT CGGTCGAAGA GGCGTTTTTC GTGCGCTGCG ACCGCAGCAC CATGACGCAG
AACGACCTCG ACAACGGCCG CCTGATCTGC CTGATCGGCG TGGCCATCAT CAAGCCGGCC
GAGTTCGTGA TCTTCCGCAT CGGCCAGAAG ACCGCCGACG CCCGCGCCTG A
 
Protein sequence
MPEYLAPGVY VEETSFRAKS IEGVGTSTTA FVGPTRRGPV GTRSELITSL PDFERYYGSL 
ENLALSDVSA AANRLNFLAH AVRAYYDNGG SRLYVVRTAN GAGSASAPLI LTGAPPAEPA
TEADRVTARA RFPGASGNGR LLLREVPRPA TARTLDTAPA GSLARVTVSS GAPPVDTVTL
YQKVGNLWQD SATPAGTLTV ADPIRAELLT LAAEFIDADG YVQGWDELGL AAGHPRAIGT
VLAEHPSTRS DDLQNLVWLQ IGSGVNAFEL QAAIAALPAL AGDTEGRRFA NLSGGSDGAA
PGVGAATSVG SYAHALAQLL ALEDVAIVAA PGSSAYADAQ AVQNALIGHA ETRRAYRIAV
LDTPPLQTPG QVRDARGRID SKYAALYYPW VVTPNPLARP GRDDIPREIT LPPSGFVAGV
YARNDIERGV YKAPANEVVR GALRFETDIN FAQQEVLNPI GINCLRYLSG RGYRVWGARL
ASSDPEWKYV SDRRYFNYLE SSIDRGTQWA VFEPNGERLW ANVRQTISDF LYNEWRGGAL
LGGSVEEAFF VRCDRSTMTQ NDLDNGRLIC LIGVAIIKPA EFVIFRIGQK TADARA