Gene Lcho_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3038 
Symbol 
ID6162110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3356080 
End bp3357684 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content74% 
IMG OID641665813 
Producthypothetical protein 
Protein accessionYP_001792063 
Protein GI171059714 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000011603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCTCG TCTTCGAAGT TGCCAGCGCG CCGGTCGAAC CCGATCGGCA GCGCGCCGAC 
GTGGCCTGTT TCATCGGCCA CGTGGCGCGC CGTCGCGGGG CGGCGCTGCC GGCGTCGGTG
CGGGCGCAAC TGCGGGCCGA CGGCTGGATC GACGGCGTCT GGGCCCGGCC GGCGGTGCAG
GTCGAGGCGC TCGACAACCT GCCCGTCGTG ATCGACAACT GGGCGCTGTT CGACCAGCTC
TACGCCTGGG AGCGACGCCC GCTCGCGGCC ACGGGCAGCG CCGTCTGCGC CACCTACCTG
GGCGCCGCGC TGCGCAGCTT CTTTGCCCGC GGCGGCCGGC GCGCGATCGT GATCCGGGTC
GGCGATCCGT GGCCCTTCCT CGAAGACGAC ACCCAGCGCA GCGCCCAGTT ACGCCCACGC
CTGCGCCGCC TGATCCCCGA CTTCGCCCAG ACCGTCGCGC CCAGCCGCCC GTTCGCACCG
CACGATCCGA GCACCTGGCA AGGCATCGAA CACCTCTACG GCCTGCGCGA CAACAGCTTC
GTGCTGTTCC CCGACCTGGC CGACGCCTGC GCCAGCCAGC CGGTGGCGCC GCTCGCCAGC
CCGGCCCAGA CCACGCCCGA AACCTTCGTC GAATGCAGCG TCGAGGCCGA GCCGCCGCCC
GACACCGGCC TGCGCCGCCT GCCCGCGCCG CGGCTCGACA GCGCCGGCTA CGCCGCCTGG
ATGCTGGCGG TCGGGGCCGC GCGCGGTTTC CTGGCACGGC GCCAGCGCGA GCACCTGCTG
CTGGCCTCGC TGCCGCTGCC ATGGGTCGAC ACCCGCCGCA CCGCGAGCGG CGGCGCGGCG
GTGCACGCAC AGGCCGACAT GCGCGCCTAC CTGGAGCGCA TCGGCGTGCT GCGCCCGGAC
GGCAGCCGCG CCCCGGACGA CGACACCGGC AGCGCCAGCG CCTTCGTGCA GCTGGCGTGG
CCGTGGCTGC GCAGCGCCGC CGGCACCGAT CTGCCCGAAG GCCTGGAGCC GCCCGAAGGC
GTGCTCGCCG GCCTGCTGGC CAGCAACGCC ACGCGGCGCG GCTGTTTCCG CTCGGTGGCA
GGCGACTTCT CGCTGCCGTA TCTGCGTGAC CTGTTCGATG CGGAGCCGCC GCTGTCGTGG
GGCCTGGGCG AAGGTGGCGC GGTGCAGCGG CTGGCGCGCC AGGTGTGTGT CTTCACGCCC
GGCCCGCTGG GCTGGATGCT GCAGTCGGAC GTGACCACCT CGCCGCAGGA GGCCTGGCGC
TTCGGCGGCG CCAGCCGGTT GCTGGCGAGC ATCCTGCGCA CCGCACGCGC GCAGGGCGAC
CACCTCGCCT TCGAGACCAA CGGCACGCAG ACCTGGGCCC GGCTGCGCCG CAGCCTCGAA
GACCTGCTGC TCGGCTACTG GAACGAAGGC GCCTTTGCCG GCGCCAACGC GGCGCAGGCC
TTCCAGGTGC GCTGCGACCG CAGCACCATG ACGCAGGCCG ATCTCGACGC CGGTCGGCTG
ATCGCCGAGA TCAGCGTGCG GCCGGCGCAG GCGATCGAAG TCATCACGGT GCGGCTGCAG
CTGGGCAATG CGCTCGGTGC GACCGGACTG CGGGAGGCGG CATGA
 
Protein sequence
MSLVFEVASA PVEPDRQRAD VACFIGHVAR RRGAALPASV RAQLRADGWI DGVWARPAVQ 
VEALDNLPVV IDNWALFDQL YAWERRPLAA TGSAVCATYL GAALRSFFAR GGRRAIVIRV
GDPWPFLEDD TQRSAQLRPR LRRLIPDFAQ TVAPSRPFAP HDPSTWQGIE HLYGLRDNSF
VLFPDLADAC ASQPVAPLAS PAQTTPETFV ECSVEAEPPP DTGLRRLPAP RLDSAGYAAW
MLAVGAARGF LARRQREHLL LASLPLPWVD TRRTASGGAA VHAQADMRAY LERIGVLRPD
GSRAPDDDTG SASAFVQLAW PWLRSAAGTD LPEGLEPPEG VLAGLLASNA TRRGCFRSVA
GDFSLPYLRD LFDAEPPLSW GLGEGGAVQR LARQVCVFTP GPLGWMLQSD VTTSPQEAWR
FGGASRLLAS ILRTARAQGD HLAFETNGTQ TWARLRRSLE DLLLGYWNEG AFAGANAAQA
FQVRCDRSTM TQADLDAGRL IAEISVRPAQ AIEVITVRLQ LGNALGATGL REAA