Gene Lcho_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1941 
Symbol 
ID6162547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2105221 
End bp2106336 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content69% 
IMG OID641664710 
Productprotein of unknown function DUF395 YeeE/YedE 
Protein accessionYP_001790973 
Protein GI171058624 
COG category[R] General function prediction only 
COG ID[COG2391] Predicted transporter component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.368246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCCT CCGAGGCATC GAATCTGACG GCCCAGGTAC TCTGGGCCGT GTTTCTTTTG 
GCGCTGCTCT ACGGGGCGAT CGCCGAGCGC AGCCATTTCT GCACCATGGG CGCGGTCGCC
GATGTCGTCA ACTTCGGCGA CTGGCGCCGG CTGCACATGT GGGCGCTGGC GGCCGGGGTC
GTGACGATCG GATTCAACCT GATGGTGGCC GCCGGCTGGG TCAGCGCCGA CAAGACGCTG
TACGGCGGCT CACGCTGGCA ATGGGCGTCG GCCCTGGTCG GCGGCGCACT GTTCGGCTTC
GGCATGGTGC TGGCGTCGGG TTGCGGCAGC AAGAACCTGG TGCGGCTGGG CGGCGGCAAC
CTCAAGGCGC TGGTAGTGCT GCTCGTGATG GGGCTTTCGG CCTGGATGAC GCTGCGCGGG
CTGACCGCGA TGTGGCGTGT CGAGACAGTC GACCGCTGGG CCGTGTCACT GCCCGGCTCG
CAGGACTTGC CGAGCCTGAT CGCCGCCGGC TACGGCGGAT CGACGCCCGA TGTCGCGCTG
GTTGTCGGCA CCGTCGTCGG CACCGCCTTG CTGGCCTGGG CACTGTGGGC GCGACCGGGC
CGTGACGCCG GACTGCTGCT GGGCGGCATC GGCATCGGCG CCGTGGTGCT GGCGGTCTGG
TGGGTGTCCG GGCGCTTCGG TCATCTGGCC GAGCATCCCG AAACCCTCGA AGAAGCCTTT
CTCGGCACCG CCAGCCGGCG CATGGAGGCA CTCAGTTTCG TGACGCCGGT GGCGCAAGGT
CTCGAGTGGC TGATCTTCTA CTCCGATCAG GGCCGCCGGC TGAGCACCGG CGTGGTGGCG
GTTTGTGGCC TGGTGGCCGG CTCCTGGCTG GTCAACATCC GGCAGAAGAG CTTCCGATGG
GAAGGTTTCG GCGACTCCGC CGATACCGCT CGCCACCTGA TCGGCGCCAC GATGATGGGC
ATCGGCGGCG TGACCGCCAT GGGCTGCACC ATCGGACAGG GCCTGAGCGG CCTGTCCACG
CTCGGGCTGA CCAGCCTGGT CGCGGTGGCG GCGATCATTG CCGGGGCGGT CGCCGGATTG
CGTTTCCTGA ACTGGCAGCT GGAGCGAGCC GCATGA
 
Protein sequence
MQASEASNLT AQVLWAVFLL ALLYGAIAER SHFCTMGAVA DVVNFGDWRR LHMWALAAGV 
VTIGFNLMVA AGWVSADKTL YGGSRWQWAS ALVGGALFGF GMVLASGCGS KNLVRLGGGN
LKALVVLLVM GLSAWMTLRG LTAMWRVETV DRWAVSLPGS QDLPSLIAAG YGGSTPDVAL
VVGTVVGTAL LAWALWARPG RDAGLLLGGI GIGAVVLAVW WVSGRFGHLA EHPETLEEAF
LGTASRRMEA LSFVTPVAQG LEWLIFYSDQ GRRLSTGVVA VCGLVAGSWL VNIRQKSFRW
EGFGDSADTA RHLIGATMMG IGGVTAMGCT IGQGLSGLST LGLTSLVAVA AIIAGAVAGL
RFLNWQLERA A