Gene Lcho_0043 details

Gene Information

Locus tag: Lcho_0043
Symbol:
ID: 6160067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 47734
End bp: 49365
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 72%
IMG OID: 641662787
Product: transcriptional regulator domain-containing protein
Protein accession: YP_001789083
Protein GI: 171056734
COG category: [S] Function unknown
COG ID: [COG5616] Predicted integral membrane protein
TIGRFAM ID:
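
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(47734, 49365)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length (1632 bp) and protein length (543 aa).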


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.930122
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCATC GCTACACCAC CGGCGAATTC GAGGTCCGGC CCGACGAACG CCGCGTGCTC 
CTGCGCGGCG AGCCGGTTCC GCTGGGCGCC CGGGCCTTCG ACGTGCTGAT GTGCCTGATC
GCGCAGCGCG AGCGTGTCGT CACCAAGAAC GAGCTGCTCG AACAGGTCTG GCCCGGCATG
GTGGTGGAGG AGAACAACCT CACCGTGCAC GTCTCGGCGC TGCGCAAGGT GTTCGGCGCG
CAGGCGGTGG CGACGATCCC CGGCCGCGGT TATCGCTTCG TGATGGCGCT CGACGAAACG
ACCGGCGCGC CGGCCGTTGC ACCGGTGCCC GCACCCGCGC CCGCATCCGT CGTGGGGGCC
GATGGCCCGA CCGGGCTGAC ACTGCCCGAC CGTCCCTCGA TCGCCGTGCT GCCGCTCGAC
AACCTGAGCG GCGACGCACA GCAGGATCTG CTGATCGAGG GCCTCAGCGA GGACGTCATC
ACCGAGCTGT CGCGTTTTCG CTCGCTGTTC GTGATCGCGC GCAACAGCAG CTTCAGCTTT
CGCGGCCTGG CCCACGGCGA GCGCGACGTG CGCGCCATCG CCCGCGAGTT GGGCGTGCGT
TACGTGCTCG AAGGCAGCTT GCGCCGCGCC GGTGATCGGG TGCGCGTGGC GGTGCAACTG
GTCGATGCGG TGAGTGCCAC CCAGCTCTGG GCCGAGAAGT ACGACCGCGC GGTCGATGAT
CTGTTCGCGC TGCAGGAGGA GCTCACCCGC GCCATCGTCG GCGCGATCGC GCCGCAGATC
GAAGCCGGCG AGTTCCAGAA GATCCGCAGT GCGCGCGGGC GCGATCTCAA CGCTTACACG
CTCGCCATGC GCGCCCGCGA CACCGCGCGC CGCGCCGACC GCGAGGGCGA CGCCACCACG
CGCGACGAGG CGCTGCGGCT GGCGCACGAA GCGGTGGAGA TCGACCCCGG CTGCGGCGTT
GCGCTGGCCA CCATTGCCTT CGTCCAGTGG CAGCAGATCT GGGCCGGCAG CGCCGCCTCG
CCCGTCGATG CGGCCGTGGC CGGCCTGGCC GCAGCGCGCC GGGCGATCGC GCTCGACGGC
TCGGATCACC ACGCCCATCT GTGGAAAGCG ATGCTGCAGC TGTTCACGCA CCAGCACGCC
GCCGGCCTGG CCGACCTGCA GCGTGCTCAC GAACTCAACC CCAACGACGC GTTGACGCTG
AGCCTGCTCG GCCAGTACCT GGCTGCCGAG GCGGACAGCG CGGCCGACGC GGCAACCGGC
GTGCGCCATG TGCTCGACGC CCTGCGCCTG AGCCCGCGCG ATCCGCTGCG CTGGTCGTTC
CTGAACTCGC TCGCGTGGGC CTCGTTTGCC GCGTTCGACC ATGCCGGCGC GGTCGACGCC
GCCAGCCGCG CCCTCGGGGA GGCGCCGCAG TTCCCGCCAG CGCGGCTGTG CCGGGTGATC
GGCCAGGTCG GCCTGGGTGC GATCGACGCG GCGCGGGCCG ATTTCGAGAC CTTGCGTGCG
CTGGCGCCGC AGATGGTCGC CACCCGGCTG GCCGGCGACT GGTCCTACGC CAACCCGGCG
CTGGTGCAGC GCGCCACCGC CTTCCTGCGG GTGGCGGCAG GCCTCGACGA GCCGGGCCGG
CTCGACACCT GA
 
Protein sequence
MSHRYTTGEF EVRPDERRVL LRGEPVPLGA RAFDVLMCLI AQRERVVTKN ELLEQVWPGM 
VVEENNLTVH VSALRKVFGA QAVATIPGRG YRFVMALDET TGAPAVAPVP APAPASVVGA
DGPTGLTLPD RPSIAVLPLD NLSGDAQQDL LIEGLSEDVI TELSRFRSLF VIARNSSFSF
RGLAHGERDV RAIARELGVR YVLEGSLRRA GDRVRVAVQL VDAVSATQLW AEKYDRAVDD
LFALQEELTR AIVGAIAPQI EAGEFQKIRS ARGRDLNAYT LAMRARDTAR RADREGDATT
RDEALRLAHE AVEIDPGCGV ALATIAFVQW QQIWAGSAAS PVDAAVAGLA AARRAIALDG
SDHHAHLWKA MLQLFTHQHA AGLADLQRAH ELNPNDALTL SLLGQYLAAE ADSAADAATG
VRHVLDALRL SPRDPLRWSF LNSLAWASFA AFDHAGAVDA ASRALGEAPQ FPPARLCRVI
GQVGLGAIDA ARADFETLRA LAPQMVATRL AGDWSYANPA LVQRATAFLR VAAGLDEPGR
LDT
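
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The sketch below is one possible way to do that, and to translate the nucleotide sequence and compute a GC fraction. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code; it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). All function names are hypothetical.

```python
# Codon table built in the conventional TCAG order.
# The amino-acid assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGTCCCATC GCTACACCAC CGGCGAATTC"
print(translate(first_blocks))  # MSHRYTTGEF
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and GC content.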