Gene Lcho_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2071 
Symbol 
ID6163530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2253814 
End bp2254953 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content74% 
IMG OID641664840 
Productendo-1,4-D-glucanase 
Protein accessionYP_001791103 
Protein GI171058754 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000124631 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACGC GGGCGCAGTC TGCCAACCCC TGCAGCGCCG TCGAGGGCCC GACCGGCTGG 
CCGGCATGGC AGACCCTGCG CCGGACCCTG ATGAGCCGCG ACGGCCGTGT CATCGACCGC
TACGCGAGCG ACGCCACCAC CTCCGAAGGC CAGGCCTACG GCCTGTTCTT CGCGCTGGTG
GACAACGACC GCGCGGCCTT CGAGCTGCTG TTGCGCTGGA CCGAAGACAA CCTGGCGGCC
GGCGACCTGG CCGCGCGCCT GCCGGCCTGG CGCTGGGGCC GGCGCGCGGA CGGCAGCTGG
AACGTGATCG ACGCCAACTC CGCCGCCGAC GCCGACCTGT GGCTGTCCTA CGTGCTGTCC
GAGGCCGGCC GGCTCTGGAA GAACCGCCGC TACGACGCGC TCGGGCGGGT GCTGGCGCGC
CGCATCGCCG CCGAAGAGGT GATCGAGCTG CCCGGCCTCG GCACCACCCT GCTGCCCGGG
CCGCAGGGAT TTCGCCGCGG CGAACGCGGC GCCAAGCTCA ACCCGAGCTA CCTGCCGCCG
CAGCTGCTGC GCTGGTTCGC GCGCAACCGC ACCGAGTCCG TCTGGGTGCC CCTGCGCGAC
GCCTCGCTGC GCCTGCTGCA CGACAGCGCG CCCCACGGCC TGGCGCCCGA CTGGACCGTG
TTCGACGCCG ACCGGGGCTG GAGCCTGGCC GAACTGGCCG ACGACGAGCG CAGCGGCAGC
TACAACGCGA TCCGGGTCTA CCTCTGGCTC GGCCTGACCG ATCCGGGCGA TCCGGCGCGC
GGGCGCCTGC TCGCCCGCTA TGCGCCGATG GCGCGGCTGT CCGAACTGCT CGGCGGCGTG
CCCGAGAAGG TCGATCCGGC GCGCCCCGCA CTCGAGCAGT CGGCCGGCGC GCAAGCCAAC
GGGCCGGTCG GTTTCCAGGC CGCCATGCTG CCGTTCGCCG ACGCGCTCGG CCAGACCGCG
CTGAGCGAGC GTCTGGCCGA CCGCGTGGCC ACGCAGGGCG TGCAGCCGGA CGCCTATTAC
GACCAGGTGC TGAGCCTGTT CGCGCTCGGC TTCCGCGAGC GGCGCTACCG CTTCGCCGCC
GACGGATCGC TGCAACCGGG ATGGGCCTCA TGCGACGCAC CGCCTGGATC CTCGCGCTGA
 
Protein sequence
MATRAQSANP CSAVEGPTGW PAWQTLRRTL MSRDGRVIDR YASDATTSEG QAYGLFFALV 
DNDRAAFELL LRWTEDNLAA GDLAARLPAW RWGRRADGSW NVIDANSAAD ADLWLSYVLS
EAGRLWKNRR YDALGRVLAR RIAAEEVIEL PGLGTTLLPG PQGFRRGERG AKLNPSYLPP
QLLRWFARNR TESVWVPLRD ASLRLLHDSA PHGLAPDWTV FDADRGWSLA ELADDERSGS
YNAIRVYLWL GLTDPGDPAR GRLLARYAPM ARLSELLGGV PEKVDPARPA LEQSAGAQAN
GPVGFQAAML PFADALGQTA LSERLADRVA TQGVQPDAYY DQVLSLFALG FRERRYRFAA
DGSLQPGWAS CDAPPGSSR