Gene Ccel_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1406 
Symbol 
ID7310181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1708909 
End bp1710069 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content39% 
IMG OID643608328 
Productbasic membrane lipoprotein 
Protein accessionYP_002505740 
Protein GI220928831 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000211605 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAT TAACGGCTTT AATGCTAACA ATTGTATTTT TTTGTATGGC AGTATTATCC 
GGCTGTGGGT TAAGTGCTAC TGACAACAGT GGAAATTCAT CAGCATCTAC CTCTACATCA
GCAGGCTCAA CAGCATCAGA TTCATCAGAT GCAAACGGAT TAAAAGTAGG ATTTATCTAT
ATCGGATCAG TTGGAGATGT AGGATATACC TATGCACATG ATCAAGGTCG TAAATACCTT
GAAACTACTC TTAAAGACAA GAATATAAAG ACATTGGCGG TTGAGAATGT TCCTGAAACG
GCAGAATCCG AAAAGGCAAT AAATAACCTC ATTGACCAAG GTTGTAAAGT TATATTCGCA
ACAAGTTACG GTTACATGGA ATTTGTTGAA AAAGCAGCAA AAGAAAATCC GGACGTTAAG
TTCTTTCACT GCTCCGGATA TAAATCAAAT AGTACTAATT TTGTAAACTA TTTCGGGCAG
ATTGAAGAAG GAAGGTATCT TTCAGGTATT GTTGCAGGAC TAAAGACAAA AACAAACAAC
ATCGGATATG TTGCAGCAAT GCAGATTCCT GAGGTTATCA GAGGAATAGA CTTCTTTACA
CTTGGTGTTC GCTCTGTAAA TCCTGATGCA GTGGTAAATG TGAAATGGAC AAATACCTGG
TACGATCCTC AGGTTGAAAA GGATGCGGCT ACAGCACTTC TTAATGAAGG AAACGATGTT
ATTGCTCAGC ATCAGGATTC AACAGCTGCA CAAATTGCAG CTCAGGAAAA GAGTGCATTT
GCAATCGGAT ACAATTCAGA CAGTAGCAAG GCAGCTCCAA AGGCATACCT GACAGCTCCG
GTGTGGAACT GGGGAGTATA CTATGCTGAT CAGGTTCAGA AGATACTGGA CGGAACATGG
AAAGCTGAAA ATTACCTTGG CGGAATGAAG GATGGAGTTG TTGATATCGG GCCAATATCA
GACTTAGTTG AAGATGACAT AAAAGCTAAG GTTGAAGAAG CAAAGAAAAA GATTCTCGAT
GGTTCATTAA ATGTGCTTGC AGGTCCTATC AAGGATCAGA CAGGTGCAGT AAAGATTCCT
GAAGGAAAAG TTATGACTGT TGAAGAACAG CAGTCCTGTA AATGGTTTGT TGAAGGTGTT
AACGGTAAAA TAAATAAATA A
 
Protein sequence
MKRLTALMLT IVFFCMAVLS GCGLSATDNS GNSSASTSTS AGSTASDSSD ANGLKVGFIY 
IGSVGDVGYT YAHDQGRKYL ETTLKDKNIK TLAVENVPET AESEKAINNL IDQGCKVIFA
TSYGYMEFVE KAAKENPDVK FFHCSGYKSN STNFVNYFGQ IEEGRYLSGI VAGLKTKTNN
IGYVAAMQIP EVIRGIDFFT LGVRSVNPDA VVNVKWTNTW YDPQVEKDAA TALLNEGNDV
IAQHQDSTAA QIAAQEKSAF AIGYNSDSSK AAPKAYLTAP VWNWGVYYAD QVQKILDGTW
KAENYLGGMK DGVVDIGPIS DLVEDDIKAK VEEAKKKILD GSLNVLAGPI KDQTGAVKIP
EGKVMTVEEQ QSCKWFVEGV NGKINK