Gene Ccel_1544 details

Gene Information

Locus tag: Ccel_1544
Symbol:
ID: 7310308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1880960
End bp: 1882234
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 40%
IMG OID: 643608473
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_002505876
Protein GI: 220928967
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
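
The coordinates and lengths listed above are consistent with each other, which the following minimal sketch illustrates. It assumes 1-based, inclusive coordinates and a complete CDS whose final codon is a stop codon that is not counted in the protein length; both conventions are assumptions that happen to match the record's numbers. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1880960, 1882234)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```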


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00314291
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGTA CACAAAATGA GAAGATTTTG CAAATAAAAT TTGAAACATT AGTAGTTGGA 
ATTGACATTG GGAAGGAAAC ACACTATGCA AGAGCCTTTG ATTGCAGAGG ACTAGAACTT
TCAAAGCTCC TTAAATTCAG CAACACAAAT CAAGGTTACG AAGCCCTTGA GGATTGGATG
CAGACAGTGA TGAAGGAGCA CAGTAAGACA GAAGCAATTG TTGGTTTTGA ACCTACAGGA
CATTACTGGT TCACACTAGG AGATCACTTG CAAAGAAAAG GCCATCGTTT GGGAATTGTA
AACCCATTCC ATGTTAAATG TACAAGAGAG CTAGATGACA ATAGCCAAAC AAAGAATGAT
AAAAAAGACC CTAAAACCAT AGCAATGCTG GTCAAGGACG GCAGATTCAG GGATGTATAT
ATTCCAGAGG ATGTTTACCA AGAACTTCGT GAAGCGGTTA GCGAAAGAGA ACGGTTACTT
GAGCAGTTGA TAGGTTTGAG CAATCAGGTT ATACGTTGGC TTGATATAAG GTTCCCAGAG
TTTAATGAGG TATTTAAGGA TTGGACAGGT GATGCAGCTT GGCTGACATT AAAAAACTAT
CCTACACCAG CAAAAATACT GTCTGCAGGA GCTCCAGCAA TTGTAGGTAC ATGGTCAAAG
GAAATGAAGA AGCCTAGCAT AAAAAGAGCT GAAAAACTTG TAAGGCTTGC AAACGTGTCA
ATTGGAAGAA CGGCAGGAAG TGAAGCAGCA GAAGCAGCTT TGCAAAATCT GCTTACACAA
TATGAAATGA TATTAAAACA GAAACAGGAT ACAGAAAGAC TGATGCAGGA ATTACTCATG
AAAGTACCAA ATGCATCAAA ATTAGTTGAT ATTAAAGGGA TTGGAATGGT GGCAGCAGCG
GTCATTGTCA GTGAAATCGG AGATATCAGC CGATTTAAAG ACCCTAGACA AATACAGAAA
ATGGCTGGAC TAAGCTTGCG AGAAAATAGC TCAGGCAAGC ATAAAGGCAA GACTACGATA
AGTAAACGAG GACGAAAACG TTTAAGAGAA GGGTTGTTCA GAGCTATCAT AACAATATTA
GCTACAAACC AAGAATTTCG CATGTTACAT CAGAAGAATC TTGGCAGAGA GAAGAACCCG
CTTAACAAGA TGGAATCCAT AATAGCCCTA TGCGGCAAGC TTATTCGAGT GATTTTTGCA
ATATTGACAA AAGGTAGCGA TTACGATGCA GGTAAAATGA TTGAAGATAT GAAAGCATCG
ATGAAGGCAG CATAA
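
The sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any length or GC calculation. The sketch below shows one way to do that; the helper names are hypothetical, and the example is applied only to the final 15-base line, so its result should not be read as a check of the record's whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


last_line = clean_sequence("ATGAAGGCAG CATAA")
print(len(last_line))          # 15
print(gc_fraction(last_line))  # 0.4
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` would give the whole-gene value in the same way.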
 
Protein sequence
MKRTQNEKIL QIKFETLVVG IDIGKETHYA RAFDCRGLEL SKLLKFSNTN QGYEALEDWM 
QTVMKEHSKT EAIVGFEPTG HYWFTLGDHL QRKGHRLGIV NPFHVKCTRE LDDNSQTKND
KKDPKTIAML VKDGRFRDVY IPEDVYQELR EAVSERERLL EQLIGLSNQV IRWLDIRFPE
FNEVFKDWTG DAAWLTLKNY PTPAKILSAG APAIVGTWSK EMKKPSIKRA EKLVRLANVS
IGRTAGSEAA EAALQNLLTQ YEMILKQKQD TERLMQELLM KVPNASKLVD IKGIGMVAAA
VIVSEIGDIS RFKDPRQIQK MAGLSLRENS SGKHKGKTTI SKRGRKRLRE GLFRAIITIL
ATNQEFRMLH QKNLGREKNP LNKMESIIAL CGKLIRVIFA ILTKGSDYDA GKMIEDMKAS
MKAA
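
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation sketch. It relies on the fact that translation table 11 uses the same codon assignments as the standard genetic code (the tables differ in their permitted start codons, which this sketch does not handle). The function and constant names are hypothetical, and only the first 30 bases and the final line of the gene are translated here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAGCGTA CACAAAATGA GAAGATTTTG"))  # MKRTQNEKIL
print(translate("ATGAAGGCAG CATAA"))                  # MKAA
```

Both outputs match the corresponding start and end of the protein sequence listed above.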