Gene Ccel_1221 details


Gene Information

Locus tag: Ccel_1221
Symbol:
ID: 7310018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1494391
End bp: 1495899
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 39%
IMG OID: 643608142
Product: alpha-L-arabinofuranosidase domain protein
Protein accession: YP_002505557
Protein GI: 220928648
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:
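
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are inclusive and that the gene length includes one terminal stop codon that is not translated.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1494391
end_bp = 1495899
gene_length_bp = 1509
protein_length_aa = 502

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts one stop codon that yields no residue.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions both checks agree with the listed values (1509 bp and 502 aa).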


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACG CAAAAATGAT ACTTAACAAA GACTACGTTG TAGCACCTGT AGACAAAAGA 
ATTTACGGCT CATTTATTGA GCATTTGGGA AGAGCTGTAT ATGGGGGTAT ATATGAACCG
GGGCACCCTT CAGCAGACAA ATTTGGCTTC CGTCAGGATG TATCAGAAAT GATAAGGGAA
TTACAAGTCC CTATAGTAAG ATATCCCGGT GGAAATTTCG TTTCAGGCTA CAATTGGGAG
GATGGAGTAG GTCCTGTAGA TAAAAGACCC CGACGAACGG AATTAGCTTG GGCTACAGTT
GAAACAAACG AAATTGGAAC CAATGAGTTT GTAACATGGG CTAAAGAAGT AGGGACAGAA
GTTATGATGG CGGTTAATCT GGGAACAAGA GGGGTTGATG CGGCCAGAAA TCTCATTGAA
TACTGCAATC TTACTCAAGG AACATACTGG AGTGACTTGA GAAAATCTCA TGGCTACAGT
CAGCCTCACA ACATAAAGAC CTGGTGTCTT GGAAATGAAA TGGACGGACC TTGGCAGATA
GGAACAAAAA CTGCCGAGGA ATACGGAAGG CTTGCTTGTG AAACTGCAAA GGTTATGAAA
ATGGTAGATC CCACAATCGA ACTGGTAGCC TGCGGAAGCT CAGGAAGCGG TATGCCTACC
TTTGCCCAAT GGGAAGCTAC AGTCCTTGAG CATACTTATG AACATGTTGA TTATATTTCA
CTTCATACGT ATTATGGTAA CCAAGATAAT GATACTGCTA ACTACCTGGC AAAAACTATG
GATATGGATG CCTTTATCAA ATCCGTTGTT GCAACCTGTG ATTATGTAAA AGCAAAAAAA
CGCAGTAAGA AAAAAATAAA CCTCTCCTTT GACGAATGGA ATGTATGGTT CCACTCCAAT
GAAGCGGATA AAAAAATTGA CAGATGGTCT ATTGCACCAC CTCAACTTGA AGATATTTAC
AATTTTGAGG ATGCACTTTT GGTTGGAGGT ATGCTGATAA CTCTGTTAAA GAATGCCGAC
AGAGTAAAGA TGGCTTGTCT TGCACAGCTT GTAAATGTTA TTGCACCAAT AATGACAGAG
AACGGTGGAA GTGCGTGGAA GCAGACAATT TACTATCCAT ACCTCCATAC TTCAGTGTTT
GGAAGAGGTA CTGTTTTAAA TACCATTATG AAAGCACCAA AGTTTGATAC TAAAGATTTT
ACAGACGTTT CAGCTATTGA TGCTACAGCA GTAATTAATG ACAACAACGA TGAAATTACC
GTTTTTGCAG TAAACAGACA TATGGAAAAC AATATTAGTC TGGATGTTGA ACTAAATGGC
TTCGGACAAT TTGAAGTTAT TGAACATATT GTTCTTGAAC ATAATGATGT AAAAGCTACT
AATACAAAAG AAAATCCAAA TAACGTTGTA CCAAACAACA ATGGAAATGC TACCTTGGAA
GATGGAAGTA TCAAAGCTTC CTTAAAGAAT CTTTCCTGGA ATGTTATAAG ATTGAAGAAA
GTAAAATAG
 
Protein sequence
MDNAKMILNK DYVVAPVDKR IYGSFIEHLG RAVYGGIYEP GHPSADKFGF RQDVSEMIRE 
LQVPIVRYPG GNFVSGYNWE DGVGPVDKRP RRTELAWATV ETNEIGTNEF VTWAKEVGTE
VMMAVNLGTR GVDAARNLIE YCNLTQGTYW SDLRKSHGYS QPHNIKTWCL GNEMDGPWQI
GTKTAEEYGR LACETAKVMK MVDPTIELVA CGSSGSGMPT FAQWEATVLE HTYEHVDYIS
LHTYYGNQDN DTANYLAKTM DMDAFIKSVV ATCDYVKAKK RSKKKINLSF DEWNVWFHSN
EADKKIDRWS IAPPQLEDIY NFEDALLVGG MLITLLKNAD RVKMACLAQL VNVIAPIMTE
NGGSAWKQTI YYPYLHTSVF GRGTVLNTIM KAPKFDTKDF TDVSAIDATA VINDNNDEIT
VFAVNRHMEN NISLDVELNG FGQFEVIEHI VLEHNDVKAT NTKENPNNVV PNNNGNATLE
DGSIKASLKN LSWNVIRLKK VK
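
The sequences above are displayed in blocks of 10 characters, so the spaces and line breaks need to be removed before lengths or base composition are computed. A minimal sketch of that cleanup follows. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any tool referenced in this record, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGGATAACG CAAAAATGAT ACTTAACAAA GACTACGTTG TAGCACCTGT AGACAAAAGA"
seq = clean_sequence(first_line)
print(len(seq), "bases;", round(gc_percent(seq), 1), "% GC")
```

Pasting the full gene sequence in place of `first_line` would allow the result to be compared with the Gene Length and GC content fields listed in the record.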