Gene Ccel_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3371 
Symbol 
ID7311936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3907868 
End bp3909853 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content40% 
IMG OID643610274 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionYP_002507640 
Protein GI220930731 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATTGGT ATTTATTACT GGCAGCTATT CTGCTGCCTG CACTAAGTGC GGCATTATGC 
TTCATTCTAA AGAATCAGAA GGTGCGTATT GCTGTAGTAA GCATTTCTAC TCTACTGCTT
TTTGCAGTTG CAGGCGGCTT CATAGCACTT ATAATGAACT CTCCTGAAGG GAAACTTGTT
TTCCAGATGG ATGAGCATGT ACTCGAGATT GTGGGCTGGT TGATTAAAGG ATTTGACTTC
CTGCTCTTAT GCTACATCGG GTATTTCGGT ATAAAGCATA AAAAAGCAAT AGTACTAGCA
TTAACAGTAT TACAATTTAT TCCTTGGGTT TTATTTGAGA TATTCGGAGT GTTTGGCAAG
GTTCACGAGC CTGCACAGGC ATTTGTAATT GACCATCTGT CATTAATCAT GATAATGCTT
ATTTCAATTA TCGGACCGAT TGTTACATTT TTCGGACTCG GTTATATGAA GGAGCATGAA
CATCACTTGC ATCTAAAAGT ATCAAGACAA CCAAGATTCT TTATGATACT TTTCATATTC
CTTGGTGCAA TGAATGCATT GGTTATGACC GACAACCTTT CCTGGATGTA TTTTTTCTGG
GAAGTAACAA CACTTTGCTC ATTCCTGCTT ATATCACATG ACGGGACTGA AATTGCAGTG
AAAAACGGTT TAAGAGCTTT ATGGTTAAAC TCAGTGGGCG GTGTAGCTTT TATTATCGCT
ATATTAATGG TTAACACCAG TCTGGGCACA CTTTCTATAC AGGCTATTTC AAATTCAGGT
ATGCAGGGTG CTTTCGCAGG GATGATGCCA GTAGCACTTG GACTCCTGTG TATAGCAGGT
TTTACAAAGT CGGCACAGTT CCCGTTCCAG AGTTGGTTGT TGGGAGCCAT GGTTGCACCT
ACACCTGTAT CTGCATTGCT TCACTCAAGT ACAATGGTTA AAGCGGGCGT GTATTTGATT
ATCAGAATTG CTCCTGCTTT TGCAGGAACC AGACTTGGAC AACTGGTAGC TGTTGCCGGA
GGCTTTACTT TTGTAGCCGC CTCAGCCATA GCAATAAGTC AGAGTAATGC AAAGAAGGTT
CTTGCTTATT CAACAATAGC AAACCTTGGC TTGATAATAT GCTGTGCAGG TATCGGTACT
AGGATATCAC TTGCGGCTGC TGTTCTGCTC ATGATTTTCC ATGCTGTATC AAAGGGATTG
CTCTTCCTGT GCGTGGGTAC TATTGAGATG GGTATTGGAA GCCGTGATAT CGAAGATATG
CAGGGCTTAT TCAAGAAGAT GCCGTTTACT ACCGTTATTA CAGTAATCGG TCAGGTTTCG
ATGCTCCTGC CGCCTTTTGG TGTTTTGATA ACAAAGTGGA TGGCAATAGA GACAGCTATA
CATCTTCCGC TGGTGCTTGT GTTTATAATA CTTGGAAGTG CATTTACAAT TGTATTCTGG
TCAAAGTGGA TAGGAATAAT CCTTACAATG TCGTACAAAC CTAAGTATTA TATGGAAAAG
CTATCTTTTT CAATTAAAGC TGCTTTATCT GTGTTAATTG CCTTGGTTAT GGCCGCAAGT
ATAGGAATAG TACCTTTATA CAATCAGTTT GTAAAGCCTC AGATAACAGC ATTTAATATT
GCCGACAAAG AGATGCTTTC AGGAACCGGT ATGGGTTTGT GGTTGGAAAA GGTTAATCAA
GTTGTATATG GAGGTTTCTC TTCCATTATA TTCTTCGGGG TGATTTTATT ATTGATAGTT
GCAATACCTA TCATTATAAA CAGAATCAAA CCAGCAAGAT TAAAGCCGCC GTATCTTTGC
GGCAGTAACG TAGATGATGA TTTGAGAGGG TTGGAGTTTG TAAGTCCAGC TGACAAGGTT
GAAAAGGTAG TTGTTAGAAA CTACTACATG GCATCTGTTT TCGGTGAAAA CAAGCTGACT
TTCTGGTCTA ATCTTTCGGC GGGTGCAATT ATTATAATTA TGTTAGGGGT GGTGATCGGA
TTATGA
 
Protein sequence
MDWYLLLAAI LLPALSAALC FILKNQKVRI AVVSISTLLL FAVAGGFIAL IMNSPEGKLV 
FQMDEHVLEI VGWLIKGFDF LLLCYIGYFG IKHKKAIVLA LTVLQFIPWV LFEIFGVFGK
VHEPAQAFVI DHLSLIMIML ISIIGPIVTF FGLGYMKEHE HHLHLKVSRQ PRFFMILFIF
LGAMNALVMT DNLSWMYFFW EVTTLCSFLL ISHDGTEIAV KNGLRALWLN SVGGVAFIIA
ILMVNTSLGT LSIQAISNSG MQGAFAGMMP VALGLLCIAG FTKSAQFPFQ SWLLGAMVAP
TPVSALLHSS TMVKAGVYLI IRIAPAFAGT RLGQLVAVAG GFTFVAASAI AISQSNAKKV
LAYSTIANLG LIICCAGIGT RISLAAAVLL MIFHAVSKGL LFLCVGTIEM GIGSRDIEDM
QGLFKKMPFT TVITVIGQVS MLLPPFGVLI TKWMAIETAI HLPLVLVFII LGSAFTIVFW
SKWIGIILTM SYKPKYYMEK LSFSIKAALS VLIALVMAAS IGIVPLYNQF VKPQITAFNI
ADKEMLSGTG MGLWLEKVNQ VVYGGFSSII FFGVILLLIV AIPIIINRIK PARLKPPYLC
GSNVDDDLRG LEFVSPADKV EKVVVRNYYM ASVFGENKLT FWSNLSAGAI IIIMLGVVIG
L