Gene Ccel_2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2087 
Symbol 
ID7310788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2445332 
End bp2446672 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content38% 
IMG OID643609021 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002506412 
Protein GI220929503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000442741 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATTTA AATTTAAAAG ACCTAGGATT TATGGAAATG TTTCAGATGA CAATTACGAG 
CTGAGCCGTA GGCGTTTTAT TCTTGAGGGT TGCCTTTCAA ATGGTGTCTA TACCCTAACA
GCAGGAGCTT TTTTTGCTGG ATATGCTAAA TTTCTAGGAG CATCAGATCA GATAATAGGC
CTTATTGTTG CGATGCCTTT ACTTGCTAAT ATTCTTCAAA TGTTCAGTCC AATATTTTTA
GAGAAGCTTA CCAGCAGGAA AAGATTGATT GTCACAACTA GTCTTTGTTA TAGGTCATTG
CTTGGACTCA TGATAGTTAT TCCTCTGTTA ACACAGAATA CGTCAGCCAG ACTGCTTTTG
TTGGCAGGGA TGTATCTTAC GGCCTATCTT ATTTTCAGCT TTTCAAACCC CGCAGGAGGC
AGTTGGATAA TAAGTCTTGT TCCTGAGAGA TACAGAGGCA GGTATTTTGG ACTGAGGGAT
ACCTTCATCA TTTCGTCAGC GGCTGTTCTC TCCCTATCTA TGGGAAGGGT ACTTGATATA
CTCAAAGTTT CCGGAAAGGA GTTTTTGGGG TTTATTATCG TTTTTTCTTT GGTATTGGTA
TTGGTGGTAT TGGATATTTA TGTACTTAAT AAGATTAGGG AACCCAAAAT AGTACCTATT
AAGCAGAATG TAAATGTAAA GAGTCTGTTT ACTCTGCCAC TAAAGAATAA GCAGTTCCGT
CCGGTTATTT TTTTAAATGC TACATGGAGT TTTGCAGCAC AGCTTGCTCT TCCTTTCTTT
TCGGTTTATA TGGTAACAGG TTTGGAACTA TCATATACGT TTATAATGGC AGCCAACATT
TTAATGTCGG TAGTACAAGC TTCTACGGCT AAATTGTGGG GAAGGCTGGC TGACAAGTTC
AGCTGGGAGG TTACAACTAT TATTTCCATA GGAATGTTAG GTCTGTGTCA CTTGACTTGG
GCATTTGTAA CCAAGGAAGT TTGCTATTTG ATTATTCCCT TTATACAAAT TTTAGCAGGA
GCCGGATGGT CAGGTGTTAA CATGTCTCTG TTTAACATTC AGTTCAAGCA CGCACCACAG
GAAGGACGCA CAATTTTTGT TGGTTTTAAT GCTGCGATAG CAGGAGTGAC GGGATTTGCA
AGTGCATTGC TTGGAGCATT CCTTGTTGGA GTATTAAGTA ATGTAAAAAT TGATATTGGG
ATAACTGTAC TCAACAATAT GTTGATAATT TTTGGGATAT CCGGCACGTT GGTAATTTTA
TGTGCAGTAT TTTTTGCCTT AAAATTCTGG ACAAAAAATA AGAAAAGAAA AAATAAATCT
GAAAAGAATT TTGCAGGATA A
 
Protein sequence
MVFKFKRPRI YGNVSDDNYE LSRRRFILEG CLSNGVYTLT AGAFFAGYAK FLGASDQIIG 
LIVAMPLLAN ILQMFSPIFL EKLTSRKRLI VTTSLCYRSL LGLMIVIPLL TQNTSARLLL
LAGMYLTAYL IFSFSNPAGG SWIISLVPER YRGRYFGLRD TFIISSAAVL SLSMGRVLDI
LKVSGKEFLG FIIVFSLVLV LVVLDIYVLN KIREPKIVPI KQNVNVKSLF TLPLKNKQFR
PVIFLNATWS FAAQLALPFF SVYMVTGLEL SYTFIMAANI LMSVVQASTA KLWGRLADKF
SWEVTTIISI GMLGLCHLTW AFVTKEVCYL IIPFIQILAG AGWSGVNMSL FNIQFKHAPQ
EGRTIFVGFN AAIAGVTGFA SALLGAFLVG VLSNVKIDIG ITVLNNMLII FGISGTLVIL
CAVFFALKFW TKNKKRKNKS EKNFAG