Gene Acel_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2056 
Symbol 
ID4484735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2331234 
End bp2332817 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content63% 
IMG OID639730852 
ProductNa+/solute symporter 
Protein accessionYP_873814 
Protein GI117929263 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTC CGTTCAACGG CACCGAGTTT GCCCTCGTCG TTGTTCTCTT TGTGTTCGTC 
GCGGTACTCG GTTTTCTCGC GGCGCGTTGG CAGCGTGGCG AAACGCTTGC GCATATCGAT
GAATGGGGCT TGGGCGGCCG CCGTTTCGGC ACGTGGGTCA CGTGGTTCCT CGTCGGCGGT
GACCTCTACA CCGCGTACAC GTTCGTGGCC GTTCCCGCCC TGGTCTTCGG ATCTGGCGCG
CTTGGATTTT TTGCCGTGCC GTATACCATT GTGGTCTATC CGATCGTCAT GTTCGCAGCC
GCCCGCATGT GGTCGGTGGC CCGGCGGCAT GGTTTTGTTA CCGCTGCAGA TTTCGTCCGG
GGCCGGCACG GTTCCCACGC GCTCGCCGTC GCGACCGCGC TCACCGGCAT CGTGGCAACC
ATGCCATACA TCGCCTTGCA GCTGGCCGGC ATCAGAGCGG TGATCGAGGC GGCAGGCATC
AACGGCGATT GGCCGTTATG GATTGCGTTC GCCGTGCTCG CTGCGTACAC GTATTCGTCA
GGATTGCGTG CACCGGCGCT GATTGCATTC GTGAAAGACA TCCTGATTTA CGCCGCGGTG
CTCGCGGCCG TCATCTGGAT TCCGTACAAA CTCCACGGGT ACGACCACAT CTTCTCGGCG
GCGAATGCGC ATTTCGCGGC GGCCAAATCC GGCGGCATTA CGCTCAAGGA CACGCAGTAT
CTCGCATACT CCACGCTGGC CTTCGGTTCG GCGCTGGCCC TGTTCCTCTA TCCGCACTCG
GTCACCGGCG TGCTCGCTGC GAAGAGCCGA GACACGGTCA AGAAGAACAT GTCGCTGCTT
CCGGCGTACA GTCTGATGCT CGGCTTCCTT GCGCTGCTCG GCTACATGGC GATCGCGGCT
GGTGTCAAAC CGGCGCTTGG CGCCGGTGGC AAGGCGGATA CGAACACGAT CGTGCCGATG
CTGTTCCACG CGGAATTCCC GGCCTGGATT GCCGGTCTGG CCTTCGGGGC GATCGCCATT
GGAGCGCTGG TGCCTGCGGC GATCATGTCG ATTGCCGCGG CCAACACCTT CACCCGGGAC
ATCTATCGGC CGTACTTCCG GCCGACCGCC TCCCCGGCTG AAGAGGCGTT GGTCAGCAAG
ATTGTGTCCT TGGTCGTCAA ACTCGGCGCC TTGCTCGTCA TCCTCTTCCT GAACGTCAGC
TTCGCGCTCG ACTTCCAGCT GATCGGCGGG ATCATCATTA TCCAGATCCT GCCGGCCGTC
GTCTTCGGGC TCTACACGCG GTGGTTCCAC CGCTGGGCGT TGATGGCCGG CTGGGTCGTC
GGCATGGTCT CGTCGCTCTG GATGCTCTGG CTGACACCGA AGGCGGGCGG ACACGGGCAC
TTCGGCGGGT CCCAGTGGGC GTTTACGCAT TGGGGCATCC ACACGAAGGT GACCATGTGG
ATCGGGTTGA TCACCTTAGT GTTCAACATC GTCGTCGCTG TCGTTCTCAC CCCGGTGTTG
TCGAAAGCCC CGCGCGGCCT CGACGCCACC CGCGACGAGG ACTACCTCGT GGCGGACGCC
TCCGAAGTGC GGGTGCCTGC CTGA
 
Protein sequence
MKVPFNGTEF ALVVVLFVFV AVLGFLAARW QRGETLAHID EWGLGGRRFG TWVTWFLVGG 
DLYTAYTFVA VPALVFGSGA LGFFAVPYTI VVYPIVMFAA ARMWSVARRH GFVTAADFVR
GRHGSHALAV ATALTGIVAT MPYIALQLAG IRAVIEAAGI NGDWPLWIAF AVLAAYTYSS
GLRAPALIAF VKDILIYAAV LAAVIWIPYK LHGYDHIFSA ANAHFAAAKS GGITLKDTQY
LAYSTLAFGS ALALFLYPHS VTGVLAAKSR DTVKKNMSLL PAYSLMLGFL ALLGYMAIAA
GVKPALGAGG KADTNTIVPM LFHAEFPAWI AGLAFGAIAI GALVPAAIMS IAAANTFTRD
IYRPYFRPTA SPAEEALVSK IVSLVVKLGA LLVILFLNVS FALDFQLIGG IIIIQILPAV
VFGLYTRWFH RWALMAGWVV GMVSSLWMLW LTPKAGGHGH FGGSQWAFTH WGIHTKVTMW
IGLITLVFNI VVAVVLTPVL SKAPRGLDAT RDEDYLVADA SEVRVPA