Gene Cthe_1753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1753 
Symbol 
ID4810183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2073334 
End bp2074380 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content47% 
IMG OID640107166 
Producttransport system permease protein 
Protein accessionYP_001038167 
Protein GI125974257 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0609] ABC-type Fe3+-siderophore transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000172591 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTTG GGGATAAAAA GTATAGATAC AGGTTTGCCT TGATTTCTGG AATTTTGCTT 
TTATTTTTTC TGGTGCTGTT TTCGGCGACA GTGGGTGCTG CCAATATATC AATATTTGAC
GCTCTGCGTA TTTTGGTTCG CAGGATACCT TTTGCCGGAA GATTTGTGCC GGCAGGAGAA
ATAAGCAGGA CCCATGAGCT TATAGTGCTC AACATAAGGC TTCCCAGAGT TATTGCCGCC
GCAATCATAG GGGCGGGTCT TTCAGCGGTT GGAGCAACGT ACCAGGGAAT GTTTGCAAAT
CCCATGGCCG ACCCTTATGT TTTGGGAGTA TCGGCGGGAG CGGCCCTGGG AGCTTCGATT
GCTATTGTGA TGGGAACCGA CAAGGTGGTT GGAGGGTTTG GCATTATTAC GGCAGTTGCT
TTTGTTTTTG CGCTTCTTAC GGTTTTTATC GTTTTTAACA TAGCAAAAAC CGGAGTCAAA
CTGTCCAACA CCCATCTTTT GCTTGCCGGG GTGGCGGTCA GCTTTTTTGC ATCTTCCGTT
ATGTCGGTAT TGATGGTTTT GAATCGTGAC AAAGTGTCAA ATATTACATA TTGGATGATG
GGAAGCATAG CCTTTACCTC CTGGAGGCAG GTGCTGATAC TTGCTCCCCT GGTTGTGGCA
GGCATAGTTG TTGTTTGCGT TTTTGCCAGG GAGCTTAACA TAATTGCCGT CGGAGAGGAT
GAGGCAAGAA GTCTTGGTGT TGAGGTGGAG AAGGTAAAAA AGCTGCTGCT TGTAGTTTGC
TCGGTTGTTG TTGCGGCCTG TGTGGCGGTA AGCGGCGTTA TTGGTTTTGT GGGACTGATT
GTTCCGCATA CGGTAAGACT GATATCCCGT TCGGACAACA GGGTGGTTCT TCCCTTTTCG
GCAATAGGAG GGGGAATGTT TCTGGTACTG TGTGACACAA TATCCAGGAT TCCCACGGCG
GAAATTCCAG TGGGCGTGCT GACATCAATG TTTGGTGCCC CGTATTTTAT TTCGGTTTTG
ATAAGAAACA AGAAGAAGGT GGTTTGA
 
Protein sequence
MTFGDKKYRY RFALISGILL LFFLVLFSAT VGAANISIFD ALRILVRRIP FAGRFVPAGE 
ISRTHELIVL NIRLPRVIAA AIIGAGLSAV GATYQGMFAN PMADPYVLGV SAGAALGASI
AIVMGTDKVV GGFGIITAVA FVFALLTVFI VFNIAKTGVK LSNTHLLLAG VAVSFFASSV
MSVLMVLNRD KVSNITYWMM GSIAFTSWRQ VLILAPLVVA GIVVVCVFAR ELNIIAVGED
EARSLGVEVE KVKKLLLVVC SVVVAACVAV SGVIGFVGLI VPHTVRLISR SDNRVVLPFS
AIGGGMFLVL CDTISRIPTA EIPVGVLTSM FGAPYFISVL IRNKKKVV