Gene Tcr_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_2046 
Symbol 
ID3761963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp2257144 
End bp2258457 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content47% 
IMG OID637786795 
ProductThiol-disulfide isomerase and thioredoxins-like 
Protein accessionYP_392310 
Protein GI78486385 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0526] Thiol-disulfide isomerase and thioredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0189188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAATA ATACACTGCG TATTGGACTG ATACTCGGAT GGTTTTGCAT CAATGCATTT 
CATGTTGCTG TTGCCGAGGT TCCGCATGAG CATACGCTGG AAGGTGCGTT GGAATCGGAA
GTGTATACGG CCACCACACC GATTGCGAAT GTGCTGTGGG TGCCTTCCGA GCACGGGGTG
CTTAAACAAG AACAGGCCTT GGCGGAACAA TTGGCTGAAT CGGGATTTAC TGTGACCATG
CCAAATCTGT TCGAAAGTTA TTTTCTGCCG GTGGCGTCTA GCAGTCTAAG AAAAATCCCT
TCCAATATTA TTGAACGTGA AATCGCTCGG CTGCATGCCA GCGACCTTCC GTTGTTTGTG
ATCAGTTCGA ATGAAGGTGC GGCGCTGGTC ATTAAGGCGC TTGCCTCCTT TCAACAAACG
TCAACGTCGA TGGTGGGCGT GGTCTTGTTG AACCCCAATC TCTATATTGA AACACCACAA
GCAGGACAAA AAGCAGAGTA TTGGCCAACG GTATCACAGG TCAATGCGCC GGTGTACATC
ATTCAGTCTG AGCTATCGCC TTGGCGTTGG CACTTACCTC AGTTACAGCA GCAGCTGAGT
TTGTCCGGTT CGGATGTTTT TATTCGTTTG ATGCCAAAGG TACGGGATCG TTATTATTTT
CGTCCGGATG CGCTTCCAGT CGAGCAAAAG CAAGCACAGA CGTTAGCGTC CGATTTGATG
CAGGCGATGA AAACATTGGC TCCATATTTA CCGGTGTTTC GCGAATCTGC GTTGGCCAAA
AACGCCTCAC CTGAGGAGAG AAGGAATGGT GTTGCCGTAA CCCAATCCCG CTCTCAGTCA
ACGGATAAAA CCGGATTGCA ACCTTACTCT GGTACGCAGC AACGCTATCT AAAACTCAAT
GATATAAACA ATCAATCACA CTCGTTGGAC GCTTATCAAG GCAAGGTGGT TCTGTTGAAT
TTTTGGGCAA GTTGGTGTCC ACCCTGTGTG CATGAAATCC CTTCGATGAC ACGATTGAAA
ACGGTGTTGA AAGACCAACC GTTTGAAATT CTGGCGGCGA ATTTGGCAGA AGAAAAATCC
GATATTCAAG CTTTTTTAAA GCAACACCCG GTCAATTTTC CGATCTTACT TGATCCGAAA
GGATCCGCCG TGCAGGCTTG GCAGGTTTTT GCTTATCCAA GTTCTTACCT GATCGATGGC
AACGGCAAAA TTCGTTATGC CTTATTTGGT GGGCATGAAT GGGATGATCC GCTGACGGTA
CAGAAAATCC AATCACTTAT TCACAAAACG ACGACTTCCA CGAAGACACC ATGA
 
Protein sequence
MLNNTLRIGL ILGWFCINAF HVAVAEVPHE HTLEGALESE VYTATTPIAN VLWVPSEHGV 
LKQEQALAEQ LAESGFTVTM PNLFESYFLP VASSSLRKIP SNIIEREIAR LHASDLPLFV
ISSNEGAALV IKALASFQQT STSMVGVVLL NPNLYIETPQ AGQKAEYWPT VSQVNAPVYI
IQSELSPWRW HLPQLQQQLS LSGSDVFIRL MPKVRDRYYF RPDALPVEQK QAQTLASDLM
QAMKTLAPYL PVFRESALAK NASPEERRNG VAVTQSRSQS TDKTGLQPYS GTQQRYLKLN
DINNQSHSLD AYQGKVVLLN FWASWCPPCV HEIPSMTRLK TVLKDQPFEI LAANLAEEKS
DIQAFLKQHP VNFPILLDPK GSAVQAWQVF AYPSSYLIDG NGKIRYALFG GHEWDDPLTV
QKIQSLIHKT TTSTKTP