Gene Tcr_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1654 
Symbol 
ID3760932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1806652 
End bp1809027 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content44% 
IMG OID637786391 
Productorganic solvent tolerance protein 
Protein accessionYP_391920 
Protein GI78485995 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATTC GCACTCATTC ATTTTTGCCA CTTTTTTCAT GTCTCATCTT ATGGGGACCG 
CTGTTTGCGA AGGCGGCCAC CAATGAAACA CCCCTCGCAC CGCTTGCGAC AAGCAGTCAA
ACCAAGCCTG AGTGTTTACC CAATTGGATT GCGCCACCGC AATACATTCC AGACGACTCT
GCGTTGAATT CCGCGCAATC CGATACGCTG CAACAACCCA ATTCTCTTAC CTACAAGTTA
ACCGGAAATG TGGTTTTAAA ACAGCCAGGC CTGGTGGTTT TATCCAATCA TGTCCGTTTA
AACCGCCAAA CACAAGAAGC GAATATATTC GGTCAAGTGC AACTGCATCG TAAAGATTTA
ATTGTCACAG GTGATTCCGC TCGCATCGAC GAGCAGGCCA AAACGGCACA AATCAAACAC
ACCAAGTTTC AGTTTACCGC CAACCGATCG CACGGTACGG CGAAACAAGT CGACATCAAC
CAAACAACGC AACTGGCTCA GCTGGACGAT GCAACGTACA CCACCTGCCC GATTGTCGAG
TACAGTTGGC AAGCGCGTAA TGGTAATGTC GTCACCGACA GCAAGTACGA CTGGGAACTC
GATTTTGATC GATTGGACAT TGATAACAAT CGCCGCCGAA TTTACGGTTA CAACACCGTA
TTGTATTTCC AAACCGTCCC GGTTTTTTAT ACACCATATA TCGACTTTCC CATGGACGAT
CGGGCCAGCG GTTTTTTATT CCCGACCATT GGAAGTTATC GCTCACTCAC GCGTGAAACG
GCTGAAAATT ATGTCGCGAT CCCATATTAT TTTAACTTAG CGCCTAACTA TGATGACACC
TTAACCGTTC TTAAAATGCA AGACCGCGGC TGGGTTGTCG AAAACGAATT TCGCTACTTA
CAACCTCATC ACAACGCCGA ACTAACCTTA ACGGGACTGA ATGACCAAGT CACCCAAAAA
GAAGGGTTGA GTTATATTGA TGCCAGTGGG CAACCGGCTT ATGGCAAAAA AATCGATCAG
CGCTGGCGCG GAAAATTAAT CGCCAATCAA CAATGGGGCT CTGGATTCTC AAGCGACCTT
TTATGGCATG AAGTGTCGGA TAAATATTTT TACACCGACA TTCCGGTGGA ATCGGCTTTA
GATACGGTAT CTTACACGCC GCGTTATGCC AGTGTTAATT ATGCCAAAGG CAATTTACAA
GCGGGTGTCC AATTGCTGGA TTACTTACGT TTGAGAGAAA CAGCGCCATA CAATTACGAA
AAACGCCCAG AAGTGACATT AAACTATTAT CGCCCGTTTG AATCAGGTTT TTTTGAAAAC
ACCAGCGTTA ATTTAGCAGC GGAGTCCACC GAGTTTCAAA TTTCAACCAC TGGACACACC
AAGCCAGAAG CGCTCAGAAC GGTACTGTCT CCTTCAGCTC AATACAACCT TCTCAAACCC
TATGGCAGTT TGAAAGCTGA AATTGTCGCC AATAAAGTCA ACTATTTCAT GGAAGACAAT
GGGTACAACA ACACTGGAAG TTCAGAACAT AATATCAACG TACCTCAATA CGCCTTAAAA
GGCGGGCTTA TCTTTGAAAG GGACTTTACG CTGGGTGACA CAGCCATGGT TCAAACACTT
GAACCAGAGC TACAATACTT GTATGTACCG TATCAAAAGC AATCTCAAAT TCCTTTATTC
GACACCGTCT ATAAGAGTTT GGATTTCAGT AATCTGTTCA CTTACAATCG CTTTTCCGGG
ATGGATCGTA TCGGCGATAC CAACCAGGTT TCAGCGGCGT TATCCACACG CTTCTTAAAA
CAGGATGGTC GGCCGATAGC GGAAGCGGGC ATTGGGCAAA TCTTTTACTT AGAAGACCGC
AAACTGACCC TAAATGACAC GCCAACCCAA ACGGAACTGA CACAAAACAC AGCCCATGTT
TCAGATTACT TTGTCAAACT TGGAATGACG GTTGGCCCTT TTCAATTTGC CTCCACCAGC
CAGTATTCTT ATCACAATTA TGAACTCACC AACGCCAACA ATCGCTTAAA ATACGATGTG
TCTCCTCGCT TTAAATTTTT AATGACGAAC ACAATCACCA ATAACAACTT ACCTGGTGAA
CAAGAAGACT TAGCGGCGGG ATTAAACTGG CAAATCAATG ATAAATGGGC GCTTGGCAGT
TACATCAATT ATAACTTTAC GCAAGAGCGT AAAACAGAAG TTCAAAATGC GTTACGCTAT
GACAGCTGTT GCTGGGCTTC AGAGCTATCT GTTAAAGAAA CACAACTCGA TAATGGCCTG
TATAATTACA GCATCCAATA TTTAATCGAA TTCAAAGGAC TCAGTTCGGT CGGAACACCG
TTTAAAAAAT ATTTAAATAA TAAGCTGAAT TTTTAA
 
Protein sequence
MSIRTHSFLP LFSCLILWGP LFAKAATNET PLAPLATSSQ TKPECLPNWI APPQYIPDDS 
ALNSAQSDTL QQPNSLTYKL TGNVVLKQPG LVVLSNHVRL NRQTQEANIF GQVQLHRKDL
IVTGDSARID EQAKTAQIKH TKFQFTANRS HGTAKQVDIN QTTQLAQLDD ATYTTCPIVE
YSWQARNGNV VTDSKYDWEL DFDRLDIDNN RRRIYGYNTV LYFQTVPVFY TPYIDFPMDD
RASGFLFPTI GSYRSLTRET AENYVAIPYY FNLAPNYDDT LTVLKMQDRG WVVENEFRYL
QPHHNAELTL TGLNDQVTQK EGLSYIDASG QPAYGKKIDQ RWRGKLIANQ QWGSGFSSDL
LWHEVSDKYF YTDIPVESAL DTVSYTPRYA SVNYAKGNLQ AGVQLLDYLR LRETAPYNYE
KRPEVTLNYY RPFESGFFEN TSVNLAAEST EFQISTTGHT KPEALRTVLS PSAQYNLLKP
YGSLKAEIVA NKVNYFMEDN GYNNTGSSEH NINVPQYALK GGLIFERDFT LGDTAMVQTL
EPELQYLYVP YQKQSQIPLF DTVYKSLDFS NLFTYNRFSG MDRIGDTNQV SAALSTRFLK
QDGRPIAEAG IGQIFYLEDR KLTLNDTPTQ TELTQNTAHV SDYFVKLGMT VGPFQFASTS
QYSYHNYELT NANNRLKYDV SPRFKFLMTN TITNNNLPGE QEDLAAGLNW QINDKWALGS
YINYNFTQER KTEVQNALRY DSCCWASELS VKETQLDNGL YNYSIQYLIE FKGLSSVGTP
FKKYLNNKLN F