Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_2093 |
Symbol | |
ID | 3761274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 2312296 |
End bp | 2315271 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637786841 |
Product | hypothetical protein |
Protein accession | YP_392356 |
Protein GI | 78486431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCTA AAACGCCTGA ACCATTTATT TATTCCTTTC TCATCTTCTT GTTTGAACCT ACACATGGCA TTGAGATTAA CTCACAGCGC TATTCCAACA TTCGCATCAA ACCCGAACAG GTTGAAGAAT GCTTTGGAAA ATTGGATGTG GGAGTACATG TCTTAGCAAC AGCGTCTCAA ATATATAATA TGCTCGTTGA GCAAGGTAAT ATAGAACAAC TGTGGCAACT GCACTATATT CATGAAATCC ATAAAGTTAA TCGCATTACA AACCCCATTT CTGAACAAAA AGAGTGGCAA CGATACCTAT TTATAGAGAC CTTACATCAA ACCTATCAAG ACGCGATTAC AGCAGTCGGA TTTCAACCTA AATTGCTGAC TAGACAAGAA CAATCCGCCT TGATAATACT CAGTCTGCTA TTTGATAGCG GTGTGCTAAA TCCCAAGTAC CTAATGTCCT TGCTGAGACA ACTTCATGCG CTCGAGAATT ACCAATATTT TAAAGGGCGA TTATTTTTTG AAATTATCAC TCATTCAACT AACCCACCAA AAAAATTCTT TATTTCCAAA CAAACTGAAG TGCTTCTCTA CCGGATAAAA CCTTGGAAAA TGCCATTAAA AAAACAAGAT AAATCTCTTC ATAAAGATCT ACTCGAATCC ATGCGAGCCA TCTTAAAAAA CTGGTCACTG AAGGTATCTA TTTTTCCAAC TAGCTTGCAG GGCTGGTGCC AATGGGTGTC GCTATATCAA TCTAAATATT GGAGTCCTAT CTTACTTGCT TGCAATAAGG GAAAGCACGT CAATCACTCC TTAAACGACT CTGCAACAAA AAGGCTGTTT CGAACTGAAT CTAACATCCA TTCAACGATA GAATTTGATG ACGAAAGTAA TCAGAAGCAA ACAACGTTAC GAAGCTATAC CTTTTCTGAA ATCCGGAAGG TTTTTGATAT CAATGTTGGC AAATCAAACC ATAAAAAACA ATTTGATGAG ATAAAGGCCA AATACGCCTT ACTAGCCCCA AAACTAAAAG ACAACTACTT GTTAATTTTA GACTGGGGAA AAAGCTATAT ACAATTAGAC CATAAGGGTC AATTTAAGAA AAACCCTAAA CAAATACTCA AGAAAATAAG CGCAATTGGC CGCCATCTTA TTGCGGTAGC AAACACACAG TTTTTACAAA AAATCAGCGC CGAAGAAAGA ACCGCTATTT TTAGAGAGGT CATTGAGCAG GCCATTTCAA TAAAAAACAA AAACGACATT CAATATCACT TACGCGACTT TAATATTTGG TTAGAAAAAT CGCATCGTTC GGAAAAAATT CAACATAAGG AAGACGTCTT CGGCACCCCT TCTATGACCG ACATGACAGT CAATGCGAAC CTAATCAGTT TTGATGAATA TGAATCCATA AAATCCTCAC TCACCGAGTT AATCACGAAG TACCCAGAAG ACGAATCTTA CAAAGTCATG TTGGCTATTC TTATACTTGG GTTTAGACTT GGACTCCGTA TAACGGAAGC CATTGAATTA AAGTTTATAG ATTATCTATT TTGTGACACA AGTCCTCAAA TTCTGATTAG AGAAAGTGAA GAAAGAAAAA CCAAGAGCCT TAATGCAAAA CGAGCTCAAA AACTGACGGA TTTTCTGACG GATGATGAAG TTAAATTATT GAATGAGCAT CACCAATTCC AACAAAAACG TTTCGGCCGC TTTACAAATG GTAAAAACCA CTATTATTTC TTTTCTACAG AAGACAGCGG CTGTAAATTA AAATCCGTAG AAGATATTAA AAAGAAGTTA ATGCACCTTA TTCGAGAAGT ATGTAAAGAT ACCAGCCTCA AATATCACCA TTTAAGGCAC AGTTTTGCCT CTTGGCACTT TTTCTCTTCC GCGATCTCTG AATTGGATTT AAATATCGGT GATTACTTTG CTCACCTGCC AAAAACAGAA GCCTGGCTAC AACAAGCTAA CACTCGTAAA CTTCAACACT TACCGACCCA GCTAAAAAGC AAAAAATACC CTTATTGGCT GGCTCAAAGA ATTGGTCATG GTTCGATTGA AACCACTCTA GAGCATTACA TTCACAGCGT AGATCTCATT AATATGCTTT ATCAGGATTC ATTGGTCTCC AATCTAACCA TTAACGACTT GCACGGATTA ACTAACATTC CAATCAGCAC GTTAAAAAAA CAAAAAAATC GTTTAGATTT TGCTTTAACC CGTCTTACAA ATTCCATTCC TAAATTAAAA GCAAAAAAAC ATCAAAGCTT ATTGAGCTTA CGGGAAGAAT GGCTTGCGCC TGAAGAGATT CAAACGTGTA TCGAACCGAT TACTGCGAAT CTGCCCTATT ATCGTTACAT GTCATTTCTT TACCACTCCA GCTCCACAGG CAATATTTCT AACTTAGGGT TTACACGAAA AGAAATTAAG CAATTGACCA GATTATTTCA GGATAACCCT GCATTTCGAA TTCGGGCTTT GAATATTGCT GAGCAATCTC TATTAGCAAG CTATTTAGAT AAAATTATGG ATGTCTATCA ATTTACACCT AATAAAGATA CTGGGTTTCC TCGTCCCCTA GAGGCTATTT TGGATACCTT TAAGAGCAGA CTTCATCCTT ATTCGTCAGA CACGAATAGC TTGCAAATTC AACGCAGCTA TCACTTGATA TTTAGAGATG AAACGAGTGG AAAAATATTG GTCGAATTTG CACAAAAAAT GAATATACCT ATAAAGCTCA TCTTAAGACA CTCTGGTCAA ATGAAACCTT TTCATATTAC AAAAGAAAAG CGTTATTGGA AAAAGCTCTT GGGGTTAAAG CACAGTATTG TATTTACTAC ACAAAAAGAC ACAAACTCAC GCCTAGGTAA TCATGGTCGC CTTGAAATGG TGTTTTTAAG CAATACAGGC AAAAAAGATC ATGCTTTGTA TTTCTTACTT GTAATGTTAA ATGTAGCTAG TCTATGGCAG TCATAA
|
Protein sequence | MPPKTPEPFI YSFLIFLFEP THGIEINSQR YSNIRIKPEQ VEECFGKLDV GVHVLATASQ IYNMLVEQGN IEQLWQLHYI HEIHKVNRIT NPISEQKEWQ RYLFIETLHQ TYQDAITAVG FQPKLLTRQE QSALIILSLL FDSGVLNPKY LMSLLRQLHA LENYQYFKGR LFFEIITHST NPPKKFFISK QTEVLLYRIK PWKMPLKKQD KSLHKDLLES MRAILKNWSL KVSIFPTSLQ GWCQWVSLYQ SKYWSPILLA CNKGKHVNHS LNDSATKRLF RTESNIHSTI EFDDESNQKQ TTLRSYTFSE IRKVFDINVG KSNHKKQFDE IKAKYALLAP KLKDNYLLIL DWGKSYIQLD HKGQFKKNPK QILKKISAIG RHLIAVANTQ FLQKISAEER TAIFREVIEQ AISIKNKNDI QYHLRDFNIW LEKSHRSEKI QHKEDVFGTP SMTDMTVNAN LISFDEYESI KSSLTELITK YPEDESYKVM LAILILGFRL GLRITEAIEL KFIDYLFCDT SPQILIRESE ERKTKSLNAK RAQKLTDFLT DDEVKLLNEH HQFQQKRFGR FTNGKNHYYF FSTEDSGCKL KSVEDIKKKL MHLIREVCKD TSLKYHHLRH SFASWHFFSS AISELDLNIG DYFAHLPKTE AWLQQANTRK LQHLPTQLKS KKYPYWLAQR IGHGSIETTL EHYIHSVDLI NMLYQDSLVS NLTINDLHGL TNIPISTLKK QKNRLDFALT RLTNSIPKLK AKKHQSLLSL REEWLAPEEI QTCIEPITAN LPYYRYMSFL YHSSSTGNIS NLGFTRKEIK QLTRLFQDNP AFRIRALNIA EQSLLASYLD KIMDVYQFTP NKDTGFPRPL EAILDTFKSR LHPYSSDTNS LQIQRSYHLI FRDETSGKIL VEFAQKMNIP IKLILRHSGQ MKPFHITKEK RYWKKLLGLK HSIVFTTQKD TNSRLGNHGR LEMVFLSNTG KKDHALYFLL VMLNVASLWQ S
|
| |