Gene Tcr_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1303 
Symbol 
ID3760685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1420142 
End bp1421656 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content47% 
IMG OID637786035 
Productextracellular solute-binding protein 
Protein accessionYP_391572 
Protein GI78485647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.173146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAACC GCAGAACGCT TTTTAAAGCC AGCCTCGTTT CGGCAACGGG GCTGTTGTTA 
CAAAGTTGTG CTTCTTCTCA AGGGCAAGAG CGACAAATCA AAGTGGGGGT GACTTCTCGC
CCCCGAATGC TGGACCCGCG TCAGGCAACG GATGCGTTAT CCAGCCGAGT GAATCGATTG
ATTTACCGTC AGTTAATCGA CTTTAATGAA TCGTTCGAGC CGATTCCCGA CTTGGCGACC
TGGCAACAGA TTTCGCCAAC ACACTATCGT TTTACGCTAA CCGAATTTCC ACGGTTTCAT
CATGGCATGC CTTTAACCGC AGAAGATGTG GCGGCGACTT ATCGAAGCAT TCTGGATAAA
ACCTTAGGTT CCCCTCATCG CGGTTCTTTA AAGAAAATTA CTCAAATTGA CGTGTTGAAT
AATGCCGAAT TGGACTTTCA CTTGGAAGCG CCGGATGCGT TGTTTGTTGG GCGCTTGGTG
ATCGGAATTT TGCCGAAAGA TTTGATTGAA AGTCAGCATG CCTTTCAAAA AACACCGATT
GGATCGGGGC CGTGTTTATT TAAGTCAATG ACCGAGCAAA AGCTGGTTCT GGAACGACCG
GATGCTGTGC AACTGGTTTT TATTCCAGTT AAAGACGCGA CCGTGCGCGT GCTGAAATTG
CGAAAAGGCG AATTGGATAT CATTCAAAAT GATTTATCTC CTGAGCTGGT TTCGTATTGT
GACAAACTGG ATGAATTAAA CGTGCAATGG CATGATGGCA CTAATTTTGG GTATGTCGGA
TTTAACTTTG ACGATTCGTT ATTGTCACAG TTAGAAATGC GGCAGGCATT GGCTTACGGT
ATTAACCGTC AGGCGATTGT TGATGCGGTA TTCGATGGTC ATGCGCGTTT GGCAGGCGGC
TTATTGGTGC CGGAACACTG GAGTGGGGTG GCGGACATTC ATGGGTTTGA TTATCGACCT
AATAAAGCCA AACAGCTGGT GGATTCCCTT AAACAAAAAC AACCAGGCCT GGTCAATGAC
GATGGAATGA TAGAGTTGAG TTATAAGACC TCTTCGGACC CGACGCGTAT TCGATTGGCA
ACCATTTACC AATCTCAACT GAGAAAAATC GGGGTGGCCT TAAAAGTACA GAGTTATGAT
TGGGGAACGT TTTACAACGA CATCAAACAA GGCCGTTTTC AACTCTATAG CCTGGCTTGG
GTTGGGGTGA AAAGCCCGGA TATTTTTCAG TATGTGTTTG ACAGTGATGC GATTCCGCCC
AAAGGAGCGA ATCGAGGGCG ATATCGAGAC CCGCAAGCCG ATGCATTGAT TCGTGAGGCG
GGTCATACTC AGTCATTAGC CAAACAGGCC GAGTTGTATC AAGATTTACA GAGACGGTTA
CAAGAAACTT TAGCCGTTAT TCCGTTGTGG TATGAAGATC AGTATGCCGT GACACGCCCA
CAGGTTAAAG GATACCAACT CTATTCAGAT GGACGGTTCG ATGGGTTGTT GTCTGTTGAG
TTGGGCGAGA CATAA
 
Protein sequence
MLNRRTLFKA SLVSATGLLL QSCASSQGQE RQIKVGVTSR PRMLDPRQAT DALSSRVNRL 
IYRQLIDFNE SFEPIPDLAT WQQISPTHYR FTLTEFPRFH HGMPLTAEDV AATYRSILDK
TLGSPHRGSL KKITQIDVLN NAELDFHLEA PDALFVGRLV IGILPKDLIE SQHAFQKTPI
GSGPCLFKSM TEQKLVLERP DAVQLVFIPV KDATVRVLKL RKGELDIIQN DLSPELVSYC
DKLDELNVQW HDGTNFGYVG FNFDDSLLSQ LEMRQALAYG INRQAIVDAV FDGHARLAGG
LLVPEHWSGV ADIHGFDYRP NKAKQLVDSL KQKQPGLVND DGMIELSYKT SSDPTRIRLA
TIYQSQLRKI GVALKVQSYD WGTFYNDIKQ GRFQLYSLAW VGVKSPDIFQ YVFDSDAIPP
KGANRGRYRD PQADALIREA GHTQSLAKQA ELYQDLQRRL QETLAVIPLW YEDQYAVTRP
QVKGYQLYSD GRFDGLLSVE LGET