Gene Tcr_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_0731 
Symbol 
ID3762106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp803192 
End bp804595 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content45% 
IMG OID637785447 
Productpeptidase S1C, Do 
Protein accessionYP_391001 
Protein GI78485076 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000553018 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAC AACTAAAATC TTTGTATGGT GCTTGGATGG CTCTGATGCT GGCCTTGAGT 
TTTTCCTCAG TACAAGCCTC TGGACCGACG GTTACATTGC CCGACTTTTC GCAGTTGGCC
TCTGAAAACA GCCCGGTAGT GGTGAATATC AGTACGCTGA AAAAAATTGA AAGACCGGAT
CATCCTCAGT TAAGAGGAAT GCCTGATGAG ATGCTACGCT ATTTTTTTGG AATTCCTGAA
GGACAGGATC CAAGAGGCGA GCGTCAAGAA CAGGTGAGCT CACTGGGGTC AGGGTTTATT
ATTTCATCGG ATGGTTACAT TATCACCAAT CATCATGTGG TTGCGGATGC GGATGATATT
GTCGTGAAAT TGAGTAACCG ACAAGAATTA AAAGCAAAAG TTATTGGCAG CGATGAACGT
TCGGATATAG CGGTGATAAA AGTGGATGCT AAAAATCTGC CTGTGGCTAA AATTGGAACG
TCGAAAAATC TAAAAGTGGG GCAATGGGTG ATGGCGATTG GTGAGCCATT TGGCTTGGAT
TACACCGTCA CGCATGGCAT TATCAGTGCA TTAGGGCGTT CGCTTCCAGA CGATACTTAT
GTACCGTTTA TCCAAACAGA TGTTGCGATT AACCCTGGTA ACTCAGGTGG ACCATTGTTA
AACACCAATG GAGAAGTCAT CGGGGTTAAT GCCCAGATTT ACAGTAATAG CGGCGGTTCA
ATGGGGCTTT CATTTTCGAT TCCGATTGAT ATTGCGATGG ATGTTGCGCA ACAACTTAAA
ACCAAAGGCC GTGTTGAGCG CGGGTATCTT GGCGTCGGCG TTCAAGAAGT TTCGGGCGAC
TTAGCCAAAT CGTTTGATAT GAAAAGACCG ATGGGCGCGC TGGTCACGTC AACAGAAAAG
GATTCGGCCG CCAGTGAAGC TGGGATTCAG CCGGGTGATA TTATTATCGA ATTTGCCGGT
CGAACAATTC AAAAGTCATC CGATTTACCA CCAATTGTGG GGAACTCTGC CGTTGGAGAA
TCGATCAAGG TTAAAATCTT AAGAAATGGA GATTATAAAA CGTTGACGGT TCGTTTGAAG
TCGTTAGATG ATATGAAGTT AGCGGCAGCA GGCGCCGAAG CTGAAAATAC GACTTTGGGT
GTGATGATGA AAGAAGTCAG CCCCAAAGTG CTTGACAAGT TGAATCTACC ATTTGGAATT
GGCGTTTCTA AAGTCAAGCG AGGCAGTGCG GCAGACCGGG CGGGCATTAT CCCTGGGGAT
ATTTTGGTGA CGATTAATTT CAAACCAATT AAGTCCATTA AGGCTTTGAA TGAAATTGTT
GCCGCTGCGC CAAAAGGTCG TTCTCTTCCT GTGAGAGTGG TTAGAGGGAA GCGTTCTGTA
TTTCTTCCTC TGGTATTAAA TTAA
 
Protein sequence
MKIQLKSLYG AWMALMLALS FSSVQASGPT VTLPDFSQLA SENSPVVVNI STLKKIERPD 
HPQLRGMPDE MLRYFFGIPE GQDPRGERQE QVSSLGSGFI ISSDGYIITN HHVVADADDI
VVKLSNRQEL KAKVIGSDER SDIAVIKVDA KNLPVAKIGT SKNLKVGQWV MAIGEPFGLD
YTVTHGIISA LGRSLPDDTY VPFIQTDVAI NPGNSGGPLL NTNGEVIGVN AQIYSNSGGS
MGLSFSIPID IAMDVAQQLK TKGRVERGYL GVGVQEVSGD LAKSFDMKRP MGALVTSTEK
DSAASEAGIQ PGDIIIEFAG RTIQKSSDLP PIVGNSAVGE SIKVKILRNG DYKTLTVRLK
SLDDMKLAAA GAEAENTTLG VMMKEVSPKV LDKLNLPFGI GVSKVKRGSA ADRAGIIPGD
ILVTINFKPI KSIKALNEIV AAAPKGRSLP VRVVRGKRSV FLPLVLN