Gene Tcr_1354 details

Gene Information

Locus tag: Tcr_1354
Symbol:
ID: 3760514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiomicrospira crunogena XCL-2
Kingdom: Bacteria
Replicon accession: NC_007520
Strand:
Start bp: 1470231
End bp: 1471379
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 42%
IMG OID: 637786088
Product: hypothetical protein
Protein accession: YP_391623
Protein GI: 78485698
COG category: [S] Function unknown
COG ID: [COG3266] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
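
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though the listed values are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record.
start_bp = 1470231
end_bp = 1471379

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the listed Gene Length
print(protein_length, "aa")  # compare with the listed Protein Length
```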


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0537927
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCAAA CCATCTCAGA ATTAGAAAAA GAACGCGCTG AACTGCTCAA AGCAATTGAG 
AGTCAAGCAC AACAAATGTC CTCATCCCGG CCTTTAGGTG AGAAGGATCA AAAAGAACAT
ACTTTAAACG ACTGGTTACA TGCCGCTGAG GAAGTTATGC CAAGCACCCC CAAGAGACCC
ACACAACAAA GTTCCGCACC ACAACCGACT AAAAAAACCA AGGGGAATAA AGCCTCTTTT
TTTGGTGTCG TCATTATGCT GTCATTACTC CTAACCATTT TGGGAGTCGT CTACATTGCG
TATACCAGTA TCCATAATGA ACTTCAAAAA GTGTTAGCGG TAAAAGAAGA CTCAATGAAA
GAAGTTAAAA TGCTGAAGGA AACGGTATCA GAGCTTCAGA AATCAGTTGC TTCAGGAGGT
CAGGGCCAAT TATTCACTCA GCTACAAAAA CGCGTTGAAG CACTGGAAGC TGAAATTACC
ACACTTAAAG CACAGCAATT GACGTTAGAT AATAAAGTGG TGCAACAAGC AACGTCTAAA
AATACAGCGA CGGCTTTACC CGAAGAACTT CCATCCAATG TCGTGACTAC AGAAGTCTTA
GAGTCCAAGT TAAAAGAATA CACGCAAGGC ATTGATCATA AATTAGAAAC GATTTTGAAA
TATTTAAACT TATCGGAAGA AGATAAGGCA GAAACCGCTG CTAAAGTATC GGTGTTCACC
AAGCCAGAAG ATGACGTGAC AGAACCCACC ATAAAAGAAC CAACACAGCC TAAGGTTAAG
CCGCTGGATC AACCTGTCGT CCGTTTGGTG CAAAAAGTTG AAAAGCCTAC CACGCCTGAA
CCGAAGGCGC CGCTTGAAAA CTACACGTCT GATGTTAAAT GGCTAATGGA AGAACCGGCC
TTCAATTACA CACTTCAGCT GGCCAGTATG CCTGAGCGTG ATTCAATTGA AAAAATGATC
GAACAAAAAG GGCTGCAAGG TGCCAAGATT ATTCCATTAG AACGGAAAGG TGAGCCTTAT
TATGTTCTAT TAACGGGCAG TTATGCGTCG CGTTCAGAGG CCGACAAAGC CGCAAGAACC
TACAAAACGA ATTTTGGCAT TTCACCTTGG GTTCGTAAAA TCAAAGACTT AAGTCGTAAA
TTAAAATAG
 
Protein sequence
MAQTISELEK ERAELLKAIE SQAQQMSSSR PLGEKDQKEH TLNDWLHAAE EVMPSTPKRP 
TQQSSAPQPT KKTKGNKASF FGVVIMLSLL LTILGVVYIA YTSIHNELQK VLAVKEDSMK
EVKMLKETVS ELQKSVASGG QGQLFTQLQK RVEALEAEIT TLKAQQLTLD NKVVQQATSK
NTATALPEEL PSNVVTTEVL ESKLKEYTQG IDHKLETILK YLNLSEEDKA ETAAKVSVFT
KPEDDVTEPT IKEPTQPKVK PLDQPVVRLV QKVEKPTTPE PKAPLENYTS DVKWLMEEPA
FNYTLQLASM PERDSIEKMI EQKGLQGAKI IPLERKGEPY YVLLTGSYAS RSEADKAART
YKTNFGISPW VRKIKDLSRK LK
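
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). The helper `translate` and the constant names are hypothetical and are not part of any database tool. Only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final group of the gene sequence.
first_groups = "ATGGCCCAAA CCATCTCAGA ATTAGAAAAA"
last_group = "TTAAAATAG"

print(translate(first_groups))  # compare with the start of the protein sequence
print(translate(last_group))    # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` in the same way allows the whole protein sequence to be compared residue by residue.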