Gene Tcr_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1122 
Symbol 
ID3762063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1210101 
End bp1211588 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content44% 
IMG OID637785843 
ProductNusA antitermination factor 
Protein accessionYP_391391 
Protein GI78485466 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGG AAGTTTTAGC AGTTGTTGAA ATCATGTCTA ACGAAAAAGG CGTGGAAAAA 
GAAATTATCT TTGAAGCCAT TGAAGCGGCT CTAGCGACAG CAACTAGAAA AAGTCATAAT
GATGAAATTG ACGCCCGTGT TTCTATTGAT CGACATACGG GTGATTACGA AACTTTCCGT
CGCTGGGAAG TGATTGAAGA CGATGTCGAG ATTGAAGACC ACGTTGGTTG GTATATTCGC
CATATGGATG CGGTTGACAT TGAACCGCAT ATTGAGCCAG GTGAATTTAT TGAAGAACCA
ATGGAGTCGA TCGAATTTGG CCGTATTGGT GCGCAAACAG CGAAGCAAGT GATTATTCAA
AAAGTCCGTG AAGCTGAGCG TAAAAAAGTG GTCGAAGAAT ATTCGAAACG CATCGGAGAA
ATTTTAACTG GTCAGGTTAA GCGTATTGAT CGCGGTGATG TCATTCTGGA TTTAGGGGAT
AACGTGGATG CGGTCATTCC TCGTTCAGAA TTGATTAACC GCGAAAACTT TAAAATGGGC
GATCGTGTCC GTGCTTATGT TCAAGATGTT TCTTTCCGTC CTCGCGGCCC ACAGATTTTC
ATGTCTCGTG CGTGTAAAGA AATGTTGATG GAACTGTTTA AAATCGAAGT GCCTGAAATT
GGTGACGACT TAATCGACAT TATGAGTGCG GCTCGTGATG TTGGTTTAAG AGCCAAGGTG
GCCGTTCGTG CTAACGACCC ACGCTTAGAT CCTATCGGAG CTTGTGTCGG GATGCGTGGT
GGACGTGTTC AAGCGGTGAC AAATGAATTA AATGGAGAAC GTATCGACAT TATCCTTTGG
GATTCAAACG ATGCGCAGTT TGTTATTAAT GCGATGGCCC CAGCAGAAGT CACGTCCATT
ATGGTGGATG AAGACAAGCA TACAATGGAC TTGGCGGTTG ATGATGAGCA GTTGTCTCAG
GCCATTGGTA AAAACGGTCA AAACATCCGC TTGGCAACCG AACTAACAGG TTGGGAGCTG
AATGTCATGT CTGAAACGGA CATGGCTGCG AAGCATGAAA CAGAATCGAA AGGTCAGATG
GATTTATTCG TCAACGGCTT GGAAGTCGAT GAAGAACTTG CAGAAGTTCT AGTCGCAGAA
GGTTTTACAA CACTTGAAGA AGTGGCGTAT GTTCCGGCTG CGGAAATGTT AGAGATTGAA
GGCTTTGATG AAGAAATTGT TGCAGCTTTA AAAGAAAGAG CTAAAGACGC ACTGTTGACT
CAAGCAATTG CAAACGAAGA AAAGACAGCC ATGGCTGAAC CAGCGCAAGA TTTATTGGAC
TTAGAAGGCA TGACTGAAGA AATGGCAAAA ACGCTCGCTT CTAAAGGAAT CATCACTCAG
GAAGATTTAG CTGAATTAGG CACGGATGAA TTATTAGAAA TAGTAGAGAT GGATGCGGAC
GCAGCTAGCG AATTGATTTT GAAAGCACGC GCACCATGGT TTGAATAA
 
Protein sequence
MSKEVLAVVE IMSNEKGVEK EIIFEAIEAA LATATRKSHN DEIDARVSID RHTGDYETFR 
RWEVIEDDVE IEDHVGWYIR HMDAVDIEPH IEPGEFIEEP MESIEFGRIG AQTAKQVIIQ
KVREAERKKV VEEYSKRIGE ILTGQVKRID RGDVILDLGD NVDAVIPRSE LINRENFKMG
DRVRAYVQDV SFRPRGPQIF MSRACKEMLM ELFKIEVPEI GDDLIDIMSA ARDVGLRAKV
AVRANDPRLD PIGACVGMRG GRVQAVTNEL NGERIDIILW DSNDAQFVIN AMAPAEVTSI
MVDEDKHTMD LAVDDEQLSQ AIGKNGQNIR LATELTGWEL NVMSETDMAA KHETESKGQM
DLFVNGLEVD EELAEVLVAE GFTTLEEVAY VPAAEMLEIE GFDEEIVAAL KERAKDALLT
QAIANEEKTA MAEPAQDLLD LEGMTEEMAK TLASKGIITQ EDLAELGTDE LLEIVEMDAD
AASELILKAR APWFE