Gene Tcr_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_2037 
Symbol 
ID3761766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp2248181 
End bp2249887 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content49% 
IMG OID637786786 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_392301 
Protein GI78486376 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAT CCACTAAAAA ACAGAAAATC GTAATCGATC CAGTCACCCG AATTGAAGGC 
CATTTAAGGG TTGAAATTGA AGTGGATGAA CACAATACCG TGACCGAAGC CTGGGCGTCT
GGTCAATTGT TTCGTGGGAT TGAAACCATT CTGAAAGGTC GGGACCCACG GGATACCGGT
TTGATTGCTC AACGTATTTG CGGCGTTTGT ACCAATTCGC ATTATCGTGC GTCAATCAGT
GCTGTGGAAA ATGCTTATGA TATCGTTCCC CCGCGCAATG CTGAAATCGT CCGAAATCTG
GTTTCCTTGG CGTTGTTTGT GCAGGATCAT CTGGTGCATT ATTACCATTT GCATTCGTTG
GATTACGTCG ATGTTACCAG CGCATTGGAA GCCGATTGTC ACAAAGCCAG TGAAATAGCC
CATCAATGGC ATAAACATCC GTATAACTGT TCGCAAGGCG ATTTAGCGGC CGTGCAGGAA
AAGCTGGCAA ATTTTGTCAA CGCAGGGCGT CTAGGGCTTT TCGCCAATGG ATACTGGGGC
CATGCCCAGT ACAAATTGTC GCCGGAAGAA AATCTGATTC ATATGAACCA TTATCTGGAA
GCGTTGCGGA TTCAACGAGA AGTGAGCAAG GCGATTGCCA TTTTCGGGGG GAAAACGCCG
CACCCGCAAA ATCTGGTGGT CGGGGGCGTT ACCAGTGTGA TGGACATGCT GAATCCGCAA
CGCTTGAATG ACTATCTTTT CATTATCAAA GATACGCAGG AATTTTTGAA ACGCGCGTAT
CTGCCGGATA TGAAAATGGT GGTGGCCGCC TATGGCGACA ATATCAAAGC CGGAGAAGGC
CGCGGCCACG GCAACTTTAT GTGTTCGGGC GGCTATCAAT TGTCAGATGA CGAACCGTTG
TTTGCCAGCG GCATTATTTG GGGGCATGAC TTCTCGAAAA TAGACGCGTT TGATGACACT
CAAATCACGG AAGAGGCCTC TCGTTCCTGG TATGCCGATG AAGCGCCGAC CCGTCCGTAT
GATGAAACCA CAGAACCGGA CTACACCGAT ATGAATGCGG ACGGCACCTT GAAAACCGAA
GGCAAATACA GCTGGATCAA AGCCCCGCGT TATCAAGGTC AACCAATGGA GGTGGGGCCA
GCCGCCCGTA TGATCATTGG TTATGTGAAA AAGGCCAAGA CGGTGCGCCC TTATATGCAA
CATTTTATGG ATGATACCGG CTTGGAACTG ATCGATTTTT CGTCTGCCAT TGGCCGCAAT
GCGGCACGCG CCGTTGAAGC CGAAGTGGTG TGCGATCTGA TTTTTGCATT TGTCAGTGAG
CTGATTGAAA ACATTAAATA TTACGATGAA ACCACCTGGA CAAAATACGA TTTTGAGGCC
TTGCCGCTTG AAACCAAAGG GCGAGCTGTG TTGGAAGTGC CACGGGGCAT GTTAAGCCAT
TTCATCCGTA TTGAGGAGGC CAAGGTCAAG AATTATCAGG CAGTGGTGCC CACGACTTGG
AATGCCTCTC CTAAAGACGG TCAGGGCGTT CGTGGGCCTT ATGAAGAAGC GATTGTTGGC
CTTAAATTGG CCGACCCGCA ACAACCTTTG GAAGTGCTTC GTGCCGTGCA TTCTTTTGAT
CCATGTCTGG CCTGTGCCGT GCATGTGATT GATGCGCAAG GCCAAACCTT ATCGGAACAT
CGCGTGAATC CAATCGGAAC ATTTTGA
 
Protein sequence
MTESTKKQKI VIDPVTRIEG HLRVEIEVDE HNTVTEAWAS GQLFRGIETI LKGRDPRDTG 
LIAQRICGVC TNSHYRASIS AVENAYDIVP PRNAEIVRNL VSLALFVQDH LVHYYHLHSL
DYVDVTSALE ADCHKASEIA HQWHKHPYNC SQGDLAAVQE KLANFVNAGR LGLFANGYWG
HAQYKLSPEE NLIHMNHYLE ALRIQREVSK AIAIFGGKTP HPQNLVVGGV TSVMDMLNPQ
RLNDYLFIIK DTQEFLKRAY LPDMKMVVAA YGDNIKAGEG RGHGNFMCSG GYQLSDDEPL
FASGIIWGHD FSKIDAFDDT QITEEASRSW YADEAPTRPY DETTEPDYTD MNADGTLKTE
GKYSWIKAPR YQGQPMEVGP AARMIIGYVK KAKTVRPYMQ HFMDDTGLEL IDFSSAIGRN
AARAVEAEVV CDLIFAFVSE LIENIKYYDE TTWTKYDFEA LPLETKGRAV LEVPRGMLSH
FIRIEEAKVK NYQAVVPTTW NASPKDGQGV RGPYEEAIVG LKLADPQQPL EVLRAVHSFD
PCLACAVHVI DAQGQTLSEH RVNPIGTF