Gene Tcr_2041 details


Gene Information

Locus tag: Tcr_2041
Symbol:
ID: 3761958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiomicrospira crunogena XCL-2
Kingdom: Bacteria
Replicon accession: NC_007520
Strand:
Start bp: 2253746
End bp: 2254912
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 46%
IMG OID: 637786790
Product: hydrogenase expression/formation protein HypD
Protein accession: YP_392305
Protein GI: 78486380
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
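
The coordinate and length fields above can be cross-checked with simple arithmetic. The following Python sketch assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2253746, 2254912)
print(length_bp)                  # 1167
print(protein_length(length_bp))  # 388
```

Both values agree with the Gene Length and Protein Length fields reported for Tcr_2041.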


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACGC TTCAGTTACA AGATTTGTAT CAAGGGTTTC GACAGCCCAA AACCATTCAG 
GCATTGGCAC AGAAAATTCA TCAAACGGCT CAGACGCTGA GTTCGCCACT TCGGATTATG
GAAGTCTGCG GTGGGCATAC GCACACCATT ATGAAGTATG GTTTGAACCA GCAGCTACCA
GAAAACATTG AGTTCATTCA TGGTCCAGGG TGCCCGGTAT GTATTATGCC GAAAGAGCGG
ATTGATCATG CCATTGCGCT GGCGCAAATG CCGAATACGA TTTTGCTGAC ATTAGGCGAT
ATGATTCGTG TGCCTGGCTC GAAAACCAGT TTGGCGAAGC AGCGTGCTTT AGGCAGTGAC
ATTAGAGCCC TGTATTCGCC GTTGGATGCG TTGACAATTG CGCAGGAAAA TCCTGACAAA
CAGGTGGTGT TTTTTGCCAT TGGATTTGAA ACCACTACGC CAATGACAAC GGCGGTGATT
CAGCAAGCAT TGGTATTAAA GTTACCCAAT CTTTTTTTTC ACATCAACCA TGTGTTGGTA
CCGCCTGCCG TGGCCGCAAT CTTGTCGGAT AAAGACTGCC AGATCAATGC ATTGATTGGT
CCTTCCCATG TCAGTGTAAT CAGTGGTGCA CAAATTTATC AGCCACTGGC GGCACAACAC
CGTATACCGA TTGTGGTCAG TGGGTTTGAG CCAGTGGATG TGATGCAAAG TATTCTGATG
ATTGTAGAGC AAATGCTTCA AAAAAGGCAT CAAGTCGAGA TTCAATATTC ACGTGCGGTA
ACAGAACAAG GCAATCAAAA AGCTCAGCAG ATGATTGAAA CCTATCTGGA ACCTCGTTCC
CATTTCCGTT GGCGTGGCTT GGGCGACATT CCGCTCAGTG CTTTGCAATT AAAAGACGCT
TATCGTTTTT TGGATGCCGA AACGGTTTTT AAATCGGTTT TGTCGGATGA ACCGATTGAC
GATCATAAAT TGTGTATTTG CGGTGATATT CTTAAAGGTG TCGCCAAACC ACAAGACTGT
AAGGTGTTTG GTCGAGGCTG CGACCCGGCA CGACCACTGG GCAGTTGCAT GGTATCAAGT
GAAGGTGCTT GTAATGCGTA TTATCGATAT GCTGAAGTGG CCTTGCCCAA AGGAAAAACG
TTTGAGAAAA AACGAGCAAC CGCATGA
 
Protein sequence
MTTLQLQDLY QGFRQPKTIQ ALAQKIHQTA QTLSSPLRIM EVCGGHTHTI MKYGLNQQLP 
ENIEFIHGPG CPVCIMPKER IDHAIALAQM PNTILLTLGD MIRVPGSKTS LAKQRALGSD
IRALYSPLDA LTIAQENPDK QVVFFAIGFE TTTPMTTAVI QQALVLKLPN LFFHINHVLV
PPAVAAILSD KDCQINALIG PSHVSVISGA QIYQPLAAQH RIPIVVSGFE PVDVMQSILM
IVEQMLQKRH QVEIQYSRAV TEQGNQKAQQ MIETYLEPRS HFRWRGLGDI PLSALQLKDA
YRFLDAETVF KSVLSDEPID DHKLCICGDI LKGVAKPQDC KVFGRGCDPA RPLGSCMVSS
EGACNAYYRY AEVALPKGKT FEKKRATA
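
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The Python sketch below is a minimal illustration, not the tool that produced this record. It assumes the sequence is pasted in the 10-base blocked layout shown above, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The names `translate` and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final 27 bases of the gene sequence above.
print(translate("ATGACCACGC TTCAGTTACA AGATTTGTAT"))  # MTTLQLQDLY
print(translate("TTTGAGAAAA AACGAGCAAC CGCATGA"))     # FEKKRATA
```

The two fragments translate to the first ten and last eight residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein and the reported GC content to be checked in the same way.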