Gene Tcr_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_0439 
Symbol 
ID3761258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp492005 
End bp493567 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content46% 
IMG OID637785150 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_390709 
Protein GI78484784 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCTG TTCGCCGCGC CCTGATTAGC GTCTCAGATA AAAATGGCAT TTTAGAGTTT 
GCCAAGTCTT TAACCGCTAT GGATGTTGTC ATCTTATCAA CAGGAGGAAC CTATAAAGTT
CTTTCTGAAG CCGGCTTACC TGTTACAGAG GTTTCGGAAT ATACAGGTTT TCCTGAAATG
ATGGATGGTC GAGTCAAAAC ACTTCATCCA AAAATTCACG GTGGTTTGTT GGGACGTCGA
GGCACAGATG ATGCTGTTAT GGCAGAACAT GGCATTGATC CTATCGATAT GGTGGTTGTT
AACCTTTATC CTTTCGAAGC CACGGTTGCC AAACCGGACT GTTCACTAGA AGATGCCATT
GAAAACATTG ATATTGGAGG GCCGACGATG CTTCGTTCCG CGGCCAAAAA CCATAAGGAT
GTTGCTGTTG TCACAGACCC GCACGATTAT GCCCGCATTT TAGAAGAAAT GGAATCGAAT
GATGGTCAAC TATCTCATGC AACCCGTTTT GATCTAGCCA TCAAAACATT TGAACAAACA
GCACGCTATG ACGGAGCGAT CTCAAACTAC TTCGGCACCA TGTTCAGCGA CGATAAAGAC
GATACTTTCC CGCGCACATA CAACACCCAG TTTGTGAAAA AACAATCGAT GCGTTATGGC
GAAAACCCAC ATCAGTCGGC GGCTTTTTAT ACAGAACGCA ACCCAACCGA AGCCTCTATT
TCAACCGCTA AACAACTTCA AGGCAAGGCA TTGTCTTTCA ATAACATTGC CGATACAGAT
GCGGCATTAG AGCTGGTCAA AACCTTTGAA GAAACCGCTT GTGTTATTGT CAAACATGCC
AATCCGTGTG GTGTTTCCAT TGGTGAAAAC GTTTTTGAAT CTTATGACCG AGCTTATAAA
ACCGATCCAA CCTCTGCCTT CGGAGGCATT ATTGCATTCA ATCGCGCGTT AGATCAAGAA
ACCGCTCAAG CCATAATTGA TCGTCAGTTT GTTGAAGTCA TCATCGCACC GAATGTGTCT
GAAGATGCTA AGAATGTCAT TGCGGCTAAA CAAAATGTTC GTTTATTGGT GTGTGGTGAT
TTAGGCATCC AAGAGCCTGC TTATGACTAC AAACGCGTAA CAGGTGGTTT ATTGGTTCAA
GACCGTGATT TAGGTTCGGT AACGGAAGAC GAGCTGAAAG TCGTCACCAA ACGAGCGCCC
AGCGAAAAAG AAATGGCGGA CTTGCAATTT GCTTGGAAAG TGGCGAAGTA CGTTAAATCA
AATGCCATCG TTTATGTCAA AGACGGCATG ACCATTGGAG TAGGGGCAGG CCAAATGAGC
CGTGTTTATT CTGCCAAAAT TGCTGGTATT AAAGCGGCGG ATGAAGGCCT TGAAGTGCCA
GGTTCCGTGA TGGCTTCAGA TGCCTTCTTC CCTTTCAGAG ATGGCATTGA TGCGGCCGCT
GAAGCCGGTA TCACAGCCGT GATTCACCCA GGTGGGTCAA TGCGAGACCA AGAAGTGATT
GATGCAGCGG ATGAGCACGG CATCGCGATG GTCTTCACTG GCATGCGTCA CTTTAAACAC
TAA
 
Protein sequence
MKPVRRALIS VSDKNGILEF AKSLTAMDVV ILSTGGTYKV LSEAGLPVTE VSEYTGFPEM 
MDGRVKTLHP KIHGGLLGRR GTDDAVMAEH GIDPIDMVVV NLYPFEATVA KPDCSLEDAI
ENIDIGGPTM LRSAAKNHKD VAVVTDPHDY ARILEEMESN DGQLSHATRF DLAIKTFEQT
ARYDGAISNY FGTMFSDDKD DTFPRTYNTQ FVKKQSMRYG ENPHQSAAFY TERNPTEASI
STAKQLQGKA LSFNNIADTD AALELVKTFE ETACVIVKHA NPCGVSIGEN VFESYDRAYK
TDPTSAFGGI IAFNRALDQE TAQAIIDRQF VEVIIAPNVS EDAKNVIAAK QNVRLLVCGD
LGIQEPAYDY KRVTGGLLVQ DRDLGSVTED ELKVVTKRAP SEKEMADLQF AWKVAKYVKS
NAIVYVKDGM TIGVGAGQMS RVYSAKIAGI KAADEGLEVP GSVMASDAFF PFRDGIDAAA
EAGITAVIHP GGSMRDQEVI DAADEHGIAM VFTGMRHFKH