Gene Hneap_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0052 
Symbol 
ID8533165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp58846 
End bp60408 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content61% 
IMG OID646382431 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003261965 
Protein GI261854682 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA ACCGCACCAC TCCTCGCCGC GCCTTGTTGA GCGTGTCGGA CAAAACCGGC 
CTGTTGGCAT TCGCCCAGAG CTTGAATCGG CACGGCGTGG CGCTGATTTC CACTGGCGGC
ACGGCCAGTA TGCTGCGTGA TGCGGGCCTG CCCGTGACCG AAGTGGCCGA TGTGACCGGT
TTCCCCGAAA TGATGGCCGG ACGGGTCAAG ACACTGAACC CGAAGATTCA TGGTGGCATT
CTGGCCCGAC GCGGTGTGGA CGAGGACGTG ATGGCTGAGC ACGGCATCGA GCCGATCGAC
ATCGTGGTCG TGAATCTCTA CCCGTTTGCC GAAACCGTCG CCAAGCCGAA CTGCCAGTTT
GACGATGCGG TCGAAAACAT CGATATCGGT GGCCCGGCCA TGGTGCGTGC GGCGGCGAAA
AATCATCAAG ATGTCGCGAT TATCGTCGAT CCGTCCGATT ACGGTCGTGT GTTGAGCGAA
GTAGAAGCGG GCGGCATTGA AGCGCAGACC CGTTTCGAGC TGGCCGTGCG TGCTTTCGAA
CACACCGCGC ATTACGACGG CATGATCGCC GATTATTTTG GCAAGATGGT CAGCGGCAAT
GCCTTCGCGC CGACCTTCAA CCTGCAATTG AAAAAAGCGC AGGATTTGCG CTACGGCGAG
AACCCGCATC AGGAAGCCGC GTTTTACGTG GAACATACCC CGCCGGTCGG CAGCATCGCC
GCCGCGCACA TGATTCAGGG CAAGGCATTG TCGTACAACA ATATTGCCGA CTCCGACGCG
GCGCTTGAAT GCGTGAAGCA GTTTGCCGAA CCGGCCTGTG TGATCGTCAA GCATGCCAAC
CCCTGCGGCG TGGCCGTGGC GGAAGACCTG ACCGTGGCGT ACGACCGTGC CTATGCGACC
GACCCGACCT CCGCCTTCGG CGGCATTATT GCCTTCAACC GCCCATTGGA CGGTCACACC
GCGCGCACCA TCGTCGAGCG GCAGTTCGTC GAAGTGGTCA TCGCGCCGGA AATTTCACCC
GAAGCACGCA TCGAGTTCGA AGCCAAACCG AACGTGCGCG TGTTGACTGT CGGTCAATGG
CCAGCGGTGA GTCCGGCCCG GTTGGACTTC AAGCGTGTGC ATGGCGGCCT GCTGGTGCAG
GATGACGACG CGGCGCGCAT CACGGCGCGG GATCTGACGG TTGTCTCGGA GCGCCAGCCG
ACCCCTGAGG AGTTGCGTGA TCTGCTCTTT GCCTGGCAGG TGGCCAAGTT CGTGAAATCC
AACGCCATCA TTTACGCCAG TGGCGAGCAG ACGATCGGCG TAGGTGCCGG TCAGATGAGC
CGCGTTTATT CAGCGCGTAT CGCGGCAATC AAGGCCGAAG ACGCCTGCCT GCCGGTCGCC
GGTTCCGTGA TGGCTTCCGA TGCCTTCTTC CCGTTCCGTG ACGGGATTGA TGCGGCGGCG
GCCGTTGGCA TCCGTGCGGT TATTCAACCC GGCGGATCGA TGCGCGATCA GGAAGTGATC
GATGCGGCCA ACGAACACGG TATTGCCATG GTCTTTACCG GCATACGCCA TTTCCGTCAC
TGA
 
Protein sequence
MKNNRTTPRR ALLSVSDKTG LLAFAQSLNR HGVALISTGG TASMLRDAGL PVTEVADVTG 
FPEMMAGRVK TLNPKIHGGI LARRGVDEDV MAEHGIEPID IVVVNLYPFA ETVAKPNCQF
DDAVENIDIG GPAMVRAAAK NHQDVAIIVD PSDYGRVLSE VEAGGIEAQT RFELAVRAFE
HTAHYDGMIA DYFGKMVSGN AFAPTFNLQL KKAQDLRYGE NPHQEAAFYV EHTPPVGSIA
AAHMIQGKAL SYNNIADSDA ALECVKQFAE PACVIVKHAN PCGVAVAEDL TVAYDRAYAT
DPTSAFGGII AFNRPLDGHT ARTIVERQFV EVVIAPEISP EARIEFEAKP NVRVLTVGQW
PAVSPARLDF KRVHGGLLVQ DDDAARITAR DLTVVSERQP TPEELRDLLF AWQVAKFVKS
NAIIYASGEQ TIGVGAGQMS RVYSARIAAI KAEDACLPVA GSVMASDAFF PFRDGIDAAA
AVGIRAVIQP GGSMRDQEVI DAANEHGIAM VFTGIRHFRH