Gene Hneap_1106 details


Gene Information

Locus tag: Hneap_1106
Symbol:
ID: 8534254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1198505
End bp: 1200355
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID: 646383491
Product: allophanate hydrolase
Protein accession: YP_003262989
Protein GI: 261855706
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases
TIGRFAM ID: [TIGR02713] allophanate hydrolase
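
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 1198505
END_BP = 1200355


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1851 616
```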


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGC AGGTTGTCCA GCACGAAGAC CAACACAATG TTAAAAAAGA TACGGGCACG 
GAGAGTGTGC TGACAATCGC GGGTTGGTTG GATCGATGGA AAACCAATCC CCAAGCGGCC
GCTCGTGATC TGCTTGATCG GCAGCAAAAA CGTATTGCCA AGTGCGCAGC GGGAAAAACC
CCACAATGGA TCCATCTTGT CGCGCTGGAT GTGCTTGAAG CACAAATTAA CGAACTTCTG
AAGACGGACG CAAACGACCT GCCGCTTTAC GGGGTACCGT TTGCCGTCAA GGACAACATC
GATGTTGCGG GCATGCCGAC GACCGCAGGT TGTCCTGAAT ATGCCTACAC GCCAGATTCG
GATGCCTTTG TTATCGCCAA GCTCAAAAAG GCCGGTGCCA TCTGTATCGG TAAAACCAAT
CTAGATCAAT TCGCGACGGG CCTAAACGGA ACGCGCACGC CCTATTCCAT TCCCCATTCG
GTGTTCAGTG AGGCGCATAT TTCCGGCGGT TCTAGTTCCG GTTCGGCGGT GGCGGTGGCG
CTGGGTGAAG TGGCCTTTTC ATTGGGTACA GATACGGCCG GTTCAGGTCG AGTCCCGGCC
GGCTGCAACC AACTGGTGGG GTTGAAACCG AGCAAGGGGT ATATCAGCAC GAACGGTTTA
ATCCCGGCGT GCCGCACACT CGATTGCATT TCGATCTTCG CGCATACGGT GGACGATGCC
CAAAACGTGC TCGCCGTCGC CGGTGTTTAT GACCCAGAAG ATCCTTATGC GCGGCAAGCC
CAACCGGCCG TGACATCTTT TTCGAGCACG AATCTTAAGC TGGCCACGTT CAGCAATTTA
AGTTGGTTTG GCGATACGCA GCAACAGGCC GCCTGGAATG CCTATGTCGC ACAACTGGCT
GCACGGGGCA TCACGCTGAC GGAAATCGAT CCGACGCCTT TTTTTGCATT GGCACCGCTG
CTCTATAACG GTCCTTGGGT GGCCGAGCGT TTGGCCGCGC TTTCTGAGTT TGTGGCAGAG
CAACCCGAGG CCATCCACCC CGTGGTGCGC GAAATCGTGC TTTCCGGTAA AAAATTCTCC
GCCGTCGATA CGTTCAATGC CGAATACCAA CGAGCGGATT TGGCAAGGAA AATCCAACAA
GCCCTGTCTG AGTTTGATGC GCTGATGGTG CCAACGGCAC CGATTTTCCC CACGATTGAA
GCCCTATTGG CCGAGCCGAT CAAACTCAAT TCCGAACTCG GCACCTACAC CAACTTTGTC
AATCTAGCTG ATTTATCCGC GTTGGCTGTG CCCGCCTGTT TGCGTGATGA CGGACTGCCG
TTCGGTATCA CCCTGATTGC CGAAGCCTGG GCCGACGCCC GATTGGCCGC CATTGCCAAA
CAGTTGAACC AGGAACAAGC GACCATCCCG CCCCTGCGCC ATGATGAAAT TGAGCTCGCC
GTTGTGGGGG CGCACCTGAC GGGCATGCCG CTGAATCACC AGTTAACGAA CCGAGGTGGT
CGCCTGCTCG AACAAACCAC CACGGCCGCG TCTTATAGAC TCTATGCTCT TGCCAATACC
ACACCACCCA AGCCGGGCCT GGTGTTCGAT ACCGAAGGCG ATGAAATCAT CGTTGAAGTG
TGGGCACTCC CGCGCACTGC TTTGGCCGAC TTTATTCAAG AAATCCCGCC ACCGCTGGGC
CTCGGCAGCC TTACGCTCGT CGATGGACGG CAGGTGACCG GGTTTATCTG TGAACCACGC
GCCCTGCAGG ACGCAAAAGA CGTGACGGCA TTTGGCGGCT GGCGACCCTA CATCGCCTCC
CGTCAATCTG CGCCATTGGC TCCCCAACCA AAAGGAATTA CGCATGCCTG A
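
A GC content figure such as the one in the Gene Information section can be recomputed from the sequence. The sketch below is a minimal example that accepts the 10-base blocks as printed here, ignoring spaces and line breaks. The function name is hypothetical, and the record does not say how its own percentage was computed or rounded.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in cleaned)
    return 100.0 * gc / len(cleaned)


# First two blocks of the gene sequence above: 9 of 20 bases are G or C.
print(gc_content("ATGCAAAAGC AGGTTGTCCA"))  # 45.0
```

Passing the full gene sequence as a single string gives the value for the whole gene.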
 
Protein sequence
MQKQVVQHED QHNVKKDTGT ESVLTIAGWL DRWKTNPQAA ARDLLDRQQK RIAKCAAGKT 
PQWIHLVALD VLEAQINELL KTDANDLPLY GVPFAVKDNI DVAGMPTTAG CPEYAYTPDS
DAFVIAKLKK AGAICIGKTN LDQFATGLNG TRTPYSIPHS VFSEAHISGG SSSGSAVAVA
LGEVAFSLGT DTAGSGRVPA GCNQLVGLKP SKGYISTNGL IPACRTLDCI SIFAHTVDDA
QNVLAVAGVY DPEDPYARQA QPAVTSFSST NLKLATFSNL SWFGDTQQQA AWNAYVAQLA
ARGITLTEID PTPFFALAPL LYNGPWVAER LAALSEFVAE QPEAIHPVVR EIVLSGKKFS
AVDTFNAEYQ RADLARKIQQ ALSEFDALMV PTAPIFPTIE ALLAEPIKLN SELGTYTNFV
NLADLSALAV PACLRDDGLP FGITLIAEAW ADARLAAIAK QLNQEQATIP PLRHDEIELA
VVGAHLTGMP LNHQLTNRGG RLLEQTTTAA SYRLYALANT TPPKPGLVFD TEGDEIIVEV
WALPRTALAD FIQEIPPPLG LGSLTLVDGR QVTGFICEPR ALQDAKDVTA FGGWRPYIAS
RQSAPLAPQP KGITHA
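
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal example, not the pipeline used to produce this record. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons, so the standard codon assignments are used here. The function and variable names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCAAAAGC AGGTTGTCCA GCACGAAGAC"))  # MQKQVVQHED
```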