Gene Hneap_1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1919 
Symbol 
ID8535077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2052356 
End bp2053447 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content57% 
IMG OID646384300 
Productaminodeoxychorismate lyase 
Protein accessionYP_003263788 
Protein GI261856505 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.402452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC TGAATACTAT GAATAGGGCG ATCGGTTTTT TGTGGGTCTG GCTGCTCACG 
ATGGGGCTTT TATTCGCCAC CGTATGGATG GTCGGCCGGA TTGGTGTTTT TCTCACCCAG
CCCTTACTGC CCTCAGGCGC TGCGGCGATC ACGATTGAAA TCCCCAGTGG TGCGGACGCG
CGGCAAATCG CTAAAATCGC TGATGCCTCC GGTGCGAGGG TCAATCCGAC CGTATTCGTG
TGGGCGGCTC GATTGAGTGG TAAAGCGCGC TCAATTCAGG CGGGTGCGTA CCAGATTACC
GATCAGGATC GGTTGCTCGG TTTTCTCGAT CGACTGGTCG AAGGCGATGT GGTGCGTTAC
CGCATCACCA TCCCCGAAGG GGATACCGCC CAGGATTTCC TGAACAAGCT CGCGGCGCAA
AAGGAAATAA AACACACGCT GAACGGGCTG GATCAGGCCC AGATCATCGC CGAGATGAAT
TGGCCGATCA CCCATCTCGA AGGTTGGTTG TTCCCCGATA CGTATGTATT CACACGCGGC
ACCACGGACA AAAAGATTCT GCAGGAAGCC TACCGCTCGA TGCGGTCTCA TCTGGACGCG
GCATGGGCGG ATCGCGCACC CGGGCTGCCC TTGAAAACGC CCTACGATGC ACTGATTCTG
GCTTCTATCG TGGAAAAGGA AACCGGCTTG CCCGATGAAC GCGCCATGGT GGCGGGTGTA
TTCATCAACC GATTGAACAT CGGAATGCGG TTGCAGACGG ATCCGGCTGT CATCTACGGC
GTGGCGGAGG CAACTCAGGG ACAGGTTGAC GAGGACAGTT CGCCACGAAG TCTGACGCTA
AGCCAGCTGC GCGCCGATAC GCCGTACAAT ACCTACACCC GCACCGGTTT GCCGCCGACG
CCGATTGCCC TGCCATCCGC AGCTGCATTG CAGGCTGTGA CGCATCCCGA TAAAACGGAT
GCCCTGTATT TTGTTGCCAA TGGCACGGGC GGACACACCT TTTCGCGCAC ACTGAAAGGA
CACAATCAAG CCGTGCAGAC CTGGCGTAAA ATTGAAGATA CGCGGGCATC CGAACCGAAA
AAAAAGCAAT GA
 
Protein sequence
MKWLNTMNRA IGFLWVWLLT MGLLFATVWM VGRIGVFLTQ PLLPSGAAAI TIEIPSGADA 
RQIAKIADAS GARVNPTVFV WAARLSGKAR SIQAGAYQIT DQDRLLGFLD RLVEGDVVRY
RITIPEGDTA QDFLNKLAAQ KEIKHTLNGL DQAQIIAEMN WPITHLEGWL FPDTYVFTRG
TTDKKILQEA YRSMRSHLDA AWADRAPGLP LKTPYDALIL ASIVEKETGL PDERAMVAGV
FINRLNIGMR LQTDPAVIYG VAEATQGQVD EDSSPRSLTL SQLRADTPYN TYTRTGLPPT
PIALPSAAAL QAVTHPDKTD ALYFVANGTG GHTFSRTLKG HNQAVQTWRK IEDTRASEPK
KKQ