Gene Hneap_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2201 
Symbol 
ID8535365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2367217 
End bp2368515 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content59% 
IMG OID646384582 
Productpeptidase M24 
Protein accessionYP_003264064 
Protein GI261856781 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACAAC GCGATGACAT GACTTTTCCG ATGGGGGAAT ACACTCGTCG GCTGAACGAA 
TTGCGCACCC GGATGCAGGA ACGATTGCTT GATGCCGTCA TCATCACCGA CCCGGAAAAC
CTGATGTATC TGACCGATTA TCAGACCACG GGCTATTCCT TTTTCCAGGC GCTGGTCGTG
CCGCTGGAAG ACGAGCCGTT CATGATCACC CGCAACATGG AAGAATCCAA TGTGATCGCC
CGTACCTGGG TCGAACTGAC GCGCCCCTTC CCCGATGGGG GCGACGCGAT CCAGATGCTC
GTTTCCAGCC TCAAGGAATT CGGCCTGTCG CACAAGACGC TGGGTTACGA GCGCAACAGC
TATTTTTTCC CGGCCTACCA ACAGGATTCG CTGCGGACCA GCCTGACCGA TGCCCGATTG
CAGGACTGCT TTGGCATTGT CGAATGCGGT CGGCGCACCA AATCCAGCGT TGAGATCGAG
ATCATGCGCA AAGCAGCTAT CGCGACCGAG GCGGGCATGA AAGCCGGGCT TGATGCTTGC
CGCGCCGGGG TCACCGAAAA CGAAATCGCC GCGGAAATTT CGGCGGCCAT GTTCCGTGCC
GGTGGCGAAG CGCCTGCGGT GATGCCCTAT GTCACCTCCG GGCCGCGCAC CATGATCGGT
CATGCCACCT GGGAAGGCCG CGTGGTGCAG CCCGGCGAGC ATGTGTTCAT GGAAGTCGGC
GGCTGTTACC GGCGCTATCA CACAGCCATG ATGCGCACCG CCGTGCTGGG CGAACCGACC
GATTACATGA TGCAGGCACA GGAACGAATG AAGCTGGCAC TCGAGCAGGT CAAGGCGCTG
ATCCGCCCCG GCGTGACGGT TTCCGATGCC GACAACCTCG TGCGCAGCAT CATGACGGTC
GACGACAAGC ACGGCAAACT CATTACCCGC TCCGGCTATT CGATTGGCAT CGCATTTCCG
CCGAGCTGGG ACGAGGGCTA CATTTTAAGC CTGATGCACG GCGACAAAAC CGTCCTGCGC
GAGGGCATGA CCTTCCACAT CATCCCCTGG GCATGGGGCG TGGACGGCGA CAAGACATGC
GGCATCTCCG ATACCATTTA CATCACCAAG GATGGGTGCG AATCGTTCTT CACGCTGGAT
CAGGACTTTG TGATCAAACC GGAGGAAGGC AAGAAAGCGC TGCCACCATC GCCGCCACTT
GAAATCATGG TGCCGCAAAA CGTCACCCCC ATCGCCAGCA AAGAAGGCAA AAACAAGAAG
AGCCGCTCTA CCGGCAAAAA GGAGCGTGAA GCCGTATGA
 
Protein sequence
MRQRDDMTFP MGEYTRRLNE LRTRMQERLL DAVIITDPEN LMYLTDYQTT GYSFFQALVV 
PLEDEPFMIT RNMEESNVIA RTWVELTRPF PDGGDAIQML VSSLKEFGLS HKTLGYERNS
YFFPAYQQDS LRTSLTDARL QDCFGIVECG RRTKSSVEIE IMRKAAIATE AGMKAGLDAC
RAGVTENEIA AEISAAMFRA GGEAPAVMPY VTSGPRTMIG HATWEGRVVQ PGEHVFMEVG
GCYRRYHTAM MRTAVLGEPT DYMMQAQERM KLALEQVKAL IRPGVTVSDA DNLVRSIMTV
DDKHGKLITR SGYSIGIAFP PSWDEGYILS LMHGDKTVLR EGMTFHIIPW AWGVDGDKTC
GISDTIYITK DGCESFFTLD QDFVIKPEEG KKALPPSPPL EIMVPQNVTP IASKEGKNKK
SRSTGKKERE AV