Gene Hneap_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1404 
Symbol 
ID8534560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1512690 
End bp1513883 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content58% 
IMG OID646383795 
ProductCupin 4 family protein 
Protein accessionYP_003263285 
Protein GI261856002 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGATTT CACAGGTGTT GGGTACCTTG TCCGTGGCGG ATTTTCTGCG TGATTACTGG 
CAGAAAAAAC CGGTGTTGAT CCGTCAGGGC GTGCCCGGCT TCGAATCGCC TCTATCTCCC
GAAGAACTAG CCGGTCTGGC CTGTGAGGAA GACGTGCCCG CGCGCTTGAT TCTCGAATCG
GCCGGCGCGC GGCCCTGGAC GTTGCGTCAT GGCCCATTCA CCGAAGCGGA CTTCACAAGC
CTGCCCGAAG ACGGTTACTC GTTGTTGATC ACCGATTGCG AAAAGCTGAT CCCGGACTTG
ATGAATTTGG TCCAGCACTT CCGTTTTGTA CCCGATTGGC GGATCGATGA CCTGATGATT
TCCTACGCAC CACCCGGTGG TTCGGTCGGG GCGCATATCG ATGAATATGA TGTGTTCCTG
TTGCAGGGGA TGGGGCGGCG CAAGTGGATG ATCGAGTATC CGCCGAAGCA CAGTGATTTT
GTGCCAGATC TGGATATTCG CCTGCTGCAA GAATTCGAAC CGACCGAAGA ATGGGTGCTG
GAACCGGGCG ACATGCTCTA TCTACCGCCC GGTGTGCCGC ATCACGGCGT AGCGGTCGAC
CACTGCATGA CGTATTCGAT CGGCTTTCGT GCGCCCTTGC TGCACGAGAT GGCGGCTGGC
GTCACCGACC GTCTGATTAC CGACATGGAT CAAGCGGCTC GTTACGGTGA TCCCGATTTG
CAGGCACCTG CGAATCCCGG CGCATTGGAT GCCTCATCGC GTGTCAAGTT GCGCGCGATC
TTGCAATCGG TACTCGATCA GGATGATGCC GTGCTGGATC GATTCATTGC CGAAACCCTC
ACCGAGCGCC CGCTGGATCA CGCGGGTTTT TATCCACAAA ACGATCCTTT GGACGCGAAG
GCCCTGCGCG GTGAAATCGC CCATAGCGGC GACACCCTCA TGCGCACACC GGCTGCGCGT
TTGTTGCTGG TTGAAGATGA GCCCGATTCG GCTGGCGGTG CACTGGCCGT AGATGGTCAA
AGCACGCTCT TGAATGCCGA AATGCTCCCC TTGGCGCGCT TGCTTGTGAG TCAGGTTTTT
TATGATGCCG CCGAACTGCT GGCAGCCACC GAGTCTGAGG CCGCGGCTGA ACTTTTGCAG
AAACTGTATG CCGATGGGGT GGTGCAGTGG CAGCCGAACT TGCTGAGTGT TTAA
 
Protein sequence
MVISQVLGTL SVADFLRDYW QKKPVLIRQG VPGFESPLSP EELAGLACEE DVPARLILES 
AGARPWTLRH GPFTEADFTS LPEDGYSLLI TDCEKLIPDL MNLVQHFRFV PDWRIDDLMI
SYAPPGGSVG AHIDEYDVFL LQGMGRRKWM IEYPPKHSDF VPDLDIRLLQ EFEPTEEWVL
EPGDMLYLPP GVPHHGVAVD HCMTYSIGFR APLLHEMAAG VTDRLITDMD QAARYGDPDL
QAPANPGALD ASSRVKLRAI LQSVLDQDDA VLDRFIAETL TERPLDHAGF YPQNDPLDAK
ALRGEIAHSG DTLMRTPAAR LLLVEDEPDS AGGALAVDGQ STLLNAEMLP LARLLVSQVF
YDAAELLAAT ESEAAAELLQ KLYADGVVQW QPNLLSV