Gene Hneap_0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0131 
Symbol 
ID8533245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp122315 
End bp123301 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content56% 
IMG OID646382509 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003262042 
Protein GI261854759 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTCA AAAACACTCC TGCCACTCTG CTGCCCTTCA ACATGGTCTC GGGCCGATAT 
GAGGGATTAG AGCCTGTCAG GGAGGCATGG TCACAGCTCG TATGCCTGGA TGTTATGCCG
GCCGAGCCGA CGGTTCCCAA ACTCAGCGCA TCGATCTGGA GCTTAGGCGA TCTGAAAATC
GGCACATTCG ATGGATCGGC TGTCAAAATG GTGCGGACCA CTAAGTTGGC TCAGGATTGC
GCTGACGACT TGATCATATG CATCGAACGC AAGACGCCTA CTCGTGCCAC TTGGGGAGGT
GCTGGTGCGC GAGTTTTCGT ACCCGGCGAC ATTCACCTCT GGCGAGCGGA CGGCACCATG
CAGTGCGAAA CCAAAGGGGC ATTCTCTGCA TTGCTGTTGT CGATTCCGCG CACGGTTTTG
AGCAATCATG GGATCGATAT TGCCCCATTA TTACGTGAAG GCGGGATTTG CGGTAACAGC
CCCGAAACGC GGCTTCTTTG CCGTTACACT GATTCGGTCT TAGGCGAGGT TGATGCGTTA
TCCCCGATGG GGGTATCGTG CTGCCTCTCC CATCTCGTTG ATCTTGCGAT GCTTGTGCTG
GGAGGGCGGA ACACAGTCCG GCTTGCCAAT CGGCACCGTG TCCGCTCTGT TCGACTGGCG
TCAATCAAGT CGGATATCGA TGCCCATCTG CTGAACCCCG AGCTTTCAGT CGGCTGGGTG
CTGAGGCGGC ACCGGATTTC CGAGCGTTAT CTCCGCTCTT TGTTTGCTGA TGAGGACACG
AGTTTCACCC GTTTCGTGCT GGAGCGACGG CTAATGCGTG CCCATGCCGC GCTTGACCAG
TCAGGTCGTA GCATCAGTGA GATTGCCTAC GATTGTGGCT TTTCAGACCT GTCGTGGTTC
AACCGAGCCT TTCGGCAGCG GTTTGACATG ACACCATCCC AGGCCCGTGC GGGCCTGCTT
GATCTCAGTC TAGAACGGGA TGGGTAA
 
Protein sequence
MNLKNTPATL LPFNMVSGRY EGLEPVREAW SQLVCLDVMP AEPTVPKLSA SIWSLGDLKI 
GTFDGSAVKM VRTTKLAQDC ADDLIICIER KTPTRATWGG AGARVFVPGD IHLWRADGTM
QCETKGAFSA LLLSIPRTVL SNHGIDIAPL LREGGICGNS PETRLLCRYT DSVLGEVDAL
SPMGVSCCLS HLVDLAMLVL GGRNTVRLAN RHRVRSVRLA SIKSDIDAHL LNPELSVGWV
LRRHRISERY LRSLFADEDT SFTRFVLERR LMRAHAALDQ SGRSISEIAY DCGFSDLSWF
NRAFRQRFDM TPSQARAGLL DLSLERDG