Gene Hneap_1691 details


Gene Information

Locus tag: Hneap_1691
Symbol:
ID: 8534849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1820127
End bp: 1821128
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 60%
IMG OID: 646384075
Product: TIM-barrel protein, nifR3 family
Protein accession: YP_003263563
Protein GI: 261856280
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
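
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1820127
END_BP = 1821128
GENE_LENGTH_BP = 1002
PROTEIN_LENGTH_AA = 333


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1002
print(expected_protein_length(GENE_LENGTH_BP))  # 333
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.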


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAT CGCCTCCGCT GCCGCCCATC ATTCTGCACG GGCGAACCGG AGACCCGGTC 
ATCATCGACC CACCGCTCGC GCTCGCCCCC ATGGCCGGTG TTTCAGACCG CCCTTTCCGC
CAGCTCTGCC GTGAATACGG CGCAGGGCTG GTGGTCACGG AAATGATGAG CGCCAAACCG
GAACTGCAAT CCAGCGCCAA AAGTCGCCTG CGCCAAATCG ACAGCAACGA CATCGAACCC
CGTGCGGTGC AGCTGCTGGG CAATGATCCG TTTGAGCTGG CCGAAGCCGC CCGATTTGCA
GTCTCGCAAG GCGCACAACT GATCGACCTC AACCTCGGCT GCCCTGCCAA AAAGGTATGC
AAACGCGCTG CAGGATCTGC ACTCATGGCA GAACCGGATA CCGTCGCCCG CCTTCTGGAA
GCACTGGTGG CCGCCGTCGA TTGTCCGGTT AGCCTGAAAA TGCGCACCGG TCCGGACCGC
GAGTGGCGCA ATGCCGTTGC GATCGCAAAA ATCGCCGAGA ATGCCGGGAT ATCCATGCTG
TCGATCCACG GGCGCACCCG CGCAGACCGC TACGAAGGCG AAGCGGAATA CGACACCATT
GCCGAAGTGG TTGCCGCCGT CGACCTGCCA GTGTTTGCCA ATGGCGACAT CACTACCCCT
CAGAAAGCCC GTCAAGTGAT CGCCCATACG GGCGCCGCGG GCATCATGGT CGGACGAGGC
GCGTTTGGTC AGCCCTGGAT TTTCTCCGCG TTAAAAGCCG AGCTCACCGG CCAACCACTC
CCCAGCCCCC CGGATCGCGC CGAACGTGTC GTTGCCATCA AAAAACAATT TGAAAAAATC
TATCATCATT ACGGCGACTC ATTAGGAATT CGCATCGCCC GAAAGCATCT GGGCTGGTAT
GCTGCGTCCC TTGAATTGGG TGAAGAAGAT CGAGCTGTAT TCAACCGCTT CGAGCACCCT
GAACAGCAAC GCCAATGGTT AGCGCAGCAC GCTAACGGAT AA
 
Protein sequence
MAESPPLPPI ILHGRTGDPV IIDPPLALAP MAGVSDRPFR QLCREYGAGL VVTEMMSAKP 
ELQSSAKSRL RQIDSNDIEP RAVQLLGNDP FELAEAARFA VSQGAQLIDL NLGCPAKKVC
KRAAGSALMA EPDTVARLLE ALVAAVDCPV SLKMRTGPDR EWRNAVAIAK IAENAGISML
SIHGRTRADR YEGEAEYDTI AEVVAAVDLP VFANGDITTP QKARQVIAHT GAAGIMVGRG
AFGQPWIFSA LKAELTGQPL PSPPDRAERV VAIKKQFEKI YHHYGDSLGI RIARKHLGWY
AASLELGEED RAVFNRFEHP EQQRQWLAQH ANG
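
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard one, which translation table 11 shares for amino-acid assignments (table 11 differs in which start codons it permits). Only the first ten codons of the gene are used in the example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the page's block spacing kept.
first_codons = "ATGGCTGAAT CGCCTCCGCT GCCGCCCATC"
print(translate(first_codons))  # MAESPPLPPI
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.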