Gene Hneap_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1147 
Symbol 
ID8534299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1246475 
End bp1247518 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content54% 
IMG OID646383536 
Productdihydroorotase 
Protein accessionYP_003263030 
Protein GI261855747 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA TTGTGATCCG CCAACCGGAC GATTGGCATG TGCATCTGCG CGATGGCGCC 
ATGCTTAAAG CCGTCGCACC ATTTACAGCA AAACAATGCC ACCGGGCGAT CATCATGCCC
AACCTGGTCC CACCAGTGAC AACAGTTGCC GAGGCCATGG CCTATCGAGA CCGCATACTT
GCGGCCATTC CAGAAGGCAT AGCATTCGAG CCGTTGATGA CGCTTTACCT CACCGAACAG
ACAACGGTTG AAGAGCTGGC CGCCGCAGCG GCCAACCCAC ATATCCATGC CGTAAAATTG
TATCCGGCAG GCGCGACAAC CAACTCGGAT CGAGGCGTGC GTGATCCCTT AAGCATCGAT
CACCTGTTGG GCGAATTGGC TCGTTCCGGT TTGCCCTTGC TGGTTCATGG CGAAGTGGTT
GATCCGAACA TCGATATTTT CGATCGTGAA GCGGTATTTA TTGACCGTGT ACTCACGCCC
ATTCTGAATC GCTTCCCAGA TTTGCGACTG GTAATGGAGC ACATCACGAC AACGCAGGCG
GTCGAGTTCG TCATGAGCCA ATCGGAGCGG GTCGCTGCCA CCATCACAGC ACATCATTTG
CTGTACAACC GCAATGCCAT GCTTGTCGGC GGCATCCGGC CACACTATTA CTGCTTGCCC
ATTCTCAAGC GTGAAACGCA TCGTCAATCC CTGCTGAATG CCGCCACGTC GGGTCATCCA
AAATTCTTCA TGGGTACCGA TACGGCGCCA CACACCCAAT CAAGCAAAGA ATCCGCCTGT
GGTTGTGCTG GCGCATTTAC CGGCTACGCC GCATTAGAGT TATATGCCGA GGCATTTGAT
GAGGTTGGCA AGTTAGATCA ACTAGAGGCC TTTACCAGCT TGAACGGCCC GCGTTTTTAC
CGGCTGCCTA TCAATGAAAA ACGAGTACGG CTGACGCGAA CCACCACACA AGTGCCCGAC
AGCTTCCCTG TAGAGGACAA TCAGCACCTT GTGCCGTTAC GCGCGGGTGA GTCGATTGCC
TGGCAATTTG AGCTGCTCGA ATGA
 
Protein sequence
MNEIVIRQPD DWHVHLRDGA MLKAVAPFTA KQCHRAIIMP NLVPPVTTVA EAMAYRDRIL 
AAIPEGIAFE PLMTLYLTEQ TTVEELAAAA ANPHIHAVKL YPAGATTNSD RGVRDPLSID
HLLGELARSG LPLLVHGEVV DPNIDIFDRE AVFIDRVLTP ILNRFPDLRL VMEHITTTQA
VEFVMSQSER VAATITAHHL LYNRNAMLVG GIRPHYYCLP ILKRETHRQS LLNAATSGHP
KFFMGTDTAP HTQSSKESAC GCAGAFTGYA ALELYAEAFD EVGKLDQLEA FTSLNGPRFY
RLPINEKRVR LTRTTTQVPD SFPVEDNQHL VPLRAGESIA WQFELLE