Gene Hneap_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1105 
Symbol 
ID8534253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1194886 
End bp1198512 
Gene Length3627 bp 
Protein Length1208 aa 
Translation table11 
GC content58% 
IMG OID646383490 
Producturea carboxylase 
Protein accessionYP_003262988 
Protein GI261855705 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.480964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGATG TTCTCATGCC TAACACGATG CCGAACATGA TGTTCGATAA GGTTCTGATC 
GCCAATCGGG GGGAAATAGT AGTCCGAATT GCGCGTACCC TGAAAAAAAT GGGTGTCACC
TCCGTCGCCA TTTACAGTGA TTCCGATGCC GACAGCCCAC ATATCCGCGC CTGTGATGAA
GCGATTGCAT TGGGTGGCCA AACGGCGGCA GAAAGTTACC TTCAGGCGGA TCGGATTCTC
GCACTGGCAA AAACACACGG TGTGCAAGCC ATTATTCCCG GTTACGGTTT TCTCTCCGAA
AACGCCGACT TCGCTGAACG CTGTGCCCAC GAAGGGATCG CATTCATTGG TCCGACACCC
AATCAATTGC GTGAATTTGG TTTAAAGCAC CGCGCACGTG AACTGGCCAC CGCAGCGAAA
GTGCCGCTCG CACCGGGTAG CGACTTGCTC AAGGATCTGC CAGCTGCATT GCATGAGGCA
GAACGAATCG GTTATCCGAT CATGCTCAAA AGCACGGCTG GGGGCGGTGG GATTGGGTTA
AAGCGTTGCG CCGATGAACC CGCGCTCATT GACGCGTTCG AAGCAGTGGC CGGTATGGGC
GCACGCTTCT TTGGCGATGG TGGCGTGTTC GTCGAGCGAT TCATCGATCA TGCCCGCCAT
GTTGAAGTGC AGATCTTCGG GGATGGCAAG GGTCGCGTGG TTGCGCTTGG CGAACGGGAT
TGCTCACTGC AACGGCGCAA TCAGAAGGTC ATCGAGGAAA CACCCGCGCC GAATTTGCCA
GCCAAAACAC GCGCGGCCAT GCTGGCATCC GCCGAGCGAT TGGGCGAGTC CATCGCCTAT
CAATCCGCGG GTACGGTCGA ATATATCTAC GATGCGCCCC GCGATGAATT CTATTTTCTT
GAAGTCAATA CTCGCCTTCA GGTTGAACAC CCCATCACCG AAGAAGTCAC AGGCGTTGAT
TTAATCGAAT GGATGGTTTC TCTGGCTGCT GGTGTGCCCT TCGAACTGAG CGCCCCCACC
CCGCATGGTC ATGCCATTGA AGTGCGCGTG TATGCCGAAG AACCGCTGCG CCACTTCCAG
CCAGCGCCCG GCCAACTCAG CAATGTGACG CTGCCCGATG ATAAATTCGT CCGCGTGGAT
ACCTGGGTTG AAACCAGCAC CACGGTGCCT GCCCAGTTCG ATCCGATGCT GGCCAAGATC
ATCGTTACGG GCGAGACGCG CAGCAAAGCG CTGTCGAATC TGGCTGAAGC ACTTGCCAAA
ACGCGCTTTG ACGGCATTTC AACCAATCTG CCGTTTCTAC GCGACCTGCT TTCCCTGCCG
GATTTCGTGG CGGGCACCCA CAGCACCGGC AGTATCGACC ACTTTCTTGC CTCCGGAGCG
TATCGGCCAC CGGCCATCGA AATCATCAAA CCGGGCACCT ACACCACGGT GCAAGATTTT
CCTGGCCGAT TGGGGTTGTG GCACATCGGG GTGCCGCCTT CCGGCCCCAT GGACGATTAC
GCCTTGCGTC TGGCAAACCG AATCGTTGGC AACACGCCGG ACATGGCGGG GCTGGAGTGC
ACGCTGGTGG GCCCATCCCT CAAATTCCAT CGGGATAGCA TGGTCGCCAT TACGGGCGCC
GAAGCCGACA TTCGGCTGGA TGATCAGAAC GTCCAAGCCG GTCGACCCAT CGCCATCGAA
GCCGGGCAAA CTTTAAGCAT CGGACAGGTT CAGAGCGGCG CTCGCGTATA TCTGGCCATC
CGGGGCGGCA TCGACGTGCC CGAGTATCTG GGCAGCCGTT CAACATTTGC ACTCGGCAAA
ATGGGTGGGC ATGCAGGGCG CATCCTGCAG GTCGGCGACA TCCTGCCCAT CGGCACGGCG
ACGGCGGCAT TCGCGCCCAA GGTGGCGGAT GCGGCGCTAA TGCCCGCCTA TGGGAATCAC
TGGGAAATCG GCGTGCTTTA TGGACCGCAC GGAGCGCCGG ATTATTTCAT GCCCGAGGCC
ATCGAGCAGT TCTTCACCAC AGATTGGGCG GTGCATTACA ACAGCAACCG CCTAGGCATT
CGCTTGTCGG GCCCCAAACT CAGCTGGGCG CGCAGCGATG GGGGCGAGGC AGGTTTGCAT
CCTTCGAATA TCCACGATAC GGAATACGCC GTCGGCTCGA TCAATTTTAC CGGCGACATG
CCCGTCATTC TCACGCGCGA TGGCCCGAGC CTGGGTGGAT TTGTCTGCCC TGCCACCATT
GTGCGCGCCG AGCTCTGGAA GGTCGGGCAG CTCAAGCCCG GCGATACCAT TCGGTTCATC
CCCATCAGTT ATGCACAGGC GCGTGCGCTC GAACAGGCTC AGGATGCAGC AATTGATACC
CTTCTGTCAC CACATCCCCC TTCATTTTTG CAAGCAGAAT TGGGACAGTG CATTTTGCTG
GATGTGCCCG CGCAAAATAA CCTGCCACGG ATGACGATTC GCCAAGCCGG CGATGGTTAT
GTATTGCTCG AATATGGCGA GAACATTCTC GATTTGGCAT TGCGGCTGCG GGTTCACGCC
TTGATGCAGC ATCTACAAAC CGATCCGGTG CCTGGCATTC TTGAGTGTTC CCCCGGCGTG
CGCTCGTTGC AAATTCGCTA CGAGCCCAAA CGGCTCACGC AGTCGGGTTT AGTCGCCCGA
TTGGCCGACA TTAATCAGCA ATTGGCCGAT GTACGTACCT TATCGGTGGA TTCGCGCATC
GTGCATTTGC CGATGGCATT TGAAGACAGT GCCACGCTGG ATGCCGTTGC ACGCTATCGC
CAATCGGTGC GCGATACGGC ACCCTGGTTA CCGAGCAACA CCGAATTTAT TCGGCGTATC
AATGGTTTGC CAGATATCAA GGCCGTCACC GATACCATTT ATTCGGCCTC CTACATGGTC
TTGGGTTTAG GCGACGTGTA TCTCGGTGCG CCTTGTGCGG TGCCGCTCGA CCCCCGGCAT
CGTCTGCTTA CCTCCAAATA CAACCCGGCT CGAACCTACA CGGCAGAAGG CACGGTGGGC
ATCGGCGGCG TGTATATGTG CATCTACGGC ATGGATTCGC CCGGCGGTTA TCAACTCGTC
GGGCGAACGC TGCCCATCTG GAACCGTCGC CCCGCGCACC CAAACTTTGA GGCTGGAAAA
CCTTGGCTGC TGCGCTTCTT CGATCAGGTG CGGTTTTACC CGGTCAGCGA GGCCGAGCTT
ACCGAAATGC GCAGCGCCTT TGCCACGGGT GCATTGGATG TCAAAACTGA GGCAGTCGTT
TTTCGACTGG ATGAACATGA GGCATTCCTG GCAGCGAATC AGTCATCAAT TAACGAATTC
AAGGAGCGCC AAGCGGTGGC CTATCAGGCA GAAGTCAGCC TCTGGCAGGA GGATGAGGGC
AGTCTGAATA TCCGCGAGGC CGTGCTGACT GAGACCGAAG TTGAGGGTGA GCCGATTGCA
GCGAGCATCA GCGGCAATAT CTGGAAGCTG CTGGTGTCTC CCGGTGAAGC CGTACAGGCC
GGCCAGGTGG TCGCCATCAT TGAAGCCATG AAGATGGAGT TTAGCGTCGA AGCACCGCGG
GACGGAGTCA TCGCACAGTG CGCCTGCACG CCGGGGCAAC TCGTGCAGAT GGGGCAAACG
CTGGTCACGC TGGAGTTTCC CGCATGA
 
Protein sequence
MPDVLMPNTM PNMMFDKVLI ANRGEIVVRI ARTLKKMGVT SVAIYSDSDA DSPHIRACDE 
AIALGGQTAA ESYLQADRIL ALAKTHGVQA IIPGYGFLSE NADFAERCAH EGIAFIGPTP
NQLREFGLKH RARELATAAK VPLAPGSDLL KDLPAALHEA ERIGYPIMLK STAGGGGIGL
KRCADEPALI DAFEAVAGMG ARFFGDGGVF VERFIDHARH VEVQIFGDGK GRVVALGERD
CSLQRRNQKV IEETPAPNLP AKTRAAMLAS AERLGESIAY QSAGTVEYIY DAPRDEFYFL
EVNTRLQVEH PITEEVTGVD LIEWMVSLAA GVPFELSAPT PHGHAIEVRV YAEEPLRHFQ
PAPGQLSNVT LPDDKFVRVD TWVETSTTVP AQFDPMLAKI IVTGETRSKA LSNLAEALAK
TRFDGISTNL PFLRDLLSLP DFVAGTHSTG SIDHFLASGA YRPPAIEIIK PGTYTTVQDF
PGRLGLWHIG VPPSGPMDDY ALRLANRIVG NTPDMAGLEC TLVGPSLKFH RDSMVAITGA
EADIRLDDQN VQAGRPIAIE AGQTLSIGQV QSGARVYLAI RGGIDVPEYL GSRSTFALGK
MGGHAGRILQ VGDILPIGTA TAAFAPKVAD AALMPAYGNH WEIGVLYGPH GAPDYFMPEA
IEQFFTTDWA VHYNSNRLGI RLSGPKLSWA RSDGGEAGLH PSNIHDTEYA VGSINFTGDM
PVILTRDGPS LGGFVCPATI VRAELWKVGQ LKPGDTIRFI PISYAQARAL EQAQDAAIDT
LLSPHPPSFL QAELGQCILL DVPAQNNLPR MTIRQAGDGY VLLEYGENIL DLALRLRVHA
LMQHLQTDPV PGILECSPGV RSLQIRYEPK RLTQSGLVAR LADINQQLAD VRTLSVDSRI
VHLPMAFEDS ATLDAVARYR QSVRDTAPWL PSNTEFIRRI NGLPDIKAVT DTIYSASYMV
LGLGDVYLGA PCAVPLDPRH RLLTSKYNPA RTYTAEGTVG IGGVYMCIYG MDSPGGYQLV
GRTLPIWNRR PAHPNFEAGK PWLLRFFDQV RFYPVSEAEL TEMRSAFATG ALDVKTEAVV
FRLDEHEAFL AANQSSINEF KERQAVAYQA EVSLWQEDEG SLNIREAVLT ETEVEGEPIA
ASISGNIWKL LVSPGEAVQA GQVVAIIEAM KMEFSVEAPR DGVIAQCACT PGQLVQMGQT
LVTLEFPA