Gene HY04AAS1_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1120 
Symbol 
ID6743936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1035653 
End bp1037317 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content38% 
IMG OID642750929 
ProductCarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_002121784 
Protein GI195953494 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAA ATACCAATAT AAAGAAAATA CTCATCGTAG GAGCAGGTCC TATTATCATA 
GGCCAAGCGG CAGAATTTGA TTATTCTGGT ACTCAGGCTT GCAAAGCCCT TATGAGAGAA
GGTTATGAAG TAGTGCTGGT AAATTCAAAC CCAGCCACCA TAATGACAGA TGAACAGCTT
GCCACTAAGA CTTACATAGA ACCATTGAGC GTTGAAGTGC TTGAGGAGAT TATAAAAAAA
GAACGCCCAG ATGTCTTACT ACCTACCTTA GGAGGGCAAA CGGCTTTAAA CTTAGCAAAG
GATTTATATG AATCGGGCAT ATTAGAAAGA TATGGTGTAG GAATAATAGG TGCCAACTAC
GAGGCTATCA AAAAAGGTGA AGACAGGGCT CTTTTTGCAA AAGCTATGGA GGAGATAGGT
CTAAAAGTTC CACCAAATGC CATAGTAAAT TCTATCTCAG AAGGTATGGC CTCTATAAAA
GATATAGGAT TTCCAGCTAT ATTAAGGCCA GCTTTTACAC TTGGCGGGAC AGGGGGTTCT
ATCGTTTACA ACTTAGAAGA GTTTCCAGCT AAGCTAAATG CTGCTTTAGA AGCCTCTCCT
ATACATCAGG TGCTTATAGA CAAATCTCTT ATAGGATGGA AGGAGTTTGA GCTTGAAGTC
ATAAGAGACA CAAAAGACAA CGTCGTTATA GTTTGTTCTA TAGAAAACTT TGATCCTATG
GGTGTTCATA CTGGAGACTC TATAACAGTA GCACCAGCCC AAACCCTAAC CGATAAACAA
TATCAGATGT TAAGAGATGC AAGCCTTGCC ATTATAAGAA AAATAGGTGT TGATACCGGA
GGCTCAAATA TCCAGTTTGC AGTAGACCCA AACAGCGACA ATTTTTACGT GATAGAGATG
AACCCAAGGG TTTCAAGAAG CTCAGCCTTG GCTTCTAAGG CCACTGGTTT TCCCATAGCA
AAAGTAGCTG CCCTTCTGGC GATTGGATAT ACCCTTGATG AGATAAAAAA CGACATTACA
AAAAATACGC CTATAAGCTT TGAGCCAAGC ATAGATTATG TTGTAGTAAA AATACCAAGA
TTTGATTTTG CAAAGTTTAA AGAATCAAGC AAAATTCTTG GTACCACCAT GAAATCTGTG
GGCGAAGTGA TGGCTATAGG AAGAACTTTT AAAGAAGCTT TCATGAAAGC CATAAGAAGT
TTAGAAACCG ATAATCCATA TCTCTTTATG AAAGATTACG AAGCGCTTTC TTACGATGAG
CTTCTTACAA ACATAAGGAT ACCAACACCA GAACGTATTT TTTATATAAA AGAAGCGTTT
ATGAGGGGTA TAAGCATAGA AAAAGTGAAT GAAGTAAGCC ACATAGACAA ATGGTTTTTA
CATCAGATAA AAGAGCTTGT AGAGGCTTAC AAACAAGATA TACCTTTTGA TAAAGATTAT
ATATTTGAGC TAAAAATATT GGGCTTTTCA AATAAAGAAA TAGCTAAAAA GTTTAACAAG
ACAGAAAAAG AGATAGAAGA GCTTTTGGAA GGTCTTATGC CTACTTTTAA AGCGGTGGAC
ACCTGCGCGG GGGAGTTTAG GGCTTATACG CCTTATTATT ACTCCTCTTG GGAATATCCA
TACTATAAAA TCGGGCAAGA AGAAGCTATT TTTGACAACG ACTAA
 
Protein sequence
MPKNTNIKKI LIVGAGPIII GQAAEFDYSG TQACKALMRE GYEVVLVNSN PATIMTDEQL 
ATKTYIEPLS VEVLEEIIKK ERPDVLLPTL GGQTALNLAK DLYESGILER YGVGIIGANY
EAIKKGEDRA LFAKAMEEIG LKVPPNAIVN SISEGMASIK DIGFPAILRP AFTLGGTGGS
IVYNLEEFPA KLNAALEASP IHQVLIDKSL IGWKEFELEV IRDTKDNVVI VCSIENFDPM
GVHTGDSITV APAQTLTDKQ YQMLRDASLA IIRKIGVDTG GSNIQFAVDP NSDNFYVIEM
NPRVSRSSAL ASKATGFPIA KVAALLAIGY TLDEIKNDIT KNTPISFEPS IDYVVVKIPR
FDFAKFKESS KILGTTMKSV GEVMAIGRTF KEAFMKAIRS LETDNPYLFM KDYEALSYDE
LLTNIRIPTP ERIFYIKEAF MRGISIEKVN EVSHIDKWFL HQIKELVEAY KQDIPFDKDY
IFELKILGFS NKEIAKKFNK TEKEIEELLE GLMPTFKAVD TCAGEFRAYT PYYYSSWEYP
YYKIGQEEAI FDND