Gene Hoch_1778 details

Gene Information

Locus tag: Hoch_1778
Symbol:
ID: 8544160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2456019
End bp: 2457098
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 71%
IMG OID: 646386485
Product: L-asparaginase, type I
Protein accession: YP_003266220
Protein GI: 262195011
COG category: [E] Amino acid transport and metabolism
              [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D
TIGRFAM ID: [TIGR00519] L-asparaginases, type I
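
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2456019
end_bp = 2457098

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # 1080 bp
print(protein_length_aa, "aa")  # 359 aa
```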


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.285439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.276697
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG TTCCCAACAT CCTCCTCATC TACACCGGCG GCACCATCGG CATGCGCAAG 
ACCCCGGCCG GCTACCAACC CGAGCCCGGC TCGCTGCAGC GCCTGCTCAG CGAGCTGCCG
CGCTTCTCCG ATCCCGACGT GCCCCACTTC GACATCGCCG AGTTCGCGCC GCTGCTCGAC
AGCGCCGATA TGAACCCCTC ACACTGGCTG CGCATCGCCG AGATGGTGCG CGACAACTAC
GAGGACTACG ACGGCTTCCT GGTGCTGCAC GGCACCGACA CCATGGCCTT CACCGCCTCG
GCGCTGTCGT TCATGCTCGA GCGCCTGGCC AAGCCCGTGC TGCTCACCGG CTCGCAGATC
CCGCTCGAGG AGACCCGCAA CGACGCCCAG AACAACCTGC TCACCGCGCT CACCATCCTC
GGCCGCGACC ACGCCCGCCT GCCCGAGGTG CTGATCTACT TCGCCGGCCT GCTGCTGCGC
GGCAACCGCG CCACCAAGGT CTCGGTCGGC GAGTTCGCGG CCTTCGAGTC GCCCAACTTC
GCGCCCCTGG GCCGCGCCGG CATCGACATC GACATCGACT GGCGGCGCGT GCTGCCGCCG
CGCGCGCGCG CCAGCGAGGC CGTGCAGGTG GTCCCGGTCG GCAGCGCCAA CGTGGCCGCC
TTCCGCCTGT TCCCCGGGCT CAAGCCGGCG CTGCTCGAGG CCGTGCTCGC GGCCCCGGTG
CAGGGCGTGG TGCTCGAGTG CTACGGCGCC GGCAACGCGC CCACGGCCGA TCCCGCGTTC
ATGCGCGTGA TCGCCGAGGC CACGGCCCGC GGCGTGGTCC TGGTCGATGT CTCGCAGCCG
CTGCGCGGCT CGGCCGATCT GCGCCTGTAC GCCACCGGGC GCGCGCTGCT CGACGCCGGC
GTGGTCGGCG GCTACGACAT GACCGCCGAA GCCGCGCTGG CCAAACTCGC CTACCTGTTC
GAAAAAGGCC ACGGCCCCGA GCGCGTCAAG GAGCTGGTGC AGACGCCCCT GGTCGGCGAA
CTCACGCGCG CCGACGTGCC CACCTGGACG CCCTTCCGGG CCGTGGATCG CGACTCCTGA
 
Protein sequence
MSAVPNILLI YTGGTIGMRK TPAGYQPEPG SLQRLLSELP RFSDPDVPHF DIAEFAPLLD 
SADMNPSHWL RIAEMVRDNY EDYDGFLVLH GTDTMAFTAS ALSFMLERLA KPVLLTGSQI
PLEETRNDAQ NNLLTALTIL GRDHARLPEV LIYFAGLLLR GNRATKVSVG EFAAFESPNF
APLGRAGIDI DIDWRRVLPP RARASEAVQV VPVGSANVAA FRLFPGLKPA LLEAVLAAPV
QGVVLECYGA GNAPTADPAF MRVIAEATAR GVVLVDVSQP LRGSADLRLY ATGRALLDAG
VVGGYDMTAE AALAKLAYLF EKGHGPERVK ELVQTPLVGE LTRADVPTWT PFRAVDRDS
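
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record. The sense-codon assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in which codons may act as start codons), so a standard codon table is used here. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these sense-codon assignments with the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCGCGG TTCCCAACAT CCTCCTCATC TACACCGGCG GCACCATCGG CATGCGCAAG"
last_line = "CTCACGCGCG CCGACGTGCC CACCTGGACG CCCTTCCGGG CCGTGGATCG CGACTCCTGA"

print(translate(first_line))  # MSAVPNILLIYTGGTIGMRK
print(translate(last_line))   # LTRADVPTWTPFRAVDRDS
```

Each line of the gene sequence holds 60 bases, so every line starts on a codon boundary and can be translated independently. The two outputs match the first 20 and the last 19 residues of the protein sequence.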