Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3710 |
Symbol | |
ID | 5735574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4665688 |
End bp | 4667577 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280862 |
Product | asparagine synthase (glutamine-hydrolyzing) |
Protein accession | YP_001546474 |
Protein GI | 159900227 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0367] Asparagine synthase (glutamine-hydrolyzing) |
TIGRFAM ID | [TIGR01536] asparagine synthase (glutamine-hydrolyzing) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.411872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGGAA TCTGCGGAAT CGTCAGTACA AGCTTAATCG AGCCAAACAC TGTTCAGGCC ATGAACCAAC AATTGATGCA TCGTGGGCCA GATGGTGCGG GCACTTGGCA AACTGCCCAT GCTCAATTGG GTCATCGGCG GTTGGCCATT ATCGATTTAG TCACTGGCGA TCAGCCATTC AGCAGCCCCG ATGGTGCGTG GCAATTAGTC TTTAATGGCG AGATCTACAA TTTTCAGGCC TTGCGCCAAC AATTGCATGG GCTTGGGCAT CAATTTCGCA CCAACAGCGA TACCGAGGTA CTGCTGGCCG CCTTGATTGA ATGGGGCGAG CAGGCTTTTT CGCGTTTGGA AGGCATGTTT GCCTTTGCAG CCTGGCATCA ACCAAGCCAA AGTTTATGGC TGGTGCGTGA TAGTTTGGGC AAAAAGCCGC TTTACTATAG CCAAACCAAG CATGGATTAA TCTTTGGCTC GGAGATCAAA GCGCTGCTGC AATACCCTGA GCTTGATCAC AGCCTCAATC CACAAGCGCT GATCGCCTAT TTGACCTATG GCTATGTGCC AAATCCGGCC ACTTGGTATG CCAACATTCA GCAACTTGCC CCAGGCCATG CCTTGGTTTG GCAAGCGGGC AGCATTCGCG AATGGCAATG GTGGCAAGCA CGCCAGATCG CCCAACGACC ACGGCTTAAC ATCTCAGACA AAGCCGCAAT TGAACAAACG CGCCAGTTGG TGCGAGCCGC AGTTGAGCGA CGCTTGATCT GCGATGTGCC ATTGGGTGCG TTTTTGAGCG GCGGCTTAGA TTCAAGCATT GTGGTGGCCG AGGCTCAAGC TTTGTTAGGT CAGCAACTGC ACACATTTAG TATTGGCTTT GCTGGTGGTG GCTGGTACGA TGAAAGTTCC TACGCCGAAT TGGTTGCCAA ACAGCTTGGC ACTAAACATC AGTGTTTTAT GGTTGAGCCT GATGCCATCG CCAAATTACC CCAGTTGCTA GCCCACTACG ATGAGCCATT TCTCGATTCG TCGGCCTTGC CCATGGCCTT GCTCAGCCAA ATGACCCGTG AACATTGCAC GGTGGCCTTA TCGGGCGATG GCGGCGACGA GGTGTTTGCT GGCTACGAAC GCTTTGGCGC AGGCTTATGG ACGCAGCGCT ATCAGCAATT GCCGCGACCG CTACGCCAAA TGCTTGAGCA AACCATTGCC TTGTTGCCAC AAACCAAAGC GAGCAGTCGC TTGGCCCGGC TTAAACGGGT GCTGGCAAAA ATTCAATTGC CGCTTGCCCA GGGCTTTCCG CGCTGGCTGA TGGCTTTTAC ACCCGAAGAA TTAGCCGCTT GGGGCTTGCC GGCACTGCAA CCTAGCGCCG CCGAACGCTT TAGCCAAGCA ACCCAAGGCG TTACTGATCC GCTGGCTCAA TTAATCTGTT ATAACCTTGG GAGTTATTTG CCCGATGATC TGTTGGTCAA GGCCGATCGT ATGAGCATGG CCTATGGTTT AGAAGTTCGC TCGCCATTTT TGGATCAGCA ATTGGTTGAG TGGGCTTTGC AATTGCCAGC CAATTTGCAT TGGCGCGGTG GGCGTGGCAA GTGGCTTTTA CGCCAAGCCT ATGCCGAACG CCTGCCAAAA ATCATCATCG AACGACCTAA ACATGGCTTT GGTGTGCCGC TTGATCAATG GTTTCGCCAA CAACTCAAGC CAATGCTGCA CGATTACTTG CTCTCAACGA CCAGCCATGT TCAGCAATGG CTGCCCAAAC CAAAACTTGA GCAACTTTGC CACGCGCATT GGGCTGGTAC AATGAACGCT GGACATCAAC TTTGGACATT GCTCACCTTA GAATTATGGC TACAAACTCA TCACCAAATG AGGCCATATG CGCACAATTA TGTGGATTAA
|
Protein sequence | MCGICGIVST SLIEPNTVQA MNQQLMHRGP DGAGTWQTAH AQLGHRRLAI IDLVTGDQPF SSPDGAWQLV FNGEIYNFQA LRQQLHGLGH QFRTNSDTEV LLAALIEWGE QAFSRLEGMF AFAAWHQPSQ SLWLVRDSLG KKPLYYSQTK HGLIFGSEIK ALLQYPELDH SLNPQALIAY LTYGYVPNPA TWYANIQQLA PGHALVWQAG SIREWQWWQA RQIAQRPRLN ISDKAAIEQT RQLVRAAVER RLICDVPLGA FLSGGLDSSI VVAEAQALLG QQLHTFSIGF AGGGWYDESS YAELVAKQLG TKHQCFMVEP DAIAKLPQLL AHYDEPFLDS SALPMALLSQ MTREHCTVAL SGDGGDEVFA GYERFGAGLW TQRYQQLPRP LRQMLEQTIA LLPQTKASSR LARLKRVLAK IQLPLAQGFP RWLMAFTPEE LAAWGLPALQ PSAAERFSQA TQGVTDPLAQ LICYNLGSYL PDDLLVKADR MSMAYGLEVR SPFLDQQLVE WALQLPANLH WRGGRGKWLL RQAYAERLPK IIIERPKHGF GVPLDQWFRQ QLKPMLHDYL LSTTSHVQQW LPKPKLEQLC HAHWAGTMNA GHQLWTLLTL ELWLQTHHQM RPYAHNYVD
|
| |