Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2020 |
Symbol | |
ID | 5733909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2510310 |
End bp | 2511533 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279164 |
Product | phosphoribosylglycinamide synthetase |
Protein accession | YP_001544791 |
Protein GI | 159898544 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC ATTTTCTCTT AATCGGCACA GCTCGCGAGG TTCATCCCAA AATCAAGCAA CTTGGTCATC GCCTGAGTCT CTTGTGCCCG GTCAAAAATA TTAAAACACT CAAGAACCAT GGCGATTATG ATCGCATCGT TGGCATGGCA GCGAACGCAA CCATCGATCA ATGGATCGAG CAGGCACGCT TAATCCATTG CCATGCTCCG ATTGATGTGC TTGGTGGCTT CAATGAAGTG ACCCAGCATA TTGCTGCCGA TGTTGCCGCA GCATTGAAAC TGCCCTATCA TAGCTCCGCG ACAATTCACT ATACGCGCCA GAAAGATGCA ATGCGTCAAG TCTTACGCGA GGCGAACCTT GATCCAACCA TAGCGCAATC GGTCGAAACA GCCGACGACA TTAAGGTATT TGGCGAACGC TATGGCTATC CATTGGTCTT AAAGCCGCGT GATGGTCGAG CAAGCATGGG CGTTTCGATT ATTCGTTCTG CTGCTGAGAT TGCCACTGCC CAAGCATGGT TTGAAGCCGG TGCAGCAGGC CATGCAATGT TGGTCGAAGA ATATTTAAGT GGCGAAGAAT ATAGTGTTGA AGCCTTTTCG GAATATGGCC AACATCATAT AATCTGCGTC ACCCAAAAAT TCAAAGACCC CCAAACCAGT GTTGAGACTG GCCATTGCTT GCCAGCACCA CTGCCTCAAG CAACAGTTGC AGAGATTACA AGCTTTGTCG AACAGGTTTT AACGGCGCTT GATCTACAGA ATGGCCCATC GCATACCGAA ATTATTCTGA CTGCTCGCGG ACCACGGATT GTCGAAACTC ACGCCCGGCT TGCCGGCGAT AGCATTGTTG AATTGATTGA ACTAGCCAGC GGAATTGATG TCGATCAACT CTGGATTAAA CAGGTCGCTG GCGAAACAGT CTTTGAGCAA CTTCCACGCA GCTTTCAACG CTGGGCAGCA ATTGCCTATG CCAGCCCGCA TGCCATTGGC AGACTGGAAC GGGTAGATGG TCAAGAACAA GCAACGCATT GTCCCGGCGT GGTTAAAGCC GACGTACTCC AAGAGCTTGG CAGTTTCTTT CAAGGTGCAA GCGATTCATT TGCTCGAGGC GCATTTGCGA TCGCCACTGG CGATACTGCA ACTTTAGCAA CTATGCAAGC TCGTCAAGCG GCTCAGTGTT TTCGATTTCT GGTTAACTGT GCCGATCCTA GTACACACAG CTGA
|
Protein sequence | MSRHFLLIGT AREVHPKIKQ LGHRLSLLCP VKNIKTLKNH GDYDRIVGMA ANATIDQWIE QARLIHCHAP IDVLGGFNEV TQHIAADVAA ALKLPYHSSA TIHYTRQKDA MRQVLREANL DPTIAQSVET ADDIKVFGER YGYPLVLKPR DGRASMGVSI IRSAAEIATA QAWFEAGAAG HAMLVEEYLS GEEYSVEAFS EYGQHHIICV TQKFKDPQTS VETGHCLPAP LPQATVAEIT SFVEQVLTAL DLQNGPSHTE IILTARGPRI VETHARLAGD SIVELIELAS GIDVDQLWIK QVAGETVFEQ LPRSFQRWAA IAYASPHAIG RLERVDGQEQ ATHCPGVVKA DVLQELGSFF QGASDSFARG AFAIATGDTA TLATMQARQA AQCFRFLVNC ADPSTHS
|
| |