Gene Information |
Locus tag | Haur_0060 |
Symbol | |
ID | 5731932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 75032 |
End bp | 78229 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277181 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_001542840 |
Protein GI | 159896593 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
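The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the start and end positions are 1-based and inclusive (which is what the reported gene length implies) and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 75032
end_bp = 78229
gene_length_bp = 3198
protein_length_aa = 1065

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions the three fields agree with one another: 3198 bp is 1066 codons, of which 1065 encode amino acids.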
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC GCAGCGACAT TCACTCGATT CTGGTGCTTG GCGCAGGCCC AATCGTGATC GGGCAAGCAT GCGAATTTGA TTATAGCGGC ACTCAGGCGA TCAAAGCGCT TCGCGAAGAG GGTTTCCGCA TTATTTTGGT CAATTCCAAC CCAGCCACGA TTATGACCGA CCCGTCGTTG GCTGATCGTA CTTATATTGA GCCGTTGGAT GTGCCAACGG TCAGCGAAAT TATTGCCCGC GAACGGCCTG ATGCCTTGTT ACCCACGGTC GGCGGCCAAA CCGCGCTCAA TCTGGCCATG GATTTGCACC GCGCTGGCGT ATTGGAGCAA TATAATGTCC AGTTGCTCGG CGCAAATGTC GAAGCCATCG CTACTGCCGA GGATCGCGAG TTGTTCAAAC AGGCCATGGA TCGCATTGGC TTGCAATCGG CGCGTTCAGG CATCGCTCGC TCGCTCGAAG AAGCTCTCGA ATTAGTCAAA GCGACGGGCT TTCCTTCGAT TATTCGGCCT TCGTTTACTC TCGGCGGCGC TGGCGGCGGG ATTGCCTACA ACTCTGAGGA TTTTCGGCGA ATTGTCGAAG AGGGCTTGAA ACTTTCCCCG ATCGGCGAAC TATTGATCGA AGAAAGTTTG CTGGGCTGGA AGGAATATGA GCTTGAGTTG ATGCGTGATA GCGCTGGCAA TGGTGTGGTG GTCTGTTCAA TTGAGAATTT CGACCCCATG GGCGTGCACA CTGGCGACTC GATCACCGTT GCTCCGGCGA TGACCCTCTC CGACCGCGAA TATCAACGTT TGCGCGATAT GGGCTTGGCG GTGATGGAAG TTGTTGGTGT GGCAACTGGC GGATCGAATG TCCAATTCGC GATCAATCCC CACGATGGCC GCGTGATCGT GATCGAGATG AATCCACGGG TTTCGCGTTC ATCGGCGCTA GCATCCAAAG CAACCGGCTT TCCAATTGCT AAAATCGCCG CGAAGTTGGC GGTTGGCTAC ACCCTGCCCG AATTGCCCAA CGATATTACC AAATCAACCC CAGCCTCATT CGAGCCATCG CTCGATTATG TGGTGGTCAA AATTCCACGC TGGAACTTCG AGAAATTCCC TGGTTCGCAG CCAGTGCTTG GCACAGCGAT GAAATCAGTC GGCGAGGCCA TGGCCATGGG CCGCACCTTC ACCGAAGCCT TGCAAAAAGC CTTGCGCTCG TTGGAAAATG GCCGCATGGG TTGGGGCGCT GATGGCAGCC AACCTGTGGC CCAAGAACGC TTGCGCGAAC TGTTGATCAC GCCAACGCCA CAGCGCATTT TTGCCATGCG CACCGCCTTC GACCTTGGCT GGACGGTTGA TCAAATTCAT GAATTGACCA AAATCGATTA TTGGTTCCTT GATCAGTTGG CTAGTTTGAT TGCGCTCGAA AAAGATGTGC GCAACTATGA TTTGGCGACG ATTCCCGCCG ATCTATTGCT GCAAGCCAAA CGCAACGGCT ATAGCGACCC GCAATTGGCC TATTTGCTGA ATGCAACCCC AGCAGAGGTA CGCCAACAAC GGTTTGAGCA CAATATCAAA CCAACCTATC ATCATGTTGA TACCTGTGCG GCGGAATTTC CAGCCCAAAC GCCCTATCTC TACTCATCAT ATGAAACCGA GAGCGAAGCC AATCCCAGCG ATCGGCGCAA AGTGATGATT TTGGGTGGCG GGCCAAATCG CATCGGCCAA GGGCTTGAAT TTGATTATTG CTGCTGTCAC GCGGTGTTTG CCCTGCGCGA GCTTGGCTAT GAAACGATTA TGGTCAACTG TAACCCTGAA 
ACGGTTTCAA CCGATTATGA TACCGCTGAT CGACTGTATT TTGAGCCATT GACTCTCGAA GATGTGCTGA ATATTGTCGA TGTCGAGCAA CCAGCTGGCG TGTTGTTGCA GTTTGGTGGT CAAACCCCAC TCAAATTGGC CAAAGGCTTA GAAGCCGCAG GCGTGCCATT ATGGGGAACC TCGCCAGCCG CAATCGACCG TGCCGAAGAT CGCGGCCAAT TTGGCGCAGT ATTGGCCGAA TGTGGCTTGC GGGCAACGCC GTATGGATCG GCAACCTCGT GGGATGAAGC GCGGGCAATC GCTGAGCGCA TCGGCTATCC AACGATGGTG CGGCCATCGT ATGTGCTGGG CGGCCAAGGT ATGGCAATCA TCCACGATCA AGCAGGGCTT GACGATTATT TGCGCCATAT TACTGAGGCC TCGCCCGAAC ACCCTGTGCT GCTCGACCGC TTCCTTGAAG ATGCGACTGA GCTTGATGTT GATGCGGTTG CTGATGGCGA AACCGTAGTA ATTGCTGGCA TGATGGAGCA AATCGAACGC GCAGGCATCC ACTCCGGCGA TTCGGCTTGT GTTATGCCAA CCGTCGGCAT CAAGCCTCAG GTGATTGAAC AACTCAAAAT CGCCACTGAT AAATTAGCCC GCGCTTTGGG CGTGATTGGC TTGATGAACG TTCAATTCGC GGTCAAGGAT GACGAAATCT ACGTGCTAGA AGTCAATCCA CGGGCTTCGC GTTCGATTCC CTATGTTGCC AAAGCCAGTG GGGTCGCTTG GGCCGCCCTT GCCGCCAAAG TAATGGCAGG CGTGAGCCTC AAACAACAAG GTATCAGCAG TGCGCCGGAG TTACGCGGCT TCCACGTCAA AGAAGTTGTG TTGCCATGGC GGCGCTTCCC GGGTGCGACG ATTGCCCTAG GCCCAGAAAT GCGCTCAACT GGCGAGGCCA TGGGTAGCGG CGATAGCTTT GGGGCGGCCT TTGCCAAAGC CCAAGCAGGC TGTGGCCGCA GCTTGCCAAC CAGCGGCGCA ATCTTTGTCA GTGTCAACGA CCACGATAAA CCAGCCTTAG TGCCAGTTGC CAAGCAATAT AGCGAACTCG GCTTCAAAGT GATTGCGACT GCTGGCACAG CCCAATATCT GCGTGAGCAT GGCTTGGTCG TCGAAACGAT CTACAAAGTT AATGAAGGCC GACCCAACGC CGCCGACTAT ATTATCAACG GCCAAGTTGA TATTATTGTC AACACGCCAT TGGGCCGTGC CTCGTTATTT GATGAGCAGG CAATTCGCCA AACTGCTTTG CGCCAAGGCG TGATTTCGAT CACCACCGTG GCCAGTGCTG CGGCGGTTGC CGATGCGATT CAAGCGCTGC GCCAAGGTGG CTTGGGCGTG CGAGCTTTGC AAGATTAA
|
Protein sequence | MPKRSDIHSI LVLGAGPIVI GQACEFDYSG TQAIKALREE GFRIILVNSN PATIMTDPSL ADRTYIEPLD VPTVSEIIAR ERPDALLPTV GGQTALNLAM DLHRAGVLEQ YNVQLLGANV EAIATAEDRE LFKQAMDRIG LQSARSGIAR SLEEALELVK ATGFPSIIRP SFTLGGAGGG IAYNSEDFRR IVEEGLKLSP IGELLIEESL LGWKEYELEL MRDSAGNGVV VCSIENFDPM GVHTGDSITV APAMTLSDRE YQRLRDMGLA VMEVVGVATG GSNVQFAINP HDGRVIVIEM NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLPELPNDIT KSTPASFEPS LDYVVVKIPR WNFEKFPGSQ PVLGTAMKSV GEAMAMGRTF TEALQKALRS LENGRMGWGA DGSQPVAQER LRELLITPTP QRIFAMRTAF DLGWTVDQIH ELTKIDYWFL DQLASLIALE KDVRNYDLAT IPADLLLQAK RNGYSDPQLA YLLNATPAEV RQQRFEHNIK PTYHHVDTCA AEFPAQTPYL YSSYETESEA NPSDRRKVMI LGGGPNRIGQ GLEFDYCCCH AVFALRELGY ETIMVNCNPE TVSTDYDTAD RLYFEPLTLE DVLNIVDVEQ PAGVLLQFGG QTPLKLAKGL EAAGVPLWGT SPAAIDRAED RGQFGAVLAE CGLRATPYGS ATSWDEARAI AERIGYPTMV RPSYVLGGQG MAIIHDQAGL DDYLRHITEA SPEHPVLLDR FLEDATELDV DAVADGETVV IAGMMEQIER AGIHSGDSAC VMPTVGIKPQ VIEQLKIATD KLARALGVIG LMNVQFAVKD DEIYVLEVNP RASRSIPYVA KASGVAWAAL AAKVMAGVSL KQQGISSAPE LRGFHVKEVV LPWRRFPGAT IALGPEMRST GEAMGSGDSF GAAFAKAQAG CGRSLPTSGA IFVSVNDHDK PALVPVAKQY SELGFKVIAT AGTAQYLREH GLVVETIYKV NEGRPNAADY IINGQVDIIV NTPLGRASLF DEQAIRQTAL RQGVISITTV ASAAAVADAI QALRQGGLGV RALQD
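As a spot check that the protein sequence corresponds to the gene sequence, the first and last few codons can be translated by hand or with a short script. This is a minimal sketch, not the database's own method: the `translate` helper and the constant names are hypothetical, and it relies on translation table 11 sharing the standard codon-to-amino-acid assignments (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
# These assignments are the same for translation table 11 and the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
first_30 = "ATGCCAAAAC GCAGCGACAT TCACTCGATT"
last_9 = "CAAGATTAA"

print(translate(first_30))  # MPKRSDIHSI
print(translate(last_9))    # QD
```

The outputs match the first ten residues (MPKRSDIHSI) and the last two residues (QD) of the listed protein sequence, with TAA as the terminating stop codon.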