Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2628 |
Symbol | |
ID | 5734506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3371691 |
End bp | 3373652 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279768 |
Product | carbamoyl-phosphate synthase L chain ATP-binding |
Protein accession | YP_001545394 |
Protein GI | 159899147 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.731698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCTACT TCGACTCTTT GTTAATTGCC AATCGGGGCG AAATTGCGGT GCGGGTAATT CGGGCTTGTC GCTCATTGGG CATCGAAGCA ATCGCGGTTT ATTCCGATGC CGACGCACGC GCCTTGCATG TGCGCGAGGC TGATCGCGCG ATTCGCATCG GCCCAGCTGC TGCCGCTCAA TCGTATTTGA ATATCGAAGC AGTGCTAGCC GCTGCCCAAC AAACTGGAGC ACAGGCGATT CACCCAGGCT ATGGCTTTCT CTCAGAAAAT GCCAATTTTG CCCGTGCCTG CCAATCAGCA GGCATTGCCT TCATTGGCCC GCCCGCCGAA GCAATCGAGG CCATGGGTTC GAAAAGCGCC GCCAAACGCC TCGCCGAATC GGTGGGCGTG CCAACCGTGC CAGGCTACAA CGGCGATGAT CAAAGCGAAC AACGCTTGTT GGCCGAAGCC GAGCGAATTG GCTTTCCATT GCTGATCAAG GCTTCAGCTG GTGGCGGCGG CAAAGGCATG CGCAGCGTGC ATCGGCTTGC TGAATTTTCA GCGGCCTTGG CAGCAGCGCA GCGCGAAGCC CTTGCCGCAT TTGGCGATCA GCATGTTTTG CTCGAAAAGC TGATTGAACG CCCACGCCAT ATTGAGTTTC AAATCCTCGG CGATCAACAT GGCACCATGC TGCATCTTGG CGAACGCGAG TGTTCAATTC AACGCCGCCA CCAAAAAGTG CTTGAAGAAT CGCCCTCAAT TGCATTAACT CCAGCATTAC GCGCTACGAT GGGCGCGGCG GCGGTGCGTT TGGCGCAAGC CGTCAATTAT TACAATGCTG GCACTCAAGA GTTTATGCTC GATGCCAATG GCGAGTTTTA TTTCCTCGAA ATGAATACCC GTTTGCAAGT TGAGCACCCC GTGACCGAGT TGGTGACAGG CTTGGATTTG GTGCAATTGC AAATTGCGAT CGCCGCTGGT CAGCGTTTGC CCTTTGCTCA AGCAGATATT CAGCAAACTG GCCATGCGAT TGAAGTGCGC TTGTATGCGG AAGATCCCGT GCAAATGTTG CCCTCAATCG GCCAACTCAG CAGCTACACG CCGCCCGAAG GCCCAGGCAT TCGACTGGAT ACTGGGGTCA CAGTTGGCGA TCATGTCACG ATCAATTATG ATCCGATGCT GGCCAAGTTG ATCGTGTGGG GCGAGCAGCG TGAGCAAGCG ATTGCCCGTT TGCGCTATGC CTTGCAGCAT TTCGAGGTTG CTGGCGTTAC CACTAACATC CCATTGTTGC AAGCGATTAT CAACACGCCA GCCTACCAAG CCGGAGCCAC AACCACCGAT TTTCTGCAAA CCTATGCAAT TAGCGAGCAA TTGCAACATC GTCAGTTGCC ACCTAATTTG GCTTTGGCAG CCAGAGCGTT GTTTGATTTA GAGCCTGATC CTGATGCAGC ATTAGTTGGC GACCCTTGGA ATGTGCCATG GCGGGCGGCC CACATGCCGC ATCAACTACG CTATCAAAGC AACGATCAAA CCGTGCCGAT CAAGGCGCAG CCGCAAGCGC CTCAAGCATG GCTGGTAACG ATCAACGACG AGCAACTTGA GATTGTGGTT TTGCGTCGCC ATCTCAATCG CATGGTTGTG CGGGTTGCCG ATCGAATTTA TCAATGCCAA CTTGAGCAGG ATAGCCTCGT TTGGCAAGGC GTTGGCTATC AGATTCAGCC TGCTGCCGCC CCCAGCCTTG ACCAAAATAA TGGTCACAAA GGCGATGCGA GCCTTGAAGC GCCCATGCCA GGCACAATTA TCAAACTGCT AGTAGCCGAA GGCGAGCATG TTAGTGCAGG CCAGCCATTG CTGATTATGG AAGCCATGAA GATGGAGCAC ACTGTGACCG CGCCCTACGC TGGCACTGTT GCCAAACTGC CCTACCGCCA AGGCCAACAA GTCAGCGGCG GGGTAGCGCT GGCCGAAATT ACCGCTGAGT AA
|
Protein sequence | MTYFDSLLIA NRGEIAVRVI RACRSLGIEA IAVYSDADAR ALHVREADRA IRIGPAAAAQ SYLNIEAVLA AAQQTGAQAI HPGYGFLSEN ANFARACQSA GIAFIGPPAE AIEAMGSKSA AKRLAESVGV PTVPGYNGDD QSEQRLLAEA ERIGFPLLIK ASAGGGGKGM RSVHRLAEFS AALAAAQREA LAAFGDQHVL LEKLIERPRH IEFQILGDQH GTMLHLGERE CSIQRRHQKV LEESPSIALT PALRATMGAA AVRLAQAVNY YNAGTQEFML DANGEFYFLE MNTRLQVEHP VTELVTGLDL VQLQIAIAAG QRLPFAQADI QQTGHAIEVR LYAEDPVQML PSIGQLSSYT PPEGPGIRLD TGVTVGDHVT INYDPMLAKL IVWGEQREQA IARLRYALQH FEVAGVTTNI PLLQAIINTP AYQAGATTTD FLQTYAISEQ LQHRQLPPNL ALAARALFDL EPDPDAALVG DPWNVPWRAA HMPHQLRYQS NDQTVPIKAQ PQAPQAWLVT INDEQLEIVV LRRHLNRMVV RVADRIYQCQ LEQDSLVWQG VGYQIQPAAA PSLDQNNGHK GDASLEAPMP GTIIKLLVAE GEHVSAGQPL LIMEAMKMEH TVTAPYAGTV AKLPYRQGQQ VSGGVALAEI TAE
|
| |