Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2349 |
Symbol | |
ID | 6376044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2518460 |
End bp | 2519560 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642684833 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_001960731 |
Protein GI | 189501261 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCA TACCCGCAAA ACTGGTTCTG GAAAACGGCT CCGTCTATAA AGGCGAAGCA TTTGGACATA TTGGCGAAGC CGCCGGAGAA GTGGTTTTCA ACACGTCGCT CACAGGATAT CAGGAAATTC TTACCGATCC GTCTTACGCA GGACAGATGG TGCTGATGAC CTATCCTCTT ATCGGAAATT ACGGTGTCAA CGAGACCGAT GAGGAATCGG GAAAAATCTG GGCTTCGGCG ATCATAGTCC GTGAAGCCTC ACATATATAC AGCAATTTCG CGGCGACTGA CAGTCTCGAC AACTACCTGA AAGAGTCGGA AGTCCTGGGT CTCGCAGGCA TCGACACCAG AAAACTGGTT CGTGAAATCA GGGAAAAAGG CGCGATGAGA GGAGTTATAT CAGCAATTGA CGCCGATGAG AAAAGTCTGC AGGAAAAAGC GATTGCCGTA CCTGAAATGA CCGGTCTCGA CCTTGTTCAA AAAGTCAGTA CGCCACAGAG CTATACAGCA GATTGCCCGG ACGCACAATA CCATGTCGTT GCCATGGATT TCGGCATCAA GAGAAACATT CTCAGAATGC TGCAGGATGC AGGATGCAGA GTCACCGTTC TGAACGCCGG CGCGACAGCA GATGATATCC GGGATCTGAA TCCCGACGGT CTTTTTCTTT CAAACGGGCC GGGAGATCCC TTTGCCGTAA CCTACGCAAT CGATACGATC AGAACCCTTA TCCGGGAGAA CGGCGATTCA GCTCCTTTGC CGATATTCGG AATCTGTCTT GGCCACCAGC TGCTTTCCCT GGCTTACGGA GCAAACACCT ACAAATTGAA GTTCGGACAT CACGGCAGCA ATCATCCTGT TAAAAATCTT TCAACCGGAT CAATCGAGAT AACATCCCAG AACCACGGAT TTGCCGTCGA GATGAGCTCA CTTCCGGAAG AACTTGAACT TACTCACCTC AACCTTTACG ACAACACTGT CGAAGGTGTG CGGCATCGTG AGTTGCCCTG TTTTTCCGTC CAATACCACC CCGAAGCGGC TCCGGGACCT CATGACTCAA ACTATCTTTT CAGTCTTTTC ACCGATATGA TGGCCGGATA G
|
Protein sequence | MQPIPAKLVL ENGSVYKGEA FGHIGEAAGE VVFNTSLTGY QEILTDPSYA GQMVLMTYPL IGNYGVNETD EESGKIWASA IIVREASHIY SNFAATDSLD NYLKESEVLG LAGIDTRKLV REIREKGAMR GVISAIDADE KSLQEKAIAV PEMTGLDLVQ KVSTPQSYTA DCPDAQYHVV AMDFGIKRNI LRMLQDAGCR VTVLNAGATA DDIRDLNPDG LFLSNGPGDP FAVTYAIDTI RTLIRENGDS APLPIFGICL GHQLLSLAYG ANTYKLKFGH HGSNHPVKNL STGSIEITSQ NHGFAVEMSS LPEELELTHL NLYDNTVEGV RHRELPCFSV QYHPEAAPGP HDSNYLFSLF TDMMAG
|
| |