Gene Information |
Locus tag | Cag_1787 |
Symbol | |
ID | 3747207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2306406 |
End bp | 2308271 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637774325 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_380081 |
Protein GI | 78189743 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
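
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption, although it agrees with the listed Gene Length) and that the CDS ends in exactly one stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2306406, 2308271)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1866 621
```

Both results match the Gene Length (1866 bp) and Protein Length (621 aa) rows.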
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCACCA TTACGCCCGA CCAAACGGTT CTTACCCTTG CCAAAAAATT GTCTGCCGAG CAGCTTTTTG CCGCTAAGCA AAATGGCTTT TCTGACCTGC AGCTTGCCAC TATTTTTAAA ACATCCGATA CCGTTATTCG TGAGCTTCGT CGCCATTATG GCATTGCCTC CGTGTTTAAA ACGGTTGATA CTTGTGCGGC GGAGTTTGAT GCAAAAACCC CTTATCACTA TTCAACCTAT GAAGAGGAAA ATGAGTCGGT TTGCTCTGAT AGGAAAAAGG TGATTATTCT GGGTGGTGGA CCTAACCGCA TTGGGCAAGG TATTGAGTTT GACTACTGCT GTGTACAAGC GGTGTTTGCT TTGCGAGAAG CGGGTTACGA AACCATTATG GTTAATTGCA ATCCCGAAAC GGTTTCAACC GATTACGATA TTGCCGATAA ACTTTACTTT GAGCCGCTTA CCTTTGAGGA TACCATTCGC ATTATTGAGC ATGAAAAGCC ACTTGGTGTT ATTGTAAGCT TTGGCGGACA AACGCCACTC AAGCTCTCTA CCCGTTTGCA TGAAGCGGGT GTAAAAATTC TTGGCACCTC ATCTAAAGGC ATTGACTTAG CTGAGGATCG CAAAAAGTTT GGAGCCTTGC TTGTTGAGCT TGGTATTCCC CATCCAGCTT ACGGCACGGC TATTAGTTTG GAAGAGGCAA AAGCCATTAC CCAACGTATT GGCTATCCCG CCTTAGTTCG CCCCAGCTAT GTGCTTGGCG GACGTGCTAT GAAAATTGTC TATAACGACG ATTCGCTGAA GGAGTACATT GATCAAGCGC TCTTTATTTC CGAAAAATAT CCGCTCTTAA TTGATCGCTT TCTTGAAACT GCTGTGGAGT TTGATATTGA TGCCCTTGCC GATAGCACCG ATTGTGTGAT TAGCGGCATT ATGCAGCATG TAGAAGCAGC GGGCATTCAC AGTGGCGACT CCACCTCCAT TCTCCCTTAC CATAACATTA GCAAGCAGGC AATTGCTGCC ATGAAGGAGT ACACCCGAAT GCTTGCTAAA AGCTTGAATG TTATTGGGTT AATGAATGTG CAGTACGCTG TGCAAAACGA CACGGTGTAT GTTATTGAGG TGAACCCTCG TGCCAGCCGC ACCGTGCCAT TTGTGGGTAA AGCCACCGCT ATTCCGGTGG TAAAAATTGC TACCCGCGTT ATGCTTGGCG AAAAGCTCTG CGATTTGCGC AACGAGTACA ATTTAAAGGA TTGCGATGAA CTTGGCATGA AGCACATGGC AATTAAAGAG CCTGTTTTCC CCTTCTCGAA GTTTGTAAAA TCGGGCGTTT ATCTTGGTCC CGAAATGCGC TCTACGGGTG AAGCTATGAG TTTAGCGAAC GACTTCCCCG AAGCTTTTGC AAAAGCCTAT CAAGCCGCAA ATATGCAGCT TCCGCTTTCG GGCGCAGTGT TTATTAGTGT GAACGATCAA GATAAAAACC ATCGTATGCT TGCTATTGCT CGCTCGTTGT ACGATATGGA TTTTGATTTA GTGGCAACGG CTGGTACATG GCAGTTCCTT ACCGATAATG GTATTGAGTG CAAAAAAGTA TATAAAGTAG GTGAAGAGGG GCGTCCCAAT ATTTTTGACA GCATCAAACA CGGCAAAGTT GATTTTGTGA TTAATACGCC ACGCGGCGAA AAAGCACTGC ACGATGAAGA GGCAATTGGT GCGGCATCAG TGTTAAGCAA CGTGCCATTT GTAACCACCA TTGAGGCGGC TGAAGCCTCC GTGCAAGCAA TTGGCTGCAT TCGCCATCAA 
GAGTTTGGGG TAAAGAGCTT GCAAGAGTAT GCAGCGTATC GCGACACAGC TACCGCCACC TGTTAA
Protein sequence | MSTITPDQTV LTLAKKLSAE QLFAAKQNGF SDLQLATIFK TSDTVIRELR RHYGIASVFK TVDTCAAEFD AKTPYHYSTY EEENESVCSD RKKVIILGGG PNRIGQGIEF DYCCVQAVFA LREAGYETIM VNCNPETVST DYDIADKLYF EPLTFEDTIR IIEHEKPLGV IVSFGGQTPL KLSTRLHEAG VKILGTSSKG IDLAEDRKKF GALLVELGIP HPAYGTAISL EEAKAITQRI GYPALVRPSY VLGGRAMKIV YNDDSLKEYI DQALFISEKY PLLIDRFLET AVEFDIDALA DSTDCVISGI MQHVEAAGIH SGDSTSILPY HNISKQAIAA MKEYTRMLAK SLNVIGLMNV QYAVQNDTVY VIEVNPRASR TVPFVGKATA IPVVKIATRV MLGEKLCDLR NEYNLKDCDE LGMKHMAIKE PVFPFSKFVK SGVYLGPEMR STGEAMSLAN DFPEAFAKAY QAANMQLPLS GAVFISVNDQ DKNHRMLAIA RSLYDMDFDL VATAGTWQFL TDNGIECKKV YKVGEEGRPN IFDSIKHGKV DFVINTPRGE KALHDEEAIG AASVLSNVPF VTTIEAAEAS VQAIGCIRHQ EFGVKSLQEY AAYRDTATAT C
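
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. The sketch below is a minimal, dependency-free example, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as starts), and it assumes the gene sequence is shown in coding orientation, as the leading ATG and trailing TAA suggest even though the gene lies on the minus strand. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering used by NCBI.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bp of the gene sequence in this record.
print(translate("ATGAGCACCA TTACGCCCGA CCAAACGGTT"))  # MSTITPDQTV
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (`TGTTAA`) translate to the terminal `C` followed by a stop.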