Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1967 |
Symbol | |
ID | 6355471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 2182981 |
End bp | 2184837 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642669565 |
Product | Carbamoyl-phosphate synthetase large chain domain protein |
Protein accession | YP_001943978 |
Protein GI | 189347449 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000143423 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTC AAGTTTCCAG CCTTTCACAA GAGCTTTCCG GCCTTGCGAG CAAACTCCCT AAAAAAACGC TGATAAAGGC AAAAGAGCAT GGATTTTCCG ATTGTCAGCT TGCCAATATT TTTAAAACCA CGGAAACCGT CATACGAACA CTGAGAAAAC AGTACGGTGT GGAATCGGTA TTCAAAACCG TCGATACCTG CGCTGCCGAA TTCGACGCGA AAACCCCGTA CCATTACTCG ACGTACGATG AAGAGAACGA GTCTGTGCGT TCCGACAGGA AAAAAGTCAT TATCCTCGGA GGCGGCCCGA ACCGTATCGG TCAGGGCATA GAATTCGATT ATTGCTGCGT ACAGGCGGTT TTCGCTCTCC GCGAGGCCGG CTATGAGACC ATCATGGTCA ACTGCAACCC CGAAACGGTT TCGACCGACT ACGACATCGC CGACAAGCTC TATTTCGAGC CATTGACGTT TGAGGACACG ATCCGTATCA TCGAGCATGA ACAGCCGCTC GGTGTGATCG TCAGCTTCGG AGGTCAGACC CCCCTGAAGC TCTCGACAAA ACTGGACGAG GCCGGCGTTA CCATTCTCGG AACATCCTCG AAGGGCATCG ATCTTGCGGA GGACCGCAAG AAATTCGGCG CTCTGCTCGA AAAACTCGAC ATTCTCCATC CGGATTACGG CACCGCCATC TGTTTTGATG AAGCGCTCGC CATTACCGAA AGAATCGGGT ATCCGGTTCT GGTTCGACCA AGCTATGTGC TTGGCGGAAG AGCCATGAAA ATCATCTATA ACAAAGACTC TCTCAAGGAG TACGTCGATC AGGCGCTTTT CATTTCTGAA AAATATCCGC TGCTTATCGA CCGATTCCTT GAAACTGCCG TTGAGTTCGA CATCGATGCC ATTGCCGATA CTACCGACTG CGTTATCAGC GGCATCATGC AGCATGTGGA GGCGGCAGGC ATTCACAGCG GCGATTCAAC CTCGATCCTT CCCTATCGCA ATATCAGCCA GGAAGTGATC AATACCATGA AAGCCTATAC CAGGACGCTT GCCGAACATC TGAAGGTTGT CGGCCTCATG AACGTTCAGT ATGCCGTCCA GAACGAAAGC GTTTACGTGA TCGAAGTGAA TCCGAGAGCG AGCCGTACGG TGCCGTTCGT TGGCAAGGCC ACTGCGGTTC CGGTTGTAAA AATCGCAACG CGGGTGATGC TTGGCGAGAA ACTCAGCGAC CTTCGCAAAG AGTACGATCT GAAGGATTGC GACGAACTCG GCATGAAGCA TATGGCCATA AAGGAGCCGG TATTTCCATT CTCGAAGTTC GTTAAATCAG GCGTTTACCT CGGCCCGGAA ATGCGCTCCA CCGGCGAAGC CATGAGCCTT GCAGAACAGT TTCCGGAGGC TTTCGCCAAA GCGTATCAGG CTGCGAACAT GGAACTTCCG CTTTCAGGGT CGGTCTTTAT CAGCGTAAAC GATCAGGACA AAAGCCAGCG CATTATCGCG ATTGCCAAAG AGCTTTACCG CATGGATTTC GATCTTGTCG CCACGGCCGG AACCCACCGT TTCCTTATCG AAAACGGAAT AGAGTGCAAA AAAGTCTTCA AGGTAGGCGA AGAGGGGCGT CCGAACATTT TCGACATCAT CAAACACGGC AAGATCGATT TTGTCATCAA CACACCCAGG GGGGAAAAGG CGCTGCATGA CGAGGAGGCT ATCGGCGCGG CATCGGTACT GAGCAACGTG CCGTTCGTCA CCACCATCGA GGCCGCCGAA GCATCGGTTC AGGCTATCGA CTGCATCCGG CGCCAGGAAT TCGGTGTCAA GAGTCTGCAG GAGTATTCGG CATATCGAAA CAAGTGA
|
Protein sequence | MTTQVSSLSQ ELSGLASKLP KKTLIKAKEH GFSDCQLANI FKTTETVIRT LRKQYGVESV FKTVDTCAAE FDAKTPYHYS TYDEENESVR SDRKKVIILG GGPNRIGQGI EFDYCCVQAV FALREAGYET IMVNCNPETV STDYDIADKL YFEPLTFEDT IRIIEHEQPL GVIVSFGGQT PLKLSTKLDE AGVTILGTSS KGIDLAEDRK KFGALLEKLD ILHPDYGTAI CFDEALAITE RIGYPVLVRP SYVLGGRAMK IIYNKDSLKE YVDQALFISE KYPLLIDRFL ETAVEFDIDA IADTTDCVIS GIMQHVEAAG IHSGDSTSIL PYRNISQEVI NTMKAYTRTL AEHLKVVGLM NVQYAVQNES VYVIEVNPRA SRTVPFVGKA TAVPVVKIAT RVMLGEKLSD LRKEYDLKDC DELGMKHMAI KEPVFPFSKF VKSGVYLGPE MRSTGEAMSL AEQFPEAFAK AYQAANMELP LSGSVFISVN DQDKSQRIIA IAKELYRMDF DLVATAGTHR FLIENGIECK KVFKVGEEGR PNIFDIIKHG KIDFVINTPR GEKALHDEEA IGAASVLSNV PFVTTIEAAE ASVQAIDCIR RQEFGVKSLQ EYSAYRNK
|
| |