Gene Information |
Locus tag | Cphamn1_0628 |
Symbol | |
ID | 6374292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 658456 |
End bp | 660312 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642683140 |
Product | Carbamoyl-phosphate synthetase large chain domain protein |
Protein accession | YP_001959067 |
Protein GI | 189499597 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
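
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 658456
END_BP = 660312
GENE_LENGTH_BP = 1857
PROTEIN_LENGTH_AA = 618


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(GENE_LENGTH_BP)

# Both comparisons are True if the record is internally consistent
# under the assumptions stated above.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

The strand field does not enter this check: under the stated assumption, the span length is the same for features on either strand.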
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAG CCGCTTCAGA CCTGTCAGAA GAGGTACTCA GCCTGACCAC CAAACTATCG AGAGAACGCC TTTTGAAGGC AAAGGGACAC GGTTTTTCCG ATTGCCAGCT TGCCTATATC TTCAATGTGT CCGATACATC CATACGGGAA CTCAGAAAAC ATTACCATCT TGATGCCGTC TTTAAAACCG TAGACACCTG TGCGGCGGAA TTCGACGCGA AAACCCCCTA TCACTACTCC ACCTACGACG AGGAGAACGA ATCGGTCAGT TCCGACAGGG AAAAAGTGAT TATTCTGGGC GGTGGTCCGA ACCGTATAGG ACAGGGTATT GAATTCGACT ATTGCTGCGT TCAGGCGGTT TTCGCCCTCA AGGAAGCAGG GTATGAATCC ATAATGATAA ACTGTAACCC TGAGACAGTT TCAACAGACT ACGACATTGC AGATAAGCTG TATTTCGAGT CACTTACTTT CGAAGACACC ATTCGTATTA TCGAACACGA AAAGCCTCGC GGTGTCATTG TGAGTTTCGG GGGACAGACT CCTCTCAAAC TCTCCACAAA ACTTGAAGAA GCCGGAGTGA CGATTCTCGG CACCTCGTCA AAAGGAATCG ACCTTGCCGA AGACAGAAAA AAGTTCGGCG CGCTTCTCCG CGAACTCGAT ATTCCGCATC CTGATTATGA TACAGCCGTT TCATTTGAAG AGGCACAGGA AATTACCAGC CGGATTGGCT ATCCTGTTCT GGTACGACCC AGCTATGTGC TCGGCGGAAG AGCGATGAAA ATCATTTACA GCGATGACTC GCTCAAGGAA TATGTCGATC AGGCCCTGTT TATCTCTGAA AAATTTCCCC TGCTGATCGA TCGTTTTCTT GAAACTGCAG TTGAGTTCGA CATCGACGCG CTTGCAGATT CTACAGACTG CATAGTAAGC GGCATGATGC AGCATGTCGA GGCAGCCGGA ATTCACAGTG GCGACTCGAC ATCTATTCTC CCCTATCACA ATATCGATCC GGGTGTCATA AAAACCATGA AGGAATACAC CCGCAGGCTT GCCGAAAGTC TGAAGGTAGT CGGGTTGATG AATGTTCAGT ATGCGGTTCA GAACAACAGT GTCTATGTCA TCGAGGTAAA CCCGAGAGCA AGCAGAACCG TACCTTTCGT TGGAAAAGCG ACAGCCATAC CTCTGGTCAA AATCGCAACG AGGGTAATGC TCGGAGAAAA ACTCTCTGAC TTGCGCAAGG AATACCGTCT GACTGATTGT GACGAACTCG GCATGCCTCA CCTTGCCATC AAGGAGCCGG TATTTCCTTT CTCAAAATTC CTTAAATCCG GAGTCTACCT CGGTCCGGAA ATGCGTTCGA CAGGCGAAGC CATGAGCCTT GCCTATGATT TTCCTGAAGC TTTTGCCAAG GCATATCAGG CGGCAAACAT GGAACTGCCG AAATCCGGAA CAGTATTTAT CAGCGTCAAC AATCATGACA AGGATGAAAG GATTATCAGG ATAGCGAACG AATTGTACCG GATTGGTTTC GATCTTGTGG GAACGGCAGG TACGCAGCAG TTTCTTGCGG ACAACGGTAT TGAATGCAAA AAAGTCTACA AGGTAGGAGA AGAAGGGCGT CCAAATGTAT TCGACACGAT CCGGCTCGGT AAAATCGACC TTGTGATCAA CACCCCGCTG GGAGAGCGAG CTCTGCATGA TGAGGAAGCT ATCGGCGCAG CATCGGTCAT GAACGGCGTT CCGTTTGTCA CGACCATTGA GGCCGCCGAA GCATCGGTTC AGGCGATCGC CTGTCTCCGA 
AAACAGGAGT TCGAGGTTAA AAGCCTTCAG GAATACGCAT CTTACCGGGA CATGTAG
|
Protein sequence | MTTAASDLSE EVLSLTTKLS RERLLKAKGH GFSDCQLAYI FNVSDTSIRE LRKHYHLDAV FKTVDTCAAE FDAKTPYHYS TYDEENESVS SDREKVIILG GGPNRIGQGI EFDYCCVQAV FALKEAGYES IMINCNPETV STDYDIADKL YFESLTFEDT IRIIEHEKPR GVIVSFGGQT PLKLSTKLEE AGVTILGTSS KGIDLAEDRK KFGALLRELD IPHPDYDTAV SFEEAQEITS RIGYPVLVRP SYVLGGRAMK IIYSDDSLKE YVDQALFISE KFPLLIDRFL ETAVEFDIDA LADSTDCIVS GMMQHVEAAG IHSGDSTSIL PYHNIDPGVI KTMKEYTRRL AESLKVVGLM NVQYAVQNNS VYVIEVNPRA SRTVPFVGKA TAIPLVKIAT RVMLGEKLSD LRKEYRLTDC DELGMPHLAI KEPVFPFSKF LKSGVYLGPE MRSTGEAMSL AYDFPEAFAK AYQAANMELP KSGTVFISVN NHDKDERIIR IANELYRIGF DLVGTAGTQQ FLADNGIECK KVYKVGEEGR PNVFDTIRLG KIDLVINTPL GERALHDEEA IGAASVMNGV PFVTTIEAAE ASVQAIACLR KQEFEVKSLQ EYASYRDM
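
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation, which could be used to check the reported GC content. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as a short demonstration input.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
# Substitute the full gene sequence to compare against the GC content field.
fragment = clean_sequence("ATGACCACAG CCGCTTCAGA CCTGTCAGAA")
fragment_gc = gc_fraction(fragment)

print(len(fragment), round(fragment_gc, 3))
print(fragment.startswith("ATG"))  # whether the fragment opens with ATG
```

Splitting on whitespace rather than on single spaces also handles a sequence that wraps across lines, as the gene sequence does in this record.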