Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1903 |
Symbol | |
ID | 4570862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2211310 |
End bp | 2212200 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639766485 |
Product | Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase |
Protein accession | YP_912343 |
Protein GI | 119357699 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | [TIGR03381] N-carbamoylputrescine amidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00174651 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGG AAACAGTTAC CATCGCACTT CTTCAAACAA CATCGTCAGA AAGACCGGAA GAGAACCTCG CCGAAGCGGA TCGCCTTATC AGGAGCGCTG CTGCCGGCGG AGCACAGGTT ATCTGCCTGC AGGAGCTGTT CACCACACGG TACTTCTGTC AGATCGAGGA TTATGAACCC TTTGCTTACG CTGAACCTGT TCCCGGTCCG ACAACCCAGG CTTTGCAGGA ACTGGCACGC GAGCTTCAGG TCGTCATCGT CGCCTCGCTT TTTGAAGCGC GGGCCAGAGG TCTCTATCAT AATACCGCTG CGGTTATCGA TGCAGATGGC AGCTATCTCG GCAAGTACAG GAAAATGCAT ATTCCCGATG ATCCCGGGTT TTACGAGAAG TTTTATTTCA CCCCCGGAGA TCTCGGTTAC AAAGTTTTTA AAACCCGGTA TGCAACCATC GGCGTTCTGA TCTGCTGGGA TCAATGGTAC CCTGAAGCGG CAAGGCTGGT TGCGCTCAGG GGTGCCGAAA TCATTTTTTA TCCAACGGCC ATAGGCTGGG CAGCCAGCGA GATTTCCGAC GAGGTACGCC GAGCGCAACG GACAGCATGG AAAACCATGC AGCTCAGCCA TGCGGTTGCC AATGGCGTAT TTGTCGCTGC GGCCAACAGG GTTGGTACTG AAGGTGAGCT TGAGTTCTGG GGAAACAGCT TTGTCTCTGA TCCTTTTGGT CAGGTTATTG CCGAAGCTCC CCATCAGAAC GAAGCCGTTC TGCTTGCCCG GTGCGATCTC GGTCGTATCG GATACTACCG TTCGCACTGG CCTTTTTTGC GTGATCGTCG CATTGAATCC TACGGGGATG TGCAGAAGCG CTACATAGAT GCCGATAGCG GACAGGGTTA G
|
Protein sequence | MPSETVTIAL LQTTSSERPE ENLAEADRLI RSAAAGGAQV ICLQELFTTR YFCQIEDYEP FAYAEPVPGP TTQALQELAR ELQVVIVASL FEARARGLYH NTAAVIDADG SYLGKYRKMH IPDDPGFYEK FYFTPGDLGY KVFKTRYATI GVLICWDQWY PEAARLVALR GAEIIFYPTA IGWAASEISD EVRRAQRTAW KTMQLSHAVA NGVFVAAANR VGTEGELEFW GNSFVSDPFG QVIAEAPHQN EAVLLARCDL GRIGYYRSHW PFLRDRRIES YGDVQKRYID ADSGQG
|
| |