Gene Information |
Locus tag | Cpha266_1901 |
Symbol | |
ID | 4570860 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2207738 |
End bp | 2208796 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639766483 |
Product | peptidyl-arginine deiminase |
Protein accession | YP_912341 |
Protein GI | 119357697 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.459054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATT TCCAGTATCG CATGCCCCCG GAGTGGGCTT GTCACAAAGC CACCTGGCTT TCCTGGCCGC ACAAACGTGA ATCCTGGCCA GGAAAATTTG AACCGGTTCC GGCAGTTTTT GTCGAGATTG CTTCATGGCT GAGTTCATCG GAAGAGGTAC ATATCAATGT GCTTGACGAA GCGATGGAAG TGGAGGTGCG CGAACTGTTC CGAAAGGCAG AGTACGATCA TTTGCGAAGG GATCGACTTG TGCTGCACCG CATTCCAACC AATGACGCCT GGTGTCGCGA CCACGGCCCG AACTATGTAT TCCGCCAGGG CGGCACCGGC CTGGAGAAGG TGATTCTGAA CTGGCAGTTC AATGCCTGGG GAGGAAAATA TGAGCCCTGT GATGATGACA ATGCAGTGCC TGTGCGGATT GCGGAACAGC AGCATTTGCC TCTGGTTTCG ATCGACATGG TGCTTGAAGG TGGAGCAATC GATGTGAACG GGAACGGTTT GCTGCTGACG ACCGAAGCCT GTTTATTGAA CAGGAACCGC AATCCCGGAA TGAGTCGTCT TGAGATCGAA GGCGCGCTTG GCAGGTATCT CGGTATTGAA AAGGTACTCT GGCTCGGCGA CGGTATAGCA GGGGACGATA CCGATGGTCA TGTTGATGAT ATGGCGAGGT TTGTCAATGA ATCAACCGTC GTGATAGCCG TTGAAGACGA TGCTGCTGAT GAGAATTATG AACCGCTTCA GGATAACTAT CGACTGCTCA AGAGTTTTAC CGATCTCAGG GGAGAGCCTC TCAACGTGGT GAAGCTGCCC ATGCCGGATC CGGTTTACTA CGATGGCGAA CGTCTTCCTG CAAGCTATGC GAACTTCTAT ATTGCCAACA GTGTTGTGCT TGTACCTCTA TACCGGTGCG ACGCCGACAA AAAGGCGCTT GCTGTTCTGC AGGAGTGTTT TCCCGGAAGA AAGGTTGTGG GTATCGACTG TTCAGACCTT ATATGGGGAT TGGGAGCCAT TCACTGTATC ACCCACGAAG AGCCGGATCT GCCGATACGG GAGAGGTGA
Protein sequence | MMDFQYRMPP EWACHKATWL SWPHKRESWP GKFEPVPAVF VEIASWLSSS EEVHINVLDE AMEVEVRELF RKAEYDHLRR DRLVLHRIPT NDAWCRDHGP NYVFRQGGTG LEKVILNWQF NAWGGKYEPC DDDNAVPVRI AEQQHLPLVS IDMVLEGGAI DVNGNGLLLT TEACLLNRNR NPGMSRLEIE GALGRYLGIE KVLWLGDGIA GDDTDGHVDD MARFVNESTV VIAVEDDAAD ENYEPLQDNY RLLKSFTDLR GEPLNVVKLP MPDPVYYDGE RLPASYANFY IANSVVLVPL YRCDADKKAL AVLQECFPGR KVVGIDCSDL IWGLGAIHCI THEEPDLPIR ER