Gene Information |
Locus tag | Hhal_1254 |
Symbol | |
ID | 4710512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1360777 |
End bp | 1361937 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639855727 |
Product | peptidyl-arginine deiminase |
Protein accession | YP_001002831 |
Protein GI | 121998044 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
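The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1360777
end_bp = 1361937
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1161
print(expected_protein_aa)  # 386
```

Under these assumptions the computed values agree with the listed Gene Length (1161 bp) and Protein Length (386 aa).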
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.145775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGGGT ATAGTCCCGG TCGCCTTTTC GGGATCCGAC CCAACGCCAT GTCCGCGCCC GTTCACCCCG ATTGTCAGCA CCCACGCCTC CCCGCCGAAT GGGAAGCCCA GGCCGCGCTG ATGCTGACCT GGCCCCATGC CAGCGGCGAC TGGGGCGAGC ACCTCGCCGC CGCCGAGGCG TGTTTCGAGC GCCTCGCCGC AGCCGCGGCG CGCTACCAGC CTGTTCTCAT CGTCTGCCCC GACAGCCGCA CCAGCGCGCG GGTCCGCAAC CGCCTGCGCT CCGCCGGAGT CTCCCCCCAG CGCATGATCT TCACGGAGGC CCCCTCCAAC GACGTCTGGG CACGGGATCA CGGACCGATC ACCGTCCGGC GCGCCGGCGG GAGGGCCCAG CTGCTCGACT TCCGGTTCAA CGGCTGGGGC GAGCGCTATC CCGCCGATGA AGACGACCGT CTGACCCGCA GGCTCACTGA GGAGGGCGTG ATTGGGGGCG AGTCGTACCG GCGGATCGAG TGGATTCTTG AGGGCGGCAG CATCGACAGC GACGGCGCGG GCACCCTGCT GACCACGACC CGCTGCCTGC TGAACCCCAA CCGTAATACG GACACCGGCC GCGAAGAGGT TGAGGCCCAG CTACGCGCGC GCCTCGGCAT CCGCCGCGTC CTTTGGCTGG AGTCGGGCTG GCTGTGCGGC GACGATACCG ACGGTCACGT GGATATGCTG GCGCGATTCG TCGATCGGCG CACGATCGCC CACGCCGTCT GCGAAGACCC GGACGACCCC CATTACGCGC CGCTGCGGGC GCTGCGCGAG GAGCTGCAAG CCGCCCGCAC CCGCAACGGT GATCCCTACC GGCTGGTCGA GCTGCCGCTC CCGGCGCCGA TCCACGATGA AGACGGTAAT CGACTCCCGG CGACGTACGC CAACTTCGTC TTCGTCAACG GTGCCGTGCT GGTCCCGGTT TACGACGATC CTGCAGACGC CATCGCCTGC GCACGACTTG CCCAGGCCTG CCCAGGACGC GATATCGTTC GGGTACCGGC CCAGGATCTC ATTCGTCAGG GCGGGAGTGT GCACTGTGCC ACCATGCAAC TGCCAGCGGG CGTGATCATC GACGGCCTGG CCACAGGGGC GGAGGCCACG CGGAGGGTCC ACGAGGCATG A
|
Protein sequence | MIGYSPGRLF GIRPNAMSAP VHPDCQHPRL PAEWEAQAAL MLTWPHASGD WGEHLAAAEA CFERLAAAAA RYQPVLIVCP DSRTSARVRN RLRSAGVSPQ RMIFTEAPSN DVWARDHGPI TVRRAGGRAQ LLDFRFNGWG ERYPADEDDR LTRRLTEEGV IGGESYRRIE WILEGGSIDS DGAGTLLTTT RCLLNPNRNT DTGREEVEAQ LRARLGIRRV LWLESGWLCG DDTDGHVDML ARFVDRRTIA HAVCEDPDDP HYAPLRALRE ELQAARTRNG DPYRLVELPL PAPIHDEDGN RLPATYANFV FVNGAVLVPV YDDPADAIAC ARLAQACPGR DIVRVPAQDL IRQGGSVHCA TMQLPAGVII DGLATGAEAT RRVHEA
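The sequences above are displayed in space-separated blocks of ten residues, so the grouping has to be removed before lengths or base composition can be computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool. Only the first three blocks of the gene sequence are used as a short demonstration, so the printed GC value describes that 30-base prefix, not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and newlines used to group residues into blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in an ungrouped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence above (demonstration only).
prefix = clean_sequence("ATGATCGGGT ATAGTCCCGG TCGCCTTTTC")
print(len(prefix))                    # 30
print(round(gc_percent(prefix), 1))   # GC percentage of this prefix only
```

Passing the full Gene sequence field to the same two functions should give a length and GC percentage that can be compared with the Gene Length and GC content fields.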