Gene Hhal_1254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1254 
Symbol 
ID4710512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1360777 
End bp1361937 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID639855727 
Productpeptidyl-arginine deiminase 
Protein accessionYP_001002831 
Protein GI121998044 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.145775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGGGT ATAGTCCCGG TCGCCTTTTC GGGATCCGAC CCAACGCCAT GTCCGCGCCC 
GTTCACCCCG ATTGTCAGCA CCCACGCCTC CCCGCCGAAT GGGAAGCCCA GGCCGCGCTG
ATGCTGACCT GGCCCCATGC CAGCGGCGAC TGGGGCGAGC ACCTCGCCGC CGCCGAGGCG
TGTTTCGAGC GCCTCGCCGC AGCCGCGGCG CGCTACCAGC CTGTTCTCAT CGTCTGCCCC
GACAGCCGCA CCAGCGCGCG GGTCCGCAAC CGCCTGCGCT CCGCCGGAGT CTCCCCCCAG
CGCATGATCT TCACGGAGGC CCCCTCCAAC GACGTCTGGG CACGGGATCA CGGACCGATC
ACCGTCCGGC GCGCCGGCGG GAGGGCCCAG CTGCTCGACT TCCGGTTCAA CGGCTGGGGC
GAGCGCTATC CCGCCGATGA AGACGACCGT CTGACCCGCA GGCTCACTGA GGAGGGCGTG
ATTGGGGGCG AGTCGTACCG GCGGATCGAG TGGATTCTTG AGGGCGGCAG CATCGACAGC
GACGGCGCGG GCACCCTGCT GACCACGACC CGCTGCCTGC TGAACCCCAA CCGTAATACG
GACACCGGCC GCGAAGAGGT TGAGGCCCAG CTACGCGCGC GCCTCGGCAT CCGCCGCGTC
CTTTGGCTGG AGTCGGGCTG GCTGTGCGGC GACGATACCG ACGGTCACGT GGATATGCTG
GCGCGATTCG TCGATCGGCG CACGATCGCC CACGCCGTCT GCGAAGACCC GGACGACCCC
CATTACGCGC CGCTGCGGGC GCTGCGCGAG GAGCTGCAAG CCGCCCGCAC CCGCAACGGT
GATCCCTACC GGCTGGTCGA GCTGCCGCTC CCGGCGCCGA TCCACGATGA AGACGGTAAT
CGACTCCCGG CGACGTACGC CAACTTCGTC TTCGTCAACG GTGCCGTGCT GGTCCCGGTT
TACGACGATC CTGCAGACGC CATCGCCTGC GCACGACTTG CCCAGGCCTG CCCAGGACGC
GATATCGTTC GGGTACCGGC CCAGGATCTC ATTCGTCAGG GCGGGAGTGT GCACTGTGCC
ACCATGCAAC TGCCAGCGGG CGTGATCATC GACGGCCTGG CCACAGGGGC GGAGGCCACG
CGGAGGGTCC ACGAGGCATG A
 
Protein sequence
MIGYSPGRLF GIRPNAMSAP VHPDCQHPRL PAEWEAQAAL MLTWPHASGD WGEHLAAAEA 
CFERLAAAAA RYQPVLIVCP DSRTSARVRN RLRSAGVSPQ RMIFTEAPSN DVWARDHGPI
TVRRAGGRAQ LLDFRFNGWG ERYPADEDDR LTRRLTEEGV IGGESYRRIE WILEGGSIDS
DGAGTLLTTT RCLLNPNRNT DTGREEVEAQ LRARLGIRRV LWLESGWLCG DDTDGHVDML
ARFVDRRTIA HAVCEDPDDP HYAPLRALRE ELQAARTRNG DPYRLVELPL PAPIHDEDGN
RLPATYANFV FVNGAVLVPV YDDPADAIAC ARLAQACPGR DIVRVPAQDL IRQGGSVHCA
TMQLPAGVII DGLATGAEAT RRVHEA