Gene A2cp1_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_3562 
Symbol 
ID7299612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp3979770 
End bp3981341 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content74% 
IMG OID643596375 
Producttransglutaminase domain protein 
Protein accessionYP_002493958 
Protein GI220918654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.209631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTCC CCCACCCGCA CAGGCTCGGC CTCACCGTCC TCGCGCTCGT CCTCGGCACC 
GCCGGCCTGA TGGCGTACAA GGTGCGCGCG CTCGGCTACC GGCTCGCGGA CATCCTGCCG
GTGCGCCAGT ACGAGGTCAC CTACGCGCTC GAGCTCGACG GCCACGGCGG CGACGTGCGC
GTCCGCAGCT TCCTCCCGTC GAGCGACGCG CACCAGACCA TCTCCGAGGA GCGCGACCAG
ACCTCCGGCC TGCACCTCTC GCAGTCGATG GAGGGGCCGA ACCGGGTGGC CACCTGGAGC
GGCGCCGACG TCCCCAACGG CGCGCGCATC CGCCACGCGT TCAAGGTGCT CCCGCGCCGC
GTGTCGTACG ACCTGCCCGC CGGGCTCGAG GTGCCCGCCG CCTACCCGCC CTCGGCGGCC
GCCTGGCTGC GGCCGGAGAA GGACATCCAG GTGGACGCGC CGGAGATCCG CGCCACGCTG
CAGCGCATCG GCGCCGATCA GGGCGGCGTG GTGGAGCGGC TCCGGCGCAT CCACGCGCTG
GCCGCCTCGC TGCAGCCGCG GCCGTTCAAG GGGACCACCG ACGCGCTCAC CGCGCTGCGC
CTGGGCGAGT CGAGCTGCAA CGGCAAGAGC CGGCTGTTCG TGGCGCTGGC CCGCGCGGGC
GGGATCCCGG CGCGGCTGGT GGGTGGCCTC ATCCTCGAGC CCGGCGCGAA GCGGACCTCG
CACCAGTGGG TGGAGGCCTG GGTGGCCGGG CACTGGGTGC CGTTCTGCCC GACGAACGGC
CACTTCGCCG AGCTGCCCGA GCGCTACCTC ACGCTCTACG TCGGCGACGA GGCGCTGTTC
CGCCACACCG CCGACGTGAA CTTCGACTAC CGCTTCGAGA CGCACGGCGC GCTGGTGCCG
TCGCCGCAGG CGAAGGCGAC GTTCACGCTG TTCGACGTGT GGGGGCTGTT CGACCGCCTG
CGGCTCCCGT TCGCGCTGCT CCGCACCGTG CTGATGCTGC CGGTGGGCGC GCTGCTGGTG
GTGCTGTTCC GGAACGTGGT GGGGATGCCG ACGTTCGGCA CCTTCCTGCC GGCGCTGCTC
GCCGCCTCGG CGGGCGAGAC CGGCGCCGGG TACGGCGTGC TGGCGGTGCT GCTGGTGGTG
GCCGCGGTCG CGGCGGTGCG CTGGGGGCTC ACCCGGCTCG AGCTGCTCCA CTCGCCCACG
CTCGCGATCC TGCTGGCGGC GGTGGTGCTG ACGCTGCTCA CCACCTCGAT GATCGCGGAG
CGCGCCGGGA TCGCGCAGCT CACCCGCGTC ACCATGTTCC CGATCGCCGT GCTCGCCATC
TGCGCCGAGC GCTTCTACCT GTCCCTCACC GAGCACGGGG CGCGCGCCGC CGGCAAGGAG
CTGGCCGGGA CGCTGGTGGT GATGCTGGCG TGCCACGCGG TGATGAGCTC GCTGGCGCTG
CAGGTGCTGG TGATCGGCTT CCCCGAGGTG CTGCTGCTGG TGGTGGCGGC GAACGTGTAC
CTGGGGCGCT GGGTGGGGAT GCGGCTCAGC GAGTACCGCC GCTTCCGCGG GCTGCTCGGG
GGCGCGGCGT GA
 
Protein sequence
MALPHPHRLG LTVLALVLGT AGLMAYKVRA LGYRLADILP VRQYEVTYAL ELDGHGGDVR 
VRSFLPSSDA HQTISEERDQ TSGLHLSQSM EGPNRVATWS GADVPNGARI RHAFKVLPRR
VSYDLPAGLE VPAAYPPSAA AWLRPEKDIQ VDAPEIRATL QRIGADQGGV VERLRRIHAL
AASLQPRPFK GTTDALTALR LGESSCNGKS RLFVALARAG GIPARLVGGL ILEPGAKRTS
HQWVEAWVAG HWVPFCPTNG HFAELPERYL TLYVGDEALF RHTADVNFDY RFETHGALVP
SPQAKATFTL FDVWGLFDRL RLPFALLRTV LMLPVGALLV VLFRNVVGMP TFGTFLPALL
AASAGETGAG YGVLAVLLVV AAVAAVRWGL TRLELLHSPT LAILLAAVVL TLLTTSMIAE
RAGIAQLTRV TMFPIAVLAI CAERFYLSLT EHGARAAGKE LAGTLVVMLA CHAVMSSLAL
QVLVIGFPEV LLLVVAANVY LGRWVGMRLS EYRRFRGLLG GAA