Gene AnaeK_3494 details

Gene Information

Locus tag: AnaeK_3494
Symbol:
ID: 6786911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 3953792
End bp: 3955363
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 74%
IMG OID: 642764965
Product: transglutaminase domain protein
Protein accession: YP_002135836
Protein GI: 197123885
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
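
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3953792
end_bp = 3955363
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon, which is not
# counted as an amino acid in the reported protein length.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions the reported gene length and protein length both agree with the coordinates.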


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.764922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTCC CCCACCCGCA CAGGCTCGGC CTCACCGTCC TCGCGCTCGT CCTCGGCACC 
GCCGGCCTGA TGGCGTACAA GGTGCGCGCG CTCGGCTACC GGCTCGCGGA CATCCTGCCG
GTGCGCCAGT ACGAGGTCAC CTACGCGCTC GAGCTCGACG GCCACGGCGG CGACGTGCGC
GTCCGCAGCT TCCTGCCGTC GAGCGACGCG CACCAGACCA TCTCCGAGGA GCGCGACCAG
ACCTCCGGCC TGCACCTCTC GCAGTCGATG GATGGGCCGA ACCGGGTGGC CACCTGGAGC
GGCGCCGACG TGCCCAACGG CGCGCGCATC CGCCACGCGT TCAAGGTGCT CCCGCGCCGC
GTGTCCTACG ACCTGCCCGC CGGGCTCGAG GTGCCCGCCG CCTACCCACC CTCGGCGGCC
GCCTGGCTCC GGCCGGAGAA GGACATCCAG GTGGACGCGC CGGAGATCCG CGCCACGCTG
CAGCGCATCG GCGCCGATCA GGGCGGCGTG GTGGAGCGGC TCCGGCGCAT CCACGCGCTG
GCCGCCTCGC TGCAGCCGCG GCCGTTCAAG GGGACCACCG ACGCGCTCAC CGCGCTGCGC
CTGGGCGAGT CGAGCTGCAA CGGCAAGAGC CGGCTGTTCG TGGCGCTGGC CCGCGCGGGC
GGGATCCCGG CGCGGCTGGT GGGTGGCCTC ATCCTCGAGC CCGGCGCGAA GCGGACCTCG
CACCAGTGGG TGGAGGCCTG GGTGGCCGGG CACTGGGTGC CGTTCTGCCC GACGAACGGC
CACTTCGCCG AGCTGCCCGA GCGCTACCTC ACGCTCTACG TCGGCGACGA GGCGCTGTTC
CGCCACACCG CCGACGTGAA CTTCGACTAC CGCTTCGAGA CGCACGGCGC GCTGGTGCCG
TCGCCGCAGG CGAAGGCGAC GTTCACGCTG TTCGACGTGT GGGGGCTGTT CGACCGCCTG
CGGCTCCCGT TCGCGCTGCT CCGCACCGTG CTGATGCTGC CGGTGGGCGC GCTGCTGGTG
GTGCTGTTCC GGAACGTGGT GGGGATGCCG ACGTTCGGCA CCTTCCTGCC GGCGCTGCTC
GCCGCCTCGG CGGGCGAGAC CGGCGCCGGG TACGGCGTGC TGGCGGTGCT GCTGGTGGTG
GCGGCGGTCG CGGCGGTGCG CTGGGGGCTC ACCCGGCTCG AGCTGCTCCA CTCGCCCACG
CTCGCGATCC TGCTGGCGGC GGTGGTGCTG ACGCTGCTCA CCACCTCGAT GATCGCGGAG
CGCGCCGGCA TCGCGCAGCT CACCCGCGTC ACCATGTTCC CGATCGCCGT GCTCGCCATC
TGCGCCGAGC GCTTCTACCT GTCGCTCACC GAGCACGGGG CGCGCGCCGC CGGCAAGGAG
CTGGCCGGGA CGCTGGTGGT GATGCTGGCG TGCCACGCGG TGATGAGCTC GCTGGCGCTG
CAGGTGCTGG TGATCGGCTT CCCCGAGGTG CTGCTGCTGG TGGTGGCGGC GAACGTGTAC
CTGGGGCGCT GGGTGGGGAT GCGGCTCAGC GAGTACCGCC GCTTCCGCGG GCTGCTCGGG
GGCGCGGCGT GA
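
The gene sequence above is laid out in space-separated blocks of 10 bases. As a minimal sketch of how such a block layout could be read back into a single string and summarised, the example below uses two hypothetical helpers, `clean_sequence` and `gc_fraction` (neither comes from the source database). For brevity it is applied only to the first line of the gene sequence, so its GC fraction describes those 60 bases and not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied verbatim from this record.
first_line = (
    "ATGGCGCTCC CCCACCCGCA CAGGCTCGGC CTCACCGTCC TCGCGCTCGT CCTCGGCACC "
)

seq = clean_sequence(first_line)
gc = gc_fraction(seq)

print(len(seq))        # number of bases after removing the spaces
print(round(gc, 2))    # GC fraction of this 60-base excerpt only
```

Passing the complete gene sequence through the same two helpers would give a value that can be compared with the GC content field of the record.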
 
Protein sequence
MALPHPHRLG LTVLALVLGT AGLMAYKVRA LGYRLADILP VRQYEVTYAL ELDGHGGDVR 
VRSFLPSSDA HQTISEERDQ TSGLHLSQSM DGPNRVATWS GADVPNGARI RHAFKVLPRR
VSYDLPAGLE VPAAYPPSAA AWLRPEKDIQ VDAPEIRATL QRIGADQGGV VERLRRIHAL
AASLQPRPFK GTTDALTALR LGESSCNGKS RLFVALARAG GIPARLVGGL ILEPGAKRTS
HQWVEAWVAG HWVPFCPTNG HFAELPERYL TLYVGDEALF RHTADVNFDY RFETHGALVP
SPQAKATFTL FDVWGLFDRL RLPFALLRTV LMLPVGALLV VLFRNVVGMP TFGTFLPALL
AASAGETGAG YGVLAVLLVV AAVAAVRWGL TRLELLHSPT LAILLAAVVL TLLTTSMIAE
RAGIAQLTRV TMFPIAVLAI CAERFYLSLT EHGARAAGKE LAGTLVVMLA CHAVMSSLAL
QVLVIGFPEV LLLVVAANVY LGRWVGMRLS EYRRFRGLLG GAA