Gene Anae109_3474 details


Gene Information

Locus tag: Anae109_3474
Symbol:
ID: 5377655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4079217
End bp: 4080803
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 72%
IMG OID: 640844999
Product: transglutaminase domain-containing protein
Protein accession: YP_001380642
Protein GI: 153006317
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
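
The coordinate and length fields in this record are arithmetically related. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4079217, 4080803)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.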


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.934792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0999002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCG CCCACCACCA CAAGGTCGCC GCTACCGTCG TCGCGCTCGT CGCCACGACC 
GCCGCGCTGA TGCTCTACAA GGTCCACGCG CTCGGGTACT CGCTCGCGGA CATCCTGCCG
GTCCGGCAGT ACGAGGTCAC CTACGCGCTC CGGCTCGACG GACACGGCGG CGACGTCCGC
GTGCGGACCT TCCTGCCCGC GAGCGACGCG CGCCAGACGA TCTCCGACGA GCGCACCGAG
TCGGCGGGGT TCCACCTGTC GCAGGTGCTC GACGGCCCGA ACCGGGTGGC GACCTGGGTC
GGCGCGCAGA TCCCGGACGG CGCCGAGATC CGGCACACCG TGCGGATCGT CCCGCGGCGG
GTCGCCTACG CGATCCCCGC CGATCTCCCG GTGCCCACCT CCTACCCGGC TTCGACGGCG
CCGGCGCTCC GGCCGGAGAA GGAGATCCAG GTCGACGCGC CCGAGATCGC GGCCGCCCTC
GCGAGCATCG GGGCGGACCG CGGGAGCGTC ATGGAGCGGC TGCGGCGAAT CCATGACTAC
ACCCATGGGC TGACGACGCG GCCGTTCAAG GGAACCACCG ACGCGCTCAC CGCGCTGCGC
CTCGGCGAGG CGAGCTGCAA CGGGAAGAGC CGCCTCTTCG TGGCGCTCGC GCGCGCGACC
GGCATCCCGG CGCGCCTCGT GGGCGGCCTC ATCCTCGAGA CCGGCTCGAA GCGCACCTCT
CACCAGTGGG TCGAGGCCTA CGTGTCCGGC CACTGGATCC CGTTCTGCCC CACGAACGAT
CACTTCGGCG AGCTCCCCGA GCGCTACCTC TCCCTCTACT ACGGCGACGA GGTCCTGTTC
CGGCACACGG CCGACGTGAA CTTCGACTAC CGGTTCGACG CCCGCTCGCA GCTCGTCCCC
TCGCCCCGCG CCAAGGCGTC CTTCACGTTC CTCGACGTGT GGGGCCTGTT CGATCGCCTG
AAGCTGCCCT TCAGCCTGCT GCGGACGATC CTGATGCTCC CCATCGGCGC GCTCCTCACG
GTACTCTTCC GGAACGTCGT CGGCATGCCG ACGTTCGGCA CGTTCCTGCC CGCGCTGCTC
GCCGCCGCGG CGGGCGAGAC CGGCGCCGGG TTCGGCGTCC TCGCGGTGCT CATCGTGGTC
GCGGCGGTCG CGACCGCTCG CTGGGCCGTG TCGCGGCTCG AGCTCCTCCA CTCGCCCACG
CTCGCGATCC TGCTCTGCGC GGTGGTCGTC ACCCTCGTCG GCACCTCGAT GCTCGCCGAG
CGGCTCGGGA TCTCCGGGCT CACCCACGTG ACGCTGTTCC CGCTCGCGGT GCTCGCCATC
TGCGCCGAGC GCTTCTACCT CTCGCTCACC GAGCACGGGG CGCGCGCGGC CGGCAAGGAG
CTCGCGGGCA CGCTCGTCGT GATGCTCGCC TGCTACGTGG TGATGAACTC GCTCGCGCTG
CAGGTGCTCG TCATCGGGTT CCCCGAGGTG CTGCTGCTCG CGGTCGCCGC GAACGTGTAC
CTCGGGCGCT GGGTGGGGAT GCGGCTCAGC GAGTACCGGC GGTTCCGCGG GCTGCTCGGG
GCCGCCTCGC CCGGAGGCTC GCCGTGA
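
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a rough sketch, a GC percentage like the one in the GC content field could be computed as follows. The helper names are hypothetical, and the example uses only the first 10-base block of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_block = "ATGGCCGTCG"      # first 10 bases of the gene sequence
print(gc_percent(first_block))  # 70.0
```

Passing the complete gene sequence text to `gc_percent` gives the value for the whole gene.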
 
Protein sequence
MAVAHHHKVA ATVVALVATT AALMLYKVHA LGYSLADILP VRQYEVTYAL RLDGHGGDVR 
VRTFLPASDA RQTISDERTE SAGFHLSQVL DGPNRVATWV GAQIPDGAEI RHTVRIVPRR
VAYAIPADLP VPTSYPASTA PALRPEKEIQ VDAPEIAAAL ASIGADRGSV MERLRRIHDY
THGLTTRPFK GTTDALTALR LGEASCNGKS RLFVALARAT GIPARLVGGL ILETGSKRTS
HQWVEAYVSG HWIPFCPTND HFGELPERYL SLYYGDEVLF RHTADVNFDY RFDARSQLVP
SPRAKASFTF LDVWGLFDRL KLPFSLLRTI LMLPIGALLT VLFRNVVGMP TFGTFLPALL
AAAAGETGAG FGVLAVLIVV AAVATARWAV SRLELLHSPT LAILLCAVVV TLVGTSMLAE
RLGISGLTHV TLFPLAVLAI CAERFYLSLT EHGARAAGKE LAGTLVVMLA CYVVMNSLAL
QVLVIGFPEV LLLAVAANVY LGRWVGMRLS EYRRFRGLLG AASPGGSP
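
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds a codon table from the standard NCBI amino acid ordering (bases in the order T, C, A, G), which table 11 shares with the standard code for amino acid assignments. It is a simplified illustration: it does not handle alternative start codons or ambiguous bases, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, dropping one trailing stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence.
print(translate_cds("ATGGCCGTCG CCCACCACCA CAAGGTCGCC"))  # MAVAHHHKVA
```

The output matches the first 10 residues of the protein sequence, and the last 27 bases of the gene translate to its final residues, AASPGGSP, followed by the TGA stop codon.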