Gene AnaeK_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_4199 
Symbol 
ID6784214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp4729910 
End bp4731457 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content76% 
IMG OID642765666 
Producttransglutaminase domain protein 
Protein accessionYP_002136531 
Protein GI197124580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC CGACCTCCGC GCGCCTCGCC CTCGTCGCCC TCTGCTGCTC GACCTCCCTC 
GCGCTCGCCG CCTGCCCCGA GCAGCGGCCG CAGCCGGTGA AGGCGCCGCC GCGCCCGCCG
CAGGCGGCGC TGGCCGCCGG GTCCGGCGAC CTCGCCGACG TGCTCACGGT GCCGCGGCCG
GTGGGGCCGG AGTGGTTCGG CCTGTACCTG GTCGGCCAGA AGGCGGGCTG GAGCAAGGTC
GAGCTGAGCC GCGAGCTGCG CGACGGGCGC GACGTGCTGG TCGGGCGGAG CGAGATGCTG
CTGCGCGTGA ACGTGGGCGG CAACACCGTG GAGCGGCGCC AGAGCGAGGA GCGCGTCTGG
GAGGCGCGCG CGGCCGGCCG GCTGGTGGGG TTCAAGGCGG CGTTCTCCGG CGACGGCGGC
GAGCGGACGC TCACCGGCAC CTGCGCGAAG GACCGCTGCA AGCTCACCGT CACCGCCGCC
GACGGCACGC GCGAGCAGGA GCTGGAGGGC GTGGCCGAGA CCGCCGAGAT GGCGGACGGC
GTCCGGCTCG CGGCCGCGCG GCGCAGCACC GTCCGCGGCA AGCAGCTCGA CCTGCTCAAG
CTGCGGGTCC GCGAGGTGCA GCACGTGTTC GTCCGCCGGG AGCCGGTGGC CGGCGCGGGC
GTGCAGGAGG AGGTCTCGGT CGTCGAGGAG TCGGAGATCG GCGACCGCGT GGCCATCCAG
TACAAGGTGG CGGACGACGG ACGGATCGTG GAGTGGCACC TCGGCGACGC GATCGTGGGC
CGTCCCGAGC CGTCGGATCG CGCGCAGCGG CTCGACGAGG TGGACCTGTT CGCGCTCGGC
CGCGTGCCGC TGCCGAAGCC GCTGCCGCGC ACCGTGCCCG CCACCATCAC CTACCGCCTG
CGGGGGCTCC CGGCCGCGTT CCAGAAGGCG GACCAGCGCC AGCGGTACGA GCGCGGCCCG
GAGGGCACCA CGCTCCTCAC CATCACCGCG AAGCCGCCCG CCGCGGCGGA GCCGGCCCGG
GACACGCCGC TCGCGCGGGC GGGGGAGGGG GCGGGCCGCG ACGACCTCGC CGCCACGCCG
CAGGTGGACT CCGACGCGCC CGCCATCGCG GCGCTGGCGA AGCAGGTGGC GGGCGACGCG
CGCGGCACGT ACCAGGCCGC GCTCGCGCTG GCGCGCTGGG TGAACGAGCA CCTCGAGAAG
GCGTACGGGG CGAGCAACGA CCGGGCGAGC GACGTGCTCG CGGCGCGGAA GGGCGACTGC
ACCGAGCACG CGGTGCTGAC GGTGGCGCTG GCGCGCGCGC TGGGGATCCC CTCGCGGCAG
GTCTACGGCC TCGTCTACGC GCGCTACGCC GACGGCAAGG ACGCGCTCTA CTGGCACGCC
TGGGCGGAGG TGCGGAGCGC CGGCGAGTGG ATCGCCATCG ACCCGATCTT CGGGCAGCCG
GTGGCGGACG CGACCCACGT GGCGCTCGGC ACCGACAAGC AGGAGGACGC GGTGGGCCTG
CTCGGCGCGC TCAAGGTGGA GAAGGTGGAC GTGAAGGGCG CGAAGTAG
 
Protein sequence
MTTPTSARLA LVALCCSTSL ALAACPEQRP QPVKAPPRPP QAALAAGSGD LADVLTVPRP 
VGPEWFGLYL VGQKAGWSKV ELSRELRDGR DVLVGRSEML LRVNVGGNTV ERRQSEERVW
EARAAGRLVG FKAAFSGDGG ERTLTGTCAK DRCKLTVTAA DGTREQELEG VAETAEMADG
VRLAAARRST VRGKQLDLLK LRVREVQHVF VRREPVAGAG VQEEVSVVEE SEIGDRVAIQ
YKVADDGRIV EWHLGDAIVG RPEPSDRAQR LDEVDLFALG RVPLPKPLPR TVPATITYRL
RGLPAAFQKA DQRQRYERGP EGTTLLTITA KPPAAAEPAR DTPLARAGEG AGRDDLAATP
QVDSDAPAIA ALAKQVAGDA RGTYQAALAL ARWVNEHLEK AYGASNDRAS DVLAARKGDC
TEHAVLTVAL ARALGIPSRQ VYGLVYARYA DGKDALYWHA WAEVRSAGEW IAIDPIFGQP
VADATHVALG TDKQEDAVGL LGALKVEKVD VKGAK