Gene GM21_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3887 
Symbol 
ID8139261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4474723 
End bp4476159 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content64% 
IMG OID644871504 
Producttransglutaminase domain protein 
Protein accessionYP_003023662 
Protein GI253702473 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value0.978849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTCC CATTAAACCT ACTGCGGCTG ATTATCGCCT CGCTGCTGGT GCTGCTGCCG 
CTCCAGCTTT TCGCAGCTCC CCTTCCCAAG CTGACCGAGC TGCCGCTCGG GGAGCGCTGG
TTCAGCATCA AGCTCGGCGA CGAGCGGGTC GGCTTCAGCC GGCTCACCAT CACCCGGACC
GATGCGGGCT ACCGCATCGA TTCCGAGGGG AGCGTGAAGA TGCGGGTGAT GGGGTTCTCC
AGGGAGGCCA CCTCCAAGGA GTCCTACCTG GTCGCCCGGG ACCTCGCGCT GCAATCATTC
TCGGCAGAGA ACCGCATCGA CGGCAGCCCC GTCACCACTA CCGGCGAGGT CACCCCTAAG
GGAGTGACAA TCGTCTCCCA GTCCGGCAGC GGGAAGAAGG AGCGGACCCT CAAGTTGAAG
GGGGCGCTTT ACCCCCCCCA CGCGCTGAAC CTTTACCCGC TCATGCAGGG TGCGACCAAG
GGTAAAAGCT ACAAGCTCCC GGTGCTGGAC GTGGAAGCGC TTAAGGTGAA GCAGCTCAAG
GTCGAGGTGG TCGGCGAGGA GACGCTTCCG CCGGGGCGGC AGGTGATCCA TCTGAAGAAC
GACGCCTACC CCATGGTCGA TAACGATATA TGGGTGGACC TCCAGGGGAA CGTGCTCAAG
GAGTCGGTCC GCGACGACCT GGTGGTCACC GAGGCCGAGG ACGAGGCGAC CGCCCGTGCG
GAACTTGCGC GCGACGCGCT CTGCAAGACG GACCTGGTGC TGGATTTCAG CATGATCCGG
GTCACCCCCC CTATCGAGCG TCCAGCCCAA TTGCAGAAGC TGGTACTGGA GATGACAGGC
ATCCCGTCGC AGCTGCCGCT TTTGCAGGGG ACAAGGCAGC AGGTGGTGCG CAGGGCCGAC
GGTACGGTGC TCGTCACCAT GCCCAACCCC GCCCTAGTCG CCGAAGCGCC CCCAACCGCC
GCCGATCTGG AACCGGCGGA GCGGATACCG AGCGACCACC CGGAGATAAA GGCGAAAGCC
GTGGAGATAA TGGGGAGCGA GCAGGATCCG GCTCAGGTAG CGAAGCTTCT GTCCGACTGG
GTGGCCCGCG AGATAAAGGG GGCAGTGACC GACAGCCAGT CCCCCCTGGA AACGCTCAAA
ACCCGCATCG GCAACTGCCA GAGCCACGCG AGGCTCTACG CTTCCCTCGC ACGCGCCTCC
GCAATCCCGA CCCGCTTCGT GTCCGGGATC GTCCACCAGG GGGAGGGCTT TCTCTACCAT
AGCTGGGCGG AAAGCTACTT GGGCGGCGCC TGGGTCCCCA TCGATCCCAC CTTCGGCGAG
ATGCCTGCCA ACCTGAGCCA CGTCAAGTTC GTCGATGGGG AAACGCTGGA CGAAATGGGG
TCGCTGGCGG GGATGATCGG GAAGGTACGG GCGAAAGTCG TGGAAAAGCG GTACTGA
 
Protein sequence
MTLPLNLLRL IIASLLVLLP LQLFAAPLPK LTELPLGERW FSIKLGDERV GFSRLTITRT 
DAGYRIDSEG SVKMRVMGFS REATSKESYL VARDLALQSF SAENRIDGSP VTTTGEVTPK
GVTIVSQSGS GKKERTLKLK GALYPPHALN LYPLMQGATK GKSYKLPVLD VEALKVKQLK
VEVVGEETLP PGRQVIHLKN DAYPMVDNDI WVDLQGNVLK ESVRDDLVVT EAEDEATARA
ELARDALCKT DLVLDFSMIR VTPPIERPAQ LQKLVLEMTG IPSQLPLLQG TRQQVVRRAD
GTVLVTMPNP ALVAEAPPTA ADLEPAERIP SDHPEIKAKA VEIMGSEQDP AQVAKLLSDW
VAREIKGAVT DSQSPLETLK TRIGNCQSHA RLYASLARAS AIPTRFVSGI VHQGEGFLYH
SWAESYLGGA WVPIDPTFGE MPANLSHVKF VDGETLDEMG SLAGMIGKVR AKVVEKRY