Gene Gdia_0744 details


Gene Information

Locus tag: Gdia_0744
Symbol:
ID: 6974141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 847711
End bp: 848904
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 66%
IMG OID: 643390273
Product: transglutaminase domain protein
Protein accession: YP_002275149
Protein GI: 209542920
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
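
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(847711, 848904)
print(length, protein_length(length))  # prints "1194 397"
```

Under these assumptions the values agree with the listed Gene Length (1194 bp) and Protein Length (397 aa).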


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0804434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAT TACTGCAATT CGCGGACCGC TACAGGACCA TCGATGAATC TCGGATTCTG 
AAGGCGCTTC TGCTTTTGGG CTGGGCGTTC ACCGACGATA CCGCGCATGC CCTGGCCATG
ACGCGCCAGG CGCTGGACCG GTGGATCGGG TCCGGCCTGC AATTCCACCG GGATCGTCTG
GGACAGCGCC TGTTCGATCC GGTGGAGGTA GTACATTTCC TGAAATCGCG CGGGCGGCAG
GGACAGGATG ATTTCTGGTC CTCATGCTAC GTCCCGACCA GCCGGCTGCT GGTCGGGGAA
CTGGGCGGCC GCGACAGCCC GCATATGGAC ATCACCATCG GGCGCACGTT CGCCCGCGGT
GCGTTCGCCC CCGACAGGAC GCTGCGGCTA CGCATGCCGC TGCCGTTGCG ATCGCGATGC
GATTATCTGG ATGTCCGGCC CTGGCCCTGC GACGGGGGGA CGATCAGCAT CAGCGACGGC
CGCATGGATG TGCGGATCCG TCCGGACGGG CAAGGCGACA TCACGATCGG GGCGGATGTC
TTCCTCGCTC CCCTGCCGGA CGGCGGGCCC GAGGATGACG CCGATCGCGA GATCTTCCTT
CGTCCCAGCG AAGGCCTGAT CAGGATCACG GATCCGGTCG CGGCGCTTGC ACGCCGCCTG
GCCGGGACGG CACCGACGGA ACGGGCCGTG CGGGCCTTCT GGTCCTTCAT CATGGACGAA
CTGATCAACA GCCCGGTCCA TTACGATCAG ATCCGGGCCG ACGCCCCCCT GGACTGGGTC
CTGGAGGCCG GATGCTACGA CTGCCAGCTT GGCGCGGCGC TTCTGATCGG CCTGTGCCGC
GCGCGGGGTA TTCCCGCACG CCTGGTGGGC GGCCATTTCC TGTACCGGCA TTCGCCGACC
CTTCATTACT GGGCCGAAAT CTGGACGGAG GATGCGGGAT GGCGTCCGTT CGACTTCATG
AGCTGGGACC TGTCCCACGG CGGACAGGAT TCCGCCTGGC GCGACCATTT TTACGGGCGG
ACCGATGCCC GGATGATCAC GCAGTGCCTG CCGCGCCGCT GCGTGGGGCC CGTGGGCGTC
GCCATACCCG CCACCTGGCG CGTGCTGCAG ACCGCGCGCG GCAAGGGTGT GGATATCGAC
ATGGTCGGGC TGGATGGCGC ATCGATCTAC ACCGACCGGG TCACCGTCAT ATGA
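
The gene sequence is displayed in blocks of 10 bases separated by spaces, so it has to be cleaned before any computation such as length or GC content. The sketch below is one possible way to do this. The helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed on this page.
first_line = "ATGACCGAAT TACTGCAATT CGCGGACCGC TACAGGACCA TCGATGAATC TCGGATTCTG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full sequence through the same two functions gives a figure that can be compared with the GC content reported in the Gene Information section.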
 
Protein sequence
MTELLQFADR YRTIDESRIL KALLLLGWAF TDDTAHALAM TRQALDRWIG SGLQFHRDRL 
GQRLFDPVEV VHFLKSRGRQ GQDDFWSSCY VPTSRLLVGE LGGRDSPHMD ITIGRTFARG
AFAPDRTLRL RMPLPLRSRC DYLDVRPWPC DGGTISISDG RMDVRIRPDG QGDITIGADV
FLAPLPDGGP EDDADREIFL RPSEGLIRIT DPVAALARRL AGTAPTERAV RAFWSFIMDE
LINSPVHYDQ IRADAPLDWV LEAGCYDCQL GAALLIGLCR ARGIPARLVG GHFLYRHSPT
LHYWAEIWTE DAGWRPFDFM SWDLSHGGQD SAWRDHFYGR TDARMITQCL PRRCVGPVGV
AIPATWRVLQ TARGKGVDID MVGLDGASIY TDRVTVI
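
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a rough sketch of that step: table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons. The code below therefore uses the standard assignments and does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACCGAAT TACTGCAATT CGCGGACCGC TACAGGACCA TCGATGAATC TCGGATTCTG"
print(translate(first_line))  # prints "MTELLQFADRYRTIDESRIL"
```

The output matches the first 20 residues of the protein sequence shown above.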