Gene Gdia_3361 details


Gene Information

Locus tag: Gdia_3361
Symbol:
ID: 6976804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3681271
End bp: 3682635
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 61%
IMG OID: 643392874
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002277702
Protein GI: 209545473
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
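
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3681271
END_BP = 3682635


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1365 454
```

Under those assumptions the computed values match the Gene Length (1365 bp) and Protein Length (454 aa) fields.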


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0780323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.195256
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCATCGGG CCGGTCTGTT GCAGGAAGCG GCGGAGGCCT ACAGGCACGA ACTGGTCCAG 
ACGCCGGATG ATGCGCGTAC CCTGAGTAAC TATGGCGGGT TGCTCTGCAC CCTTGGTGAT
TTCCAGGAGG CGCATGACGT CTTGATTCGT GCGCTCAATC TGAAGCTGAC GCTGGTCGAT
GCCTGGTCCA ATTTCGGCAA CGCATTGCTG GCACTCCAGC GCTATGACGA CGCGATTGCC
GCGTACAAGG AGTGCCTGAC CCGTTATCCC CAGCATGTCC TGGCCCTCAG CAATCTCGGG
GTCGCGCTCG ACCGGCGTGG GGAGCATGCG CTGGCGCGGA ACTTCCATCG CGTGGCGGTC
AGGCTGGACC CGGAAAACGC GGAAAGCCAC GCCAACTACG CCATCTGCCT CCTAGCCCTG
GGCGAGTACC AGGAGGGGTT CGAGGAATAT GAATGGCGCT GGAAAACCCG GATGCTCGGT
CATCACCGCA TGACCGCCCC GTTGTGGGAG GGTGGAGATT TCACCGGACG CACCCTGCTG
ATTCATACGG AAGGTGGCTT CGGGGATATG CTTCAGTTTT CCCGCTTCAT TCCGTTTGCA
GCGCAATTCG GCGGGCGCAC CCTCGTTCGG GTCCGGAAGG AATTACTGTC CCTGTTTCGA
CTTTCGTTCC CGGACCAGAC ATTCATATCG ATCGACGACC CTGTTCCGCC TCACGATCTG
CAATGTCCGG CAGCCAGTCT TCCGCGTGCG TTAGGCACGA CTCTGGAAAC CATTCCGTCG
CCAGGGGGCT TTCTGAAGGC AGACCCGCAA AAGGTGGCGT TCTGGGCTGA AAAACTCGAG
GACGATCTGG AAAAAAGGGG CTTGTCTTCA GCGCCCCTTC GCGTCGGCCT CGTCTGGGCG
GGGGCGCCGC ATCGCGGCGT CCGCGAGGTC AACATCGCCG ACCAGCGCCG TTCCACCGAT
CTGGCGACGC TGGCATCGCT CGCCTCCGTG CCCGATACCC TGTTCTACAG CTTGCAGATC
GGGGAAAAAT CCGTTCAGGC CAAAACGCCC CCCGCCGGAA TGCAACTGAT TGACCATACG
CCCTTGCTTC ATGATTTCAG CGACACGGCT GCCTTCGTCA GCAATCTGGA CCTGGTCATC
GCGGTCGATA CGTCAACCGC ACATGTCGCC GCCGGACTGG GAAAGCCCGT CTGGATGCTG
TCCCGCTACG ATCAGTGCTG GCGATGGCTT TCCGGCCGCT CGGACTCGCC GTGGTATGAC
AGCCTGCGGA TCTACCAGCA AAGCCAGCCG CTTGACTGGT CGGCTCCCAT GCACCGTATC
ACGGCTGATC TGGCGAAATT TGCCCGGGCA TGGGGCCGAA CGTGA
 
Protein sequence
MHRAGLLQEA AEAYRHELVQ TPDDARTLSN YGGLLCTLGD FQEAHDVLIR ALNLKLTLVD 
AWSNFGNALL ALQRYDDAIA AYKECLTRYP QHVLALSNLG VALDRRGEHA LARNFHRVAV
RLDPENAESH ANYAICLLAL GEYQEGFEEY EWRWKTRMLG HHRMTAPLWE GGDFTGRTLL
IHTEGGFGDM LQFSRFIPFA AQFGGRTLVR VRKELLSLFR LSFPDQTFIS IDDPVPPHDL
QCPAASLPRA LGTTLETIPS PGGFLKADPQ KVAFWAEKLE DDLEKRGLSS APLRVGLVWA
GAPHRGVREV NIADQRRSTD LATLASLASV PDTLFYSLQI GEKSVQAKTP PAGMQLIDHT
PLLHDFSDTA AFVSNLDLVI AVDTSTAHVA AGLGKPVWML SRYDQCWRWL SGRSDSPWYD
SLRIYQQSQP LDWSAPMHRI TADLAKFARA WGRT
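
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC fraction calculation and a basic check that a sequence has the shape of a coding sequence. The helper names are hypothetical, and the start codon set is only a subset of the initiation codons allowed under translation table 11; it includes TTG, the first codon of this gene, which is read as methionine at the start of a protein.

```python
# Subset of table 11 initiation codons (assumption: enough for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3 with a known start and stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two display blocks of the gene sequence above.
first_blocks = clean_sequence("TTGCATCGGG CCGGTCTGTT")
print(first_blocks[:3] in START_CODONS)  # True
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` should give a value comparable to the GC content field, though the exact rounding used by the source database is not stated.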