Gene Gdia_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0304 
Symbol 
ID6973696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp336501 
End bp338441 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content73% 
IMG OID643389835 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_002274716 
Protein GI209542487 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.860081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.111747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAC GGCAGGCGGC CCCCGACGGG CCGGACAGCG CGGCACGGGC CATGGCGCTG 
CTTGAAGCCG GGGACCGCGA CGGGGCGCTG GCGCTGCTTC ACGCCGCCCT GTCCGCGCGG
CCCGCGTGCC GGGATGCCGA CGTCCTGCAC GGCATGGCCT GCGTCGCCCG TGCCGCCGGG
CGGCCGGACC TTGCGATCGG GCTGGCGGGC AAGGCCGTCG CGCTGCTGCC GGCGGCGCAT
TTTCACATCA CCCTGGGCTG CGCCCTGCGC GAGCAGGGCC ATGTGGAGGA GGCCCGCGCC
GCGCTGGCGG TGGCCGTACT GCGCGAACCG CGCGACGCGC GCGCCCACGC CGCCCTGGCC
GGCGCGCTGG GTGAACTGGG CCGCTGGGCG GAGGCCGAGG CCAGCCTGCG CGCGGCGCGG
GCCCTGTGTC CCGGTGACAT GGCCCTGCTA CTGGAATGGG CCCGCGCGTG CATCCATGGC
GGGGACCACG CCGCGGCGAC GGCGGAGATC GTGGCCGGGG CGGACCGCTT TGCGCCCGAC
CATGCCGGCG CCCTGCACGG GCTGGCCACC CTACTGGCGG ATCGGGGCCA GCCCGCGGGG
GCCGAGGCGC TGTATCGGCA GATCGTCCGG CTTCTCCCCG ATGACGGGGC GGCCTGGGCC
AATCATGGCG CCGCGCTGTT CGCGCTGAAC CGGCACGAGG ACGCCCGCGT CGCGCTGGAG
CACGCCGCTG CCCTGGCGCC CGGCGTCGCG GAAACGCAGA ACAATCTGGG GCTGGTCCTG
ATGGCGCTGG GTCATCTGCC GCAGGCCCGG ACCGCCCTGG AACGAGCCAG GATTCTGGCG
CCGGGCGATG CGCGGATCGC GGTCAATGCC GCGACCATTC TGGACGAACT GGCCGAGGGG
GATGCGGCCG AGGCCCTGTA CCGCGCCGTC CTGCGCGACC CCATCCTGGC GCGGGAGGCG
GAGGGCGCGC GGGCCCAGTT CAATCTGGGA ACATTGCTGC TGGCGCGTGG CGCGTACGCC
GAGGGATGGC GCCATCTGGA AGCCCGATCC CGCCTGCTGC CACCCATGCG CGGGCAGGGC
GTCGCGGAAT GGGACGGGGC GTCACTGCCG CGCGGGCGCG TGCTGCTATA TGCCGAGCAG
GGGCTGGGTG ACGCGATCCA GTTCCTGCGC TACCTGCCCG ACTGCCTGCG CCGCGCGTCC
GTTGTGCTGG ATGTGCCGCA CAGCCTGCAC CGGCTGTTGC AAACGATGCC CGATCCGGAC
GGGCAGATCG CGACGCGGTG CACCGTCCTG CCGCCAGGGG ACCCGCTGCC GGACGATGTG
GTGGCGCGCT GCGGCCTGAT CAGCCTGCCG CATCGGCTGG GCATGACCGA TATTCCGCCC
TTCGCGCCCT ATCTGCTGCC GGCACCCGCG CCCGACCTGG GGGAGAGGCC CCGGGTAGGG
TTGTGCTGGG CGGGCAATCC GTCCTTCCGC TTCGATCGAA GGCGGTCGAT CCCGGCGCAT
CGGCTGGCCC CACTGGCCGA CGTGCCGGGC CTGTTTTTCG TCAGCCTCCA GCACGGTCCG
GCCGCCGCCG CGCCGCCCTT CGCGCTGGAG CGGTCAGCGG AAGGCGACAT GCTGGACACC
GCCCGGATCG TCGCCGGACT GGATCTGGTG ATCACCGTCG ATACCGCCAT CGCCCATCTG
GCCGGCGCGA TGGGCAGACC AGTCTGGCTG CTGAACCGCT TCGGCGGCGA CTGGCGCTGG
TCCGCCACCT TCGACCGTGC CGAGCCCCCG CGCTGGGGCG ACCGGGGCAG CCGCTGGTAT
CCTTCTCTGG AACAGTTCCG CCAGCACCAG CCGGACGATC CCGACACCGC CTGGGCCGCG
CCGATCGAGG CCGTGCACGC GGCGTTGCTT CGCTGGCGGG TTGGTTTCGC CACGGGGCCT
CTTGCGGATA AATCCGCGTA G
 
Protein sequence
MDKRQAAPDG PDSAARAMAL LEAGDRDGAL ALLHAALSAR PACRDADVLH GMACVARAAG 
RPDLAIGLAG KAVALLPAAH FHITLGCALR EQGHVEEARA ALAVAVLREP RDARAHAALA
GALGELGRWA EAEASLRAAR ALCPGDMALL LEWARACIHG GDHAAATAEI VAGADRFAPD
HAGALHGLAT LLADRGQPAG AEALYRQIVR LLPDDGAAWA NHGAALFALN RHEDARVALE
HAAALAPGVA ETQNNLGLVL MALGHLPQAR TALERARILA PGDARIAVNA ATILDELAEG
DAAEALYRAV LRDPILAREA EGARAQFNLG TLLLARGAYA EGWRHLEARS RLLPPMRGQG
VAEWDGASLP RGRVLLYAEQ GLGDAIQFLR YLPDCLRRAS VVLDVPHSLH RLLQTMPDPD
GQIATRCTVL PPGDPLPDDV VARCGLISLP HRLGMTDIPP FAPYLLPAPA PDLGERPRVG
LCWAGNPSFR FDRRRSIPAH RLAPLADVPG LFFVSLQHGP AAAAPPFALE RSAEGDMLDT
ARIVAGLDLV ITVDTAIAHL AGAMGRPVWL LNRFGGDWRW SATFDRAEPP RWGDRGSRWY
PSLEQFRQHQ PDDPDTAWAA PIEAVHAALL RWRVGFATGP LADKSA