Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0304 |
Symbol | |
ID | 6973696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 336501 |
End bp | 338441 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643389835 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_002274716 |
Protein GI | 209542487 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.860081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.111747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAAC GGCAGGCGGC CCCCGACGGG CCGGACAGCG CGGCACGGGC CATGGCGCTG CTTGAAGCCG GGGACCGCGA CGGGGCGCTG GCGCTGCTTC ACGCCGCCCT GTCCGCGCGG CCCGCGTGCC GGGATGCCGA CGTCCTGCAC GGCATGGCCT GCGTCGCCCG TGCCGCCGGG CGGCCGGACC TTGCGATCGG GCTGGCGGGC AAGGCCGTCG CGCTGCTGCC GGCGGCGCAT TTTCACATCA CCCTGGGCTG CGCCCTGCGC GAGCAGGGCC ATGTGGAGGA GGCCCGCGCC GCGCTGGCGG TGGCCGTACT GCGCGAACCG CGCGACGCGC GCGCCCACGC CGCCCTGGCC GGCGCGCTGG GTGAACTGGG CCGCTGGGCG GAGGCCGAGG CCAGCCTGCG CGCGGCGCGG GCCCTGTGTC CCGGTGACAT GGCCCTGCTA CTGGAATGGG CCCGCGCGTG CATCCATGGC GGGGACCACG CCGCGGCGAC GGCGGAGATC GTGGCCGGGG CGGACCGCTT TGCGCCCGAC CATGCCGGCG CCCTGCACGG GCTGGCCACC CTACTGGCGG ATCGGGGCCA GCCCGCGGGG GCCGAGGCGC TGTATCGGCA GATCGTCCGG CTTCTCCCCG ATGACGGGGC GGCCTGGGCC AATCATGGCG CCGCGCTGTT CGCGCTGAAC CGGCACGAGG ACGCCCGCGT CGCGCTGGAG CACGCCGCTG CCCTGGCGCC CGGCGTCGCG GAAACGCAGA ACAATCTGGG GCTGGTCCTG ATGGCGCTGG GTCATCTGCC GCAGGCCCGG ACCGCCCTGG AACGAGCCAG GATTCTGGCG CCGGGCGATG CGCGGATCGC GGTCAATGCC GCGACCATTC TGGACGAACT GGCCGAGGGG GATGCGGCCG AGGCCCTGTA CCGCGCCGTC CTGCGCGACC CCATCCTGGC GCGGGAGGCG GAGGGCGCGC GGGCCCAGTT CAATCTGGGA ACATTGCTGC TGGCGCGTGG CGCGTACGCC GAGGGATGGC GCCATCTGGA AGCCCGATCC CGCCTGCTGC CACCCATGCG CGGGCAGGGC GTCGCGGAAT GGGACGGGGC GTCACTGCCG CGCGGGCGCG TGCTGCTATA TGCCGAGCAG GGGCTGGGTG ACGCGATCCA GTTCCTGCGC TACCTGCCCG ACTGCCTGCG CCGCGCGTCC GTTGTGCTGG ATGTGCCGCA CAGCCTGCAC CGGCTGTTGC AAACGATGCC CGATCCGGAC GGGCAGATCG CGACGCGGTG CACCGTCCTG CCGCCAGGGG ACCCGCTGCC GGACGATGTG GTGGCGCGCT GCGGCCTGAT CAGCCTGCCG CATCGGCTGG GCATGACCGA TATTCCGCCC TTCGCGCCCT ATCTGCTGCC GGCACCCGCG CCCGACCTGG GGGAGAGGCC CCGGGTAGGG TTGTGCTGGG CGGGCAATCC GTCCTTCCGC TTCGATCGAA GGCGGTCGAT CCCGGCGCAT CGGCTGGCCC CACTGGCCGA CGTGCCGGGC CTGTTTTTCG TCAGCCTCCA GCACGGTCCG GCCGCCGCCG CGCCGCCCTT CGCGCTGGAG CGGTCAGCGG AAGGCGACAT GCTGGACACC GCCCGGATCG TCGCCGGACT GGATCTGGTG ATCACCGTCG ATACCGCCAT CGCCCATCTG GCCGGCGCGA TGGGCAGACC AGTCTGGCTG CTGAACCGCT TCGGCGGCGA CTGGCGCTGG TCCGCCACCT TCGACCGTGC CGAGCCCCCG CGCTGGGGCG ACCGGGGCAG CCGCTGGTAT CCTTCTCTGG AACAGTTCCG CCAGCACCAG CCGGACGATC CCGACACCGC CTGGGCCGCG CCGATCGAGG CCGTGCACGC GGCGTTGCTT CGCTGGCGGG TTGGTTTCGC CACGGGGCCT CTTGCGGATA AATCCGCGTA G
|
Protein sequence | MDKRQAAPDG PDSAARAMAL LEAGDRDGAL ALLHAALSAR PACRDADVLH GMACVARAAG RPDLAIGLAG KAVALLPAAH FHITLGCALR EQGHVEEARA ALAVAVLREP RDARAHAALA GALGELGRWA EAEASLRAAR ALCPGDMALL LEWARACIHG GDHAAATAEI VAGADRFAPD HAGALHGLAT LLADRGQPAG AEALYRQIVR LLPDDGAAWA NHGAALFALN RHEDARVALE HAAALAPGVA ETQNNLGLVL MALGHLPQAR TALERARILA PGDARIAVNA ATILDELAEG DAAEALYRAV LRDPILAREA EGARAQFNLG TLLLARGAYA EGWRHLEARS RLLPPMRGQG VAEWDGASLP RGRVLLYAEQ GLGDAIQFLR YLPDCLRRAS VVLDVPHSLH RLLQTMPDPD GQIATRCTVL PPGDPLPDDV VARCGLISLP HRLGMTDIPP FAPYLLPAPA PDLGERPRVG LCWAGNPSFR FDRRRSIPAH RLAPLADVPG LFFVSLQHGP AAAAPPFALE RSAEGDMLDT ARIVAGLDLV ITVDTAIAHL AGAMGRPVWL LNRFGGDWRW SATFDRAEPP RWGDRGSRWY PSLEQFRQHQ PDDPDTAWAA PIEAVHAALL RWRVGFATGP LADKSA
|
| |