Gene Gdia_3478 details

Gene Information

Locus tag: Gdia_3478
Symbol:
ID: 6976930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3808810
End bp: 3810321
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 63%
IMG OID: 643392999
Product: flagellar hook-associated protein FlgK
Protein accession: YP_002277818
Protein GI: 209545589
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
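
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3808810
end_bp = 3810321
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1512 504 503
```

Under these assumptions the span matches the listed gene length, and the residue count matches the listed protein length.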


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.109948
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTGC TGTCATCGCT GTCAACGGCC ACCAGCGGAC TGAACGGGAT CGAGTATCAG 
TTGGGCGTGC TGTCCAATAA CGTCTCGAAC TCGAGCACCA CGGGATATGT CAGCGAGACG
GCCGAGGTAT CGTCGGCCGT GGCGGGTGGC GTCGGGGTCG GTATCAAGAT CGGCACGACG
CAACTGAGTG TGAACAAGGC CCTGATGGCC GCTCTTTACG GACAGAATGC GCAGGTGGCG
TCGTTGACGG CGACGAACAA CTCGCTGGCG GCGGTTTCAG ACATCCAGGG ATCGACATCG
GCGGATGCGG GAAGCACGAC CACGTTGGCC GACGAACTTG GCAATGTGCA AAGCGCCCTC
ACGACCCTGA CCTCGACCCC CACGAACAGC GCGTCGCAAT CGGCCGTGTA TTCCGCCGCG
CAATCCCTGA CCACCACCAT CCAGTCGCTT TCCTCGACCT ATACGGCCCA GCGCCAGGAC
GCCGAAAACA GCGTTGTCTC GACGGTATCC TCCGTCAATT ACGACCTGAC GCAGCTCGGG
CAGCTCTCGC AGCAGATCAT GAGCCTGCGG GCCAGCGGAG GAAGCACCGC CGACGTCGAG
AACCAGCGTC TGCAGGTGAT GTCGAGCCTG TCGTCCGAAC TTTCGGTCAC GTTTTCCGAG
ACGTCGACCG GCGACATGAT CGTCCGGACC GCCGATGGAA CCGAGCTTCC GACCCGTCCC
GACCAGATCG GGGAAAATGA CAGCACGGTC ACGCTGCCCA CCAGCACATG GCCGCTTTCG
ACATCCGGCA GCACCATTAC CCCATCGTCG TACTACCAGG CTGGAGATAC CAACTCGACG
ATCTCCGGGA TCATGCTGAA CGGCACGGAC ATCACCGCGC ATCTGACCGG TGGAACGCTC
GGAGCGAACA TCACGCTGCG CGACAGCACG TATCCGACGA TGCAGGCCCA GCTGGACTCG
TTTTCTTCCA CGCTCGCGAC CCGGTTTTCC GATGCCGGGC TTTCGCTGTT CACCGATGGT
ACGGGGGCTG TTCCGGCAAC GGACCCGACG GCAGAGACGC CCAGCGGCAT CGTCGGCCTG
TCGTCGGTGA TCAGCGTGGA TACGTCCGCG CCGCTGACGA CGGACGGCGA TACGTCGACG
ATTACGGCGG TCCTCAGCAC GGCTTTCGGA ACCGCTTCGA CGGATGTGAG CGGTTCGCTT
GAAGCGCCGT CAAGCGGCCT TGGACCGGAG GGCAATCTGT CGACCGGATA TTCGGGCACC
CAGGGACTGG TGGCCCTTGC CACGTCCCTG ACCTCGGCCC AGGGCGCGGT CATCGGCGAC
GCCACCGACG ATCTGACGTC CGCTACCTCG GTGCAGACAA CGTTGCAGAC GTCTGTCGCC
AACGTGTCCG GCGTAAACGT GGACGATCAG ATGTCGACGG TCGTCGCGCT GCAGAACGCC
TACGCGGCCA ATGCGAAAGT GGTGACCGCG GTGCAGACGA TGTTCACCGC GCTTCTCGAC
GCGATCCAAT AG
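
The sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to do that in Python and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene is used as sample input. The 63% reported in the record refers to the full gene, so a single line is not expected to reproduce it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACCTGC TGTCATCGCT GTCAACGGCC ACCAGCGGAC TGAACGGGAT CGAGTATCAG"
seq = clean_sequence(first_line)

print(len(seq), round(gc_percent(seq), 1))
```

Pasting the complete gene sequence in place of `first_line` would allow a comparison with the GC content and gene length listed in the record.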
 
Protein sequence
MDLLSSLSTA TSGLNGIEYQ LGVLSNNVSN SSTTGYVSET AEVSSAVAGG VGVGIKIGTT 
QLSVNKALMA ALYGQNAQVA SLTATNNSLA AVSDIQGSTS ADAGSTTTLA DELGNVQSAL
TTLTSTPTNS ASQSAVYSAA QSLTTTIQSL SSTYTAQRQD AENSVVSTVS SVNYDLTQLG
QLSQQIMSLR ASGGSTADVE NQRLQVMSSL SSELSVTFSE TSTGDMIVRT ADGTELPTRP
DQIGENDSTV TLPTSTWPLS TSGSTITPSS YYQAGDTNST ISGIMLNGTD ITAHLTGGTL
GANITLRDST YPTMQAQLDS FSSTLATRFS DAGLSLFTDG TGAVPATDPT AETPSGIVGL
SSVISVDTSA PLTTDGDTST ITAVLSTAFG TASTDVSGSL EAPSSGLGPE GNLSTGYSGT
QGLVALATSL TSAQGAVIGD ATDDLTSATS VQTTLQTSVA NVSGVNVDDQ MSTVVALQNA
YAANAKVVTA VQTMFTALLD AIQ
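
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal Python example rather than the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted; since this gene begins with ATG, no special start handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGACCTGC TGTCATCGCT GTCAACGGCC ACCAGCGGAC TGAACGGGAT CGAGTATCAG"
last_line = "GCGATCCAAT AG"

print(translate(first_line))  # MDLLSSLSTATSGLNGIEYQ
print(translate(last_line))   # AIQ
```

The two outputs correspond to the first twenty and the last three residues of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, so every line starts on a codon boundary and can be translated on its own.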